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TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA QCTGCTGGGC CCACTGGCCG 3000 

TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGQATCT CTGCQTTTTT 3060 

ATACTGAGTQ TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



A1&5 Protein sequence; 
Gene name: 
Unigene number: 
Probeset Acceasion #x 
Protein Accession ft : 
Signal sequence = 
TranBTnembrane domain: 

Cellular localization; 



Cadherin 3, P-cadheriu (placental) 

Hs.2877 

X63G29 

CAA4S177 

1-24 

6S9-675 

pi a e ma membrane 



1 11 21 31 41 SI 

I I 1 I I 1 

MGEPRGPIiAS U^LLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG €0 

QEPABFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR IIKRHKRDWV VAPISVFENG 120 

KGPFFQRLNQ LK8NKDRDTK I FYS I TG PGA DSPPEGVFAV BKffTGWLLLN KPLDREEIAK 180 

YELFGHAV6E KGASVEDPMN JSIIVTDQKD HKPKFTQDTF RGSVLEGVLF GTSVMQVTAT 240 

DEDOA1YTYN GVVAYSIHSQ EPKDPHDLMF riHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

1DMDGDGSTT TAVAWE I3jD ANDHAPMPDP QKYEAHVPBN AVGHEVQRI/T VTDUJAPNSP 360 

AHRATYMMG GDDGEHFTIT THPESNQGIL TTRKGLOFEA KNQHTI»YVEV TNEAPFVLIdi 420 

PTSTATIWH VEDVNEAPVF VPPSKWBVQ EGIPTGEPVC VYTAEDPDKE NQKI6YRILR 480 

DPAGWI*AMDP DSQQVTAVOT liDREDEQFVR NHIYEVMVLA MDNfGSPPTTG TBTLIjIsTLID 540 

VNDHGPVPBP RQIT1CNQSP VRHVLNflTDft BIjGPHT&PFQ AQLTDDSDIY WTAEVHEEGD 600 

TWL SLiKKFli KQDTYDVHLS LSDHGNKEQIj TVIRATVCDC HGHVETCPQP WKQGFIliPVL. 660 

QAVIiAIJjlFIjXj LVLLLLVRKK RKIKEPLIfliP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 

GliEARPEVVL RNDVAPTIIF TPWYRPRPAN PDEIGHFIIE NLKftANTDPT APPYDTIiIiVF 780 
DYEGBQSDAA ShSSVT SShS DQDQDYDYLN BWGSRFKKLA DMYGGGEDD 



A146 PNA SEQUENCE; 
Gene name: 
Unigene number: 
Probeset: Accession #i 
Nucleic Acid Accession # t 
Coding sequence! 



TTK protein 7tinfi.Be 

He. 169840 

M86699 

NM_003318 

1026-3551 {-underlined 



sequences correspond to 



start and stop codons} 



1 11 21 31 41 SI 

111(11 

GGAATTCCTT TTTTTTTTTT TTTGAGATGG AGXTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAATGGCACA ATCTCAGCTT ACTGCAACCT CCGCCTCCCG GGTTCAAGCG ATTCTCCTGC 120 

CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACEAAXIX 180 

CTTTTCTATT TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGGTCT TGAACTCCTG 240 

ACCTCAGGTG ATCCACTTGC CTTGGCCTCC CAAAGTGCTA GGATTACAGC CGTGAAACTG 300 

TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC TTGGTCGCCC 360 

AGOCTGGAGT GCAGTGGTGC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 

GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACOCGG 480 

CTAATTTTTG TATTTTTATT AGAGAOAGGG TTTCACCATG TTGGCCAGGC TGTTCTCAAA 540 

CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600 
AGTCRCCATG CCTGACCTTA rAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 
GG1CCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TXAACTTCAT GATTCACAAA 720 
AQGAAGATGA AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTOCT 780 
GGATGATGAT CCTAAATCCA GATACAGTAA AAATGOGGTA TGGOAAGGTA GAATACAAAA 840 
TTTGQTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGGATACATT GTTGATOTGA 900 
ATGTAAGACT GTACAGACTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 

TGCAGTTTTC TGTAOAAATG QAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTQATT 1020 

CCAT AATGA A CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 10 B0 

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGGAACT GTTAACCAAA 1140 

TTATGATOAT GGCAAACAAC CCAGAGGACT GGTTGAGTTT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TQBTCGTTAC AGTCAAGCAA 1260 

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440 

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTOTAGAA CGTQOAGCAG 1500 

TACCACTAGA AATGCTGGAA ATTGOCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1S60 

TTTCAGAGGA GQAAAAGAAG AATTTATCAG CATCTACGGT ATTAACTOCC CAAQAATCAT 1620 

XrTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTCC AGAGGACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAQATGCA GAAATAGGTT 1740 

ACCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC IB 00 

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTIX3TACCTT I860 

GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920 

AACCAAGTGG AAATGATTCC TOTQAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 

TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAACGGAA TCAAGTCTTC TAOCTAAATT AGAAGAAACT AAAGAGTATC 2100 

AAGAACCAGA GGTT CCAGAG AGTAACCAGA AACAGTGGCA AGCTAAGAGA AAGTCAGAGT 2160 

GTATTAACCA GAAICCTGCT GCATCTTCAA ATCACTGGCA GATTCCGGAG TTAGCCCGAA 2220 

AAGXTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AGTCACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCXATTTGT AAGACACCAA 23 40 



900 



WO 03/042661 



PCT/US02/36810 



GCAGCAATAC 
TTCCACCTGC 
AGCATCAAAT 
AlGAATGCAT 
CTTCAAGCAA 
TGAACTTAGA 
TGAATAAACT 
ACCAGTACAT 
AGAAAAAATC 
TTCACACAAT 
taj3ttgatgg 
CAACAAGTGT 
TCAAAGATAT 
GTGATGTTTG 
AGCAGATAAT 
AATTTCCCGA 
ACCCAAAACA 
CTCATCCAGT 
AACTTGTTGG 
ACTATAGTGG 
BAAAAAAATG 
GTTATACTCT 
ATCAQCAAAA 
TTATGGCACT 
TTQACATCAT 
TAAATAAAGT 



CTTGGATGAT 
1 TQTCAGTTG 
ACTTGCCACT 
TTCGQTTAAA 
GGTATTTCAG 
AGAAGCAGAT 
ACAACAACAC 
CTACATGGTA 
CATTGATCCA 
CCATCAACAT 
AATGCTAAAG 
TGTTAAAGAT 
GTCTTCCTCC 
GTCCTTAGGA 
TAATCAGATT 
TATTCCAGAG 
GAGGATATCC 
TAAOCAAATG 
TCTGAATTCT 
TGGTGAAAGT 
_ATTTGCAGTT 
TOAATCCCTG 
AAAATTCAGT 
GTATATATTG 
TTTACTCTIG 



TACATGAGCT 

TCAACACCTI 

CCACTTCAAA 

GGAAGAATTT 

GTGTTAAATQ 

AACCAAACTC 

AGTQATAAGA 

ATGGAGTGT6 

TOGGAACGCA 

GGCATTGTTC 

CTAATTGATT 

TCTCAGGTTG 

AGAGAGAATG 

TGTATTTTGT 

TCTAAATTAC 
AAAGATCTTC 

ATTCCTGAGC 
GCCAAQGGAA 
CCTAACTCCA 
CATAATTCTT 
ATTCGTAATG 
TGGAAATCTA 
GAOATTATCT 
TAGACTTGTT 
GAATAGTQGG 
AAATGA 



GTTTTAGAAC 
ATGGCCAACC 
ATTTACAGGT 
ATTCCATATT 
AAAAGAAACA 
TTGATAGTTA 
TCATCCGACT 
GAAATATTGA 
AGAGTTACTG 
ACAGTGATCT 
TTGGGATTGC 
QCACAGTTAA 
GGAAATCTAA 
ACTATATGAC 
ATGCCATAAT 
AAGATGTGTT 
TCCTGGCTCA 
CCACTGAAGA 
TTTTGAAAGC 
CATCCTCCAA 
TCAGATAGGA 
CATTXOAAQA 
TTAAAAGAAA 
TTCTCTGTTT 
TQOATAGCAA 



TCCAGTTGTA 
1GCCTGTTTC 
TTTAGCATCX 
AAAGGAGATA 
GATATATGCT 
CCGGAACGAA 
TTATGATTAT 
TCTTAATAGT 
GAAAAATATG 
TAAACOW3CT 
AAAGCAAATG 
TTATATGCCA 
GTCAAAGATA 
TTACGGGAAA 
TGATCCTAAT 
AAAGTGTTGT 
TCCATATGTT 
AATGAAATAT 
TGCTAAAACT 
GACTTTTGAA 
GGTATAAAAT 
CAACATCACT 
ACTGTAAAAA 
TATGCTCTTG 
GTATATTCTA 



AAGAATGACT 
CAGCAGCAAC 
rCTTCAOCAA 
GGAAGTGGAG 
ATAAAATATG 
ATAGCTTATT 
GAAATCACGQ 
TGGCTTAAAA 
TTAGAGGCAG 
AACTTTCTBA 
CAACCAGATA 
CCAGAAGCAA 
AGCCCCAAAA 
ACACCATTTC 
CATGAAATTQ 
TTAAAAAGGG 
CAAATTCAAA 
GTTCTGGGCC 
TTATATOAAC 
AAAAAAAGGG 
ATATTGGACT 
CTGAAGTGTT 
TAGCAAOCAC 
TGTAATCTAC 
AAAAACTTTG 



2400 
2460 
2520 
2560 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
30.20 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3 660 
3720 
3780 
3840 



A147 Protein sequence? 



Unicjene number t 
Probeset Accession ft: 
Protein Accession #s 
Signal sequence: 
Transmembrane domains: 
protein Kinase Domains 
Cellular Iiocalization: 



TTK protein kinase 

He. 169840 

M86699 
NF_003309 
none found 
none found 
510-775 

cytoplasmic and nuclear 



VPL8DALLNK 
ANCKKFAFVH 
EEEKKNLSAS 
NSliRQTNKTK 



pprarsRWFD 

QIIATPLQKIi 
LEEADNQTLD 



SW7D9QVGT 
IINQISKLHA 
FVNQMA&GTT 
K 



11 
1 

FKHSDLTDEL 
LIGRYSQAIE 
ISFAQFELSQ 
TVLTAQESPS 
QSCPFGRVPV 
LKSVQN9HPJC 
WQAKRKSECI 
PKBICKTPS3 
QVT ASS SANE 
SYRiJElAYLK 
YWXNMLEAVH 
VNYMPPEAIK 
IIDPWHEIBF 
EEMKYVUSQIi 



21 
1 

JSXjHKISADTT 
ALPPDKYGQN 
GNVKKSKQLL. 
GSLGKLQNRN 
NLLNSPDCDV 



UQKPAA33NH 
HTLBDYWSCF 
CISVXQRIYS 
KUQQHSDKI I 
TXHQHGZVHS 



PDXPEKDIiQD 
VOIMSPNSIIj 



31 


41 


51 




1 

DNSGTVKQIM 


1 

MMANNPEDWI* 


1 

SLLIiKLEKNB 


60 


ESFARIQVRF 


AKLKAIQRPD 


DARDYFQMAR 


120 


QKAVERGAVP 


LEHLEIALRU 


LNLOKKQpCjIiS 


180 


NSCDSRGQTT 


KARPLYGENM 


PPQDAEIGYR 


240 


KTDDSWPCF 


MKKQTSRSEC 


RDLWPGSKP 


300 


EL1ITDSITL 


KNKTESSLIiA 


KUBETKEYQE 


360 


WQ1PSLARKV 


NTEQKHTTFB 


QPVF8VSKQS 


420 


RTPWKHDFP 


PACQLSTPYG 


QPACPQQQQH 


480 


ILKQIGSGGS 


SKVFQVUHKK 


KQIYAIKYVN 


540 


RLYDYEITDQ 


YIYMVKECGPT 


IDLttSKLKKK 


600 


DUCPANFLIV 


DGMLKIiIDFG 


IANOMQPDTT 


660 


SKSKISPKSD 


VWeiiGdLYY 


MTYGKTPPQQ 


720 


VLKCCLKRDP 


KORISIPSLIf 


AHPYVQIQTH 


7BO 


KAAKTLYEHY 


sggeshu&bs 


SKTFEKKRGK 


840 



A14B PKA SEQUENCE 
Gene name: 
Unigene number: 
Probeset Acceegion #: 
Mucleic Acid Accession fti 
Coding sequence z 



endothelial differentiation, 1 y s opnouplia t i di c acid G-protein-coupled receptor, 7 
He. 258583 
HM_012152 
NM_012152 

43-1104 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I 1 I 1 I 

CTTCTTTAAA TTTCTTTCTA GGAKJTTCAC ITCTTCTCCA C AATGA ATGA GTGTCACXAT 60 

GACAAGCACA TGQACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120 

GGAACAAAGC TTGTGATTGT 1TTQTGTGTT GGGACGTTTT TCTGOCTGTT TASTTTTTTT 180 

TCIAATTCTC TGGTCATOGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TOTATTCCTG 300 

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTOACTGTCA ACCCCTGGTT TCTCCGTCAG 360 

DGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTXAT CGCCGTGGAG 420 

AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480 

CTGCTCATTT TQCTTGTCTG GGCCATCGCC ATTTTTATGQ GGGCGGTCCC CACACTGGGC 540 

TGGAATTGOC TCTGCAACAT CTCTGCCTGC TCTTOCCTGG CCCCCATTTA CAGCAGGAQT 6O0 

TACCTTGTTT TCTGOACAGT GTCCAACCTC ATGGCCTTCC TCATCATQOT TGTGGTGTAC 660 

CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGIGGG 720 

TCCATCAGCC GC20BOAGGAC ACCXZATGAAG CTAATOAAGA OGGTGATGAC TGTCTTAGGG 780 

GCSTTTGTOa TATGCTGGAC CCCGGGCCTG GTQGTTCTGC TCCTCQACGG CCTGAACTGC 840 

AGGCAGTGTG GCGTGCAOCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAOOACA TGTATGGCAC CATQAAGAAG 960 

ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAOAGGCGTC CCTCTOQCAT CCCCtCCACA 1020 

GTCCTCAQCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGOTGCA ID 80 



901 



WO 03/042661 



PCT/US02/36810 



GTCTGCAATA AAAGCACTTC CTAAACTCTQ GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 



A14 9 Protein sequence ? 



Unigene number: 
Probe set Accession It: 
Protein Accession #* 
Signal sequence: 
Transmembrane domains; 
Cellular Localization: 



endothelial differentiation, lysophosphatidic acid G -protein -coupled receptor, 

HB. 258583 

NM_0 12152 

NP_036284 

none found 

31-53, 66-66, 150-172, 190-211, 239-261, 277-295 
plasma membrane 



1 11 21 31 41 51 

1 I I I I I 

MNECHYDKHM DFFYNRSNTD TVBDWTGTKL VIVLCVGTFF CLFIFFSNSL VIAAVIKNRK 60 
FHFFFYJTLIiA NLAAAt>FFAG IAYVPLMFNT GPVSKTLXVN KWFLHQGLIiD SSLTASLTMfL 120 
LVIAVSRHHS IMRMRVHSNL TKKRVTLLIIi IiVWAIAJFMG AVPTLGHNCL CN I SACS SLA 160 
PIYSRSYLVF HTVSHliMAFIi I1WWYLRIY VYVKRKXNVL SPHTSG8TSR RRTPMKLMKT 240 
VMTVIiGAFW CWTPGLVVLL LDGLNCRQCa VQHVKRWFLL IALLNSWHF IIYSYKDEJDK 300 
YGTMKKMICC FSQENPEREP SRIP9TVLSR SDTGSQYIKD SISQGAVCNK STS 



Prostate 

A150 PMA SEQUENCE 

Gene name: ESTs 

Unigene number; Kb, 293616 

Prgbeset Accession #: AW043782 

Nucleic Acid Accession #• none found 

Coding sequence* 38-1075 (underlined sequence corresponds to start and stop cod on) 



1 11 21 31 41 51 

15 1)11 

AGCAACGACG CCGGGCAGCG GGAGCGGOGG CCGCGCCATG TGGCTGCTGG GGCCG.CTGT5 60 

CCTGCTC3CTG AGCAGCGCCG CGQAGAGCCA GCTGCTCCCC GGGAACAACT TCACCAATGA 120 

GTGCAACATA CCAGGCAACT TCATGTGCAG CAATGGACGG TGCATCCCGG GCGCCTGGCA 1B0 

GTGXQACGGG CTGCCXGACT GCTTOGACAA GAGTGATGAG AAGGAGTGCC CCAAGGCEAA 240 

GTCGAAATGT GaCCCAACCT TCTICCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTCG 300 

CTTCCGGTGC AATGGGTTTG AGGACTGTCC CGATGGCAGC GATGAAGAGA ACIGCACAGC 360 

AAACCCXCTa CTTTGCTCCA CCGCCCGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 420 

GAQCTTCATC TGCGATGGAC AGAATAACTG TCAAGACAAC AOTGATGAGG AAAGCTGTGA 4 SO 

AAGTTCTCAA GAACCOGGCA GTGGGCAGGT GTTTGTGACT TCAGAGAACC AACTTGTGTA 540 

TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCCGTC ATTTTTGTGC TGGTGGTGGC 600 

CCTGCTGGCA CTGGTCTTGC ACCACCAGCG GAAGCGGAAC AACCTCATGA CGCTGCCOGT 660 

GCACCGGCTG CAGCACCCTG TGCTGCTGTC CCGCCTGGTG QTCCTGGACC ACCCCCACCA 720 

CTGCAACGTC ACCTACAACG TCAATAATGG CATCCAGTAT GTGGCCAGCC AGGOGGAGCA 780 

GAATGCGTCG GAAGTAGGCT CCCCACCCTC CTACTCCGAG GCCTTGCTGG ACCAGAGGCC 840 

TG CGTGGTAT GACCTTCCTC CAO0GCCCTA CTCTTCTGAC ACGGAATCTC TGAACCAAGC 900 

CGACCTGCCC CCCTACCGCT (XCGGTCCGG GAGTGCCAAC AGTOCCAGCT CCCAGGCAGC 960 

CAGCAGCCTC CTOAGCGTGG AAGACACCAG CCACAGCCCG GGGCAGGCTQ qcCCCCAGGA 1020 

GGGCACTGCT GAGCCCAGGG ACTCTGAGCC CAGCCAGGGC ACTGAAGAAG T ATAAG TCCC 1080 

AGTTATTCCA AAGTCCATAT GGGTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

TGCXCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GCATTCCCCT CCTOCCCCAG ACTTCAGAGA TGTTTTTCTG GOGTCtCAGT 1260 

TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT TCCCTTGGGA CCCGAGATCA 1320 

C2ACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAQA CAGCATATAA AACAGTATTG 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGX GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440 

CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTAXAG CATTTGGGGA TTTGGGTTAG 1500 

ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCCCCC CAAAAAAATT CCATTTGAQC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGCCCCCAG tXCAGCAGAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTTAITT TGGTTTTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740 

CTCATTCTGA GAGCTTTOCT CAGCAGCAtA TATCATCAGC CTCATCCTAA AATAGGCAGG 1800 

GAGCCCCTOC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GGCAAGACCC I860 

TACACCTGCC CTGGCTCXAC AGCCACTTAC CTGGTTTCTG GACTGXCACC CTCCCAGCTG 1920 

ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980 

GTATGTCCCT GTGOCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

CTCCAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTAOCT GTGCATTTGG ACTTGAGGAC 20,00 

ACTGGTTTCT ATCACAGQTG AGAGCCATGT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160 

CTGCACTGTG CACGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220 

GGTCAGGOTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTQAGACA CATGGGCAAG 2280 

AGACAATTXG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340 

TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC TAXTTATATG TGTATAGGAA 2400 

AGCTGTCTCT TlTTTTGTTT TTCCTXTAAC AAGGXCCAAA GAAAGATGCA AAAGGAOATC 2460 

ACACCCTTGC CCCGCTGAGC CCCGTGATAA CAAQTCACTC CAGACXAACC TGTGTGCCAG 2520 

ACATTTGTGC ATTGTTGCAC TTTGAGGTTA TTATTTATCA AGTTCTTGAA GGAAGCAGAA 2580 

AGAGGGACTC CTCTCTCCCT CCGTGTATAQ TCTCTATGTT TGTGCXAGTT TTTCTTTTTT 2640 

TXCTCTGTGT GCAGTCAGCC ACAGGGCCCG CCTCCCXGCA GGAATAAGGG GTAAAACGTT 2700 

AGGTGTTGTT TGGCAAGAAA CCACACTGAC TGATGAGGGG TAAAATGGAA CCAGGTAGAG 2760 

CCACTCCGGG CAGCTGTCAC CCATTCAGAA CTTCTTTCCG CAGCTGAAGA AATGTTCAGT 2820 



902 
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10 



15 



20 



AACCTGTTTG 
TGATCCTGTT 
A AG CI' AG C T_A 
AACGGAAAGG 
CTGAGTAATC 
CATTTCATCT 
TTTCTGGTGC 
GAGTTAATCT 
AGATAAGGGA 
TG AT TTTTT T 
TTTTTGGGGG 
TTTTT AACTC 
AAAATAGTCT 
AGTTACACTG 
TTTTT AAAAA 
GAAAGGTTGT 
TTATACTTTC 
AAMNIAAAAAC 
TGGQGCGGCG 
GAAACCCTGG 
GGAAACCGCA 
GGACTCAACC 



AOGCTAATTA 
CTGTAGACTT 
CTGGTATTTT 
AACCTAGCTG 
CAATAAAGAA 
CCTGTGAGTC 
TCTGGAAGTT 
CACTCGCTTT 
TGCCTACTAA 
AATGAATGTT 
GAGQGGQTTT 
ATTCCAACCA 
CATCTCTTTT 
TGATGACTGG 
AT G CAACTAA 
GTGTCGTTGC 
TAATAAATTT 
AHYWTTGGGG 
GGGCCCADGT 
CCAAGAAAAA 
GAGTGTTGCG 
AGGAGGACCC 



AAACAGAGCC 



GTTTTGTTTA 
CCTGTATCTT 
CTTTTGATGA 
AGAAGGGCTT 
GTT T AG AGGA 
TCTGCTTCCA 
TGCTTTTTTA 
TTTAAAAATA 
OTTTTCCAAC 
GGAAGCTTTT 
TTTCTCAAAT 
CCTATTACCT 
GTGGTTAATA 
TTTTTGTGTT 
GCAGTTTCAT 
GGGCTTGGGC 
AGGTAOGGCG 
GGTGGCGAGA 
TAAAC CACAC 
AAGGGAACCC 



TGCAGGAAGT 
TTTTAACCAA 
AAAAAAAAAA 
TCATTTTTAA 
CAGCCAGAAT 
TATTTCTCCC 
AAGAATTCTA 
GGCATCTTAG 
AAACAAACAG 
TATAAATAGG 
TCAAGATGGC 
TTATACATTG 
GAGATCCGTG 
QACTCAGCTC 
GTGTGTGACG 



GGGGCTAAAG 
ATCCAAAGGA 
GAAAGAAAGA 
AATAGCRCTT 
GTGTTAGAAC 
TTTGATGGGG 
ATTT T AATT A 
GAAAAACAAA 
GGACATTTTT 
ACACCAAAGC 
ACATTAGTGG 

cctaaatcta 

TTTTATTTTA 
CCTCTACCTT 
CTCAAAGTTA 



TGTGCAAAWG 
CTGQQAAAAA GTTTTTAACA 
ACCACGCGGG CCCAAACGGG 
ATTCTCCACA CCAOAAAAAA 
CCGAAGAGAG AACTCAGAAG 
GATAGAGTAC G 



TGGCATTCAG 
TGTTACAGAA 
AAGAAAGAAA 
GAGTTATTT T 
TCTQGCTGAA 



A7TGCGCAGT 
TGGTTTTAGT 
ATTATAGATT 
GGCAGGGTTT 
CCAGCAATAT 
CGCCAACCAG 
GCATTAAATT 
GAAATTGACA 
ATGTAAACTG 
TTTTTAATTT 
GWMCTAMARH 
CCACTTCGGG 
ACCCCAGAAG 
ACGCGCCGGG 
CACACAAGCG 



2880 
2910 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
34BO 
3540 
360O 
3660 
3720 
3780 
3840 
3900 
3 960 
4020 
4080 



25 
30 
35 
40 
45 
50 



A151 Protein sequence; 

Gone name: 

tin! gene number: 

Probe set Accession #i 

Protein Accession ft; 

Signal sequence * 

T ranfl me mbr ane domains 7 

r.DLa domains: 

Cellular Local! cat ions 



Hs. 293616 
AW043782 
none found 
1-17 
1S9-191 

28-66, 70-1DB, 112-149 
plasma membrane 



11 
I 



21 
I 



31 
I 



41 
1 



51 
I 



KWLLGPLCLIj LSSAAESQliIi PGKNFTNECN IPGNFMCSNG RCIPGAWQCD GLPDCFDKSD 60 

EKECPXAKSK OGPTFFPCAS GIHCIIGRFH CNGFEDCPDQ SDEENCXAHP IdbCSTARYHC 120 

KNGLCIDKSF ICDGQNNCQD H9DEESCESS OEPGSGQVFV TSENQLVVYP &2TYAXIG5S 180 

VIFVLWAXjEj AlfVLHHQRlGR. NNlMTIiPVHR IjQHFVIiIiSRL VVLDHPHECN VTW9VMSG1Q 240 

YVASQABQ1J1A 6EVGSPP3YS EAliLDQRPAW YULPPPPYSS DTBSLNQADL PFYRSRSGSA 30 0 
NSASSQAASS LLSVEDTSHfi PGQPGPQBGT AEPRDSEP3Q GTEEV 



A152 UNA SEQUENCE 
Gene name: 
Unigene number: 
Probes* t Accession #: 
Nucleic Acid Accession 



Transmembrane protease, 
Hs. 105807 
T48536 
NM 005656.1 



serine 2 (TMPRSS2) 



Coding sequence = 57-1535 (underlined sequences correspond to start and stop codona) 



55 
60 
65 
70 
75 
80 



i 
I 

GTCATATTGA 
CTTTGAACTC 
CGGAAAACCC 
CTCAGTACTA 
ACCC0GTCGT 
AGAAAGCACT 
CTGGCCTACT 
CCTCAGGTAC 
GGGAGGAjOGA 
CATCTCAGAG 
GGGCGGCCTG 
ATGACAGCGG 
ATAAAAAACT 
TAGCCTGCGG 
CQCTCCCGGG 
GAGGCTCCAT 
TTAACAATCC 
ATGGAGCOGG 



AACCAGTGTG 
CCGGGTGGGG 
TGCTTCTCAT 
CAOCCATGAT 
GAGGGCCTCT 
GTTCTGGCra 
ACTQGATTTA 
CGTTTTACAA 
ATGATTCAGA 
TGGCATACTG 



11 

1 

ACATTCCAGA 
AGGGTCACCA 
CTATCCCGCA 
CCCGTCCCCC 
CTGCACGCAG 
GTGCATCACC 
CTGGAAGTTC 
CTGCATCAAC 
GAATCGGTGT 
GAAGTCCTGG 
CAGGGACATG 
ATCCACCAGC 
GTACCACAGT 
GGTCAACTTG 
GGDCTGGCCC 
CATCACCCCC 
ATGGCATTGG 
ATACCAAGTA 
CATTGCGCTG 
TCTGCCCAAC 
GGCCACCGAG 
TGAGACACAG 
CTGTGCCGGC 
GGTCACTTCG 
TGCCAAAGCT 
TGGACAAATG 
GAAAACAATG 
GGTCACrrCA 
TGCAGGCTGC 



21 
I 

TACCTAT CAT 
CCAGCTATTG 
CAGCCCACTG 
GTGCCCCAGT 
CCCAAATCCC 
TTGACCCTGG 
ATGGGCAGCA 
CCCTCTAACr 
GTTCGCCTCT 
CACCCTGTGT 
GGCTATAAGA 
TTTATQAAAC 
GATGCCTGTT 
AACTCAAGCC 
TGGCAGGTCA 
GAGTGGATCG 
AOGGCATTTG 
CAAAAAGTGA 
ATGAAGCTGC 
CCAGGCATGA 
GAGAAAGGGA 
AGATGCAACA 
TTCCTGCAGG 
AACAACAATA 
TACAGAOCAa 
AAGGCAAAOG 
GGGCTGGTTT 
TTTTTATTAA 
AGTGGCTCCC 



31 
I 

TACTCGATGC 
GACCTTACTA 
TGGTCCCCAC 
ACGCOCCGAG 
CATCCGGGAC 
GGACCTTCCT 
AGTGCTCCAA 
GGTGTGATGG 
A09BACCAAA 
GCCAAGACGA 
ATAATTTTTA 
TGAACACAAG 
CTTCAAAAGC 
GCCAGAGCAG 
GCCTGCACGT 
TGACAGCCGC 
CGGGGATTTT 
TTTCTCATCC 
AGAAGCCTCT 
TGCTGCAGCC 
AGACCrCAGA 
GCAGATATGT 
GGAACGTCGA 
TCXGGTGGCT 
GAGTGTACGG 
GCTAATCCAC 
TGCTTCCCOG 
ACAGTGAACT 
CTGCCCAGCC 



41 
I 

TGTTGATAAC 
TGAAAACCAT 
TGTCTACGAG 
GGTCCTGAOG 
AGTGTGCACC 
CGTGGGAGCT 
CTCTGGGATA 
CGTGTCACAC 
CTTCATCCTT 
CTGGAACGAG 
CTCTAGCCAA 
TGCCGGCAAT 
AGTGGTTTCT 
GATCGTGGGC 
CCAGAACGTC 
CCACTGCGTG 
GAGACAATCT 
AAATTATGAC 
GACTTTCAAC 
AGAACAGCTC 
AGTGCTGAAC 
CTATGACAAC 
TTCTTGCCAG 
GATAGGGGAT 
GAATGTGATG 
ATGGTCTTCG 
TGCATGATTT 
TGTCTGGCTT 
TGCTCTCCCT 



51 

1 

AGCAAGATGG 
GGATACCAAC 
GTGCATCCGG 
CAGGCTTCCA 
TCAAAGACTA 
GCGCTGGCCG 
GAGTGCGACT 
TGGOCCGGOT 
CAGATGTACT 
AACTAOQGGC 
GGAATAGTGG 
GTCQATATCT 
TTACGCTGTT 
GGTGAGAGCG 
CACGTGTGCG 
GAAAAACCTC 
TTCATGTTCT 
TCCAAQACCA 
GACCTAGTGA 
TGCTGGATTT 
GCTGCCAAGG 
CTQATCACAC 
GGTGACAGTG 
ACAAGCTGGG 
GTATTCACGG 
TOCTTGACGT 
ACTCTTAGAG 
TGGCACTCTC 
AACCCCTTGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16B0 
1740 



903 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CCGCAAGGQG 
GTTGGAGGCT 
TQAGCATGGA 
GGAAAGGGAQ 
TCCCCftGCCT 
GATGGTQGCC 
AAGQGGAACA 
OAGGGAAGCA 
CATTGGGTGG 
TCCTAGCACC 
ATGTCGGCCT 
ATGCTCAGTT 
CTGAGTTCAA 



GCCCCCATTG 
GCTGTCACTT 
ACAGCCAC5QT 
ACTTCACAAG 
AGAAATAAAG 
GAAACATTTT 
ATTGAAAAGG 
GGCTCCTGGG 
CTGGAGAGTG 



TAAGGTACAC 
AGCCATCTT 



CTGGTTGTGG 
AGATCTTCCT 
CTCAGCTGCT 
GGCACCTGCA 
GGGATTTTGC 
GGACCAGCCC 
T GT TCTTATG 
AACTTGCCCT 
AGGGAGACTC 
AATGCCCCTT 
GATAGTCATT 
TGTTTCCATG 



GCACTGGCGG 
GCTGAGTCCT 
GGATGACTTQ 
GCGGCTGCCC 
TGATGGCJTTC 
TTCATGGGTG 
GGGTGAGAAT 
GAGCACTCCT 
AGCCTTCCTC 
GGTCCCTGGC 
GGAAATTGAG 
TTATGTTTCT 



TCAATTGTGG 
TTCOVGGGGC 
AGATGAAAAA 
TCTGGGGCCA 
TTAGAGCCTT 
GTGAOGTQGT 
ATAOACAGTG 
GGTGCAGG1C 
CTCATCCTCC 
AGGGCGCCAA 
GTCCATGGGG 
ACACAT TGA.T 



AAGGAAGAGG 
CAATTTTGGA 
GGAGAGACAT 
CTTGGTAGTG 
AGCAGCCCTG 
AGTCACTTGT 
CCCTTGGTGC 
TCCACCTGCA 
CTGACCCTGC 
DTTTGQCACC 
GAAATCAAGG 
GGTGGTOACC 



leoo 

LB60 
1920 
1380 
2040 
2100 
2160 
2220 
22B0 
2340 
24QO 
2460 



A153 Protein gequencet 
Gene name; 
Unigena number: 
Probeaet Accession tti 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



Transmembrane protease* 

Hs. 105807 

T48536 

JJP_0 056 4*7.1 

none found 

85-107 

plasma membrane 



serine 2 (tkprss2> 



1 
I 

KAIfflSGSPPA 
SSJPWCTQPK 
DSSGTCINPS 
GRAACRDMGV 
CIACGVNLNS 
PLNKPWHWTA 
VKFVCIiENPG 
TPAWICAGFL 
TDWTYHGjMKA 



11 
I 

IGFYXBNHGY 
SPSGTVCTSK 
NWCDOVSHCP 
KNNFYSSQGI 
SRQSRIVGGE 
FAGILR0SFM 
KMLQPEQLCW 
QGNVDSCQGD 
KG 



21 
I 

QPENPYPAQP 
TKKAbCTT/LT 



SAIjPQAWPKQ 
FYGAGYGfVQK 
ZSGWGATKEK 
9GGPLVTSNN 



31 
I 

TWPTVYEVH 
IjGTFIiVGAAIi 
IiYGPNFZLiQH 
KLNTSAGNVD 
V-5LHVQNVHV 
VISHPKFVDSK 
GKT8EVLNAA 
NIWWLIQDTS 



PAQYYPSFVP 
AAG1>I.KKFM3 



IYKKIiYHSDA 
CGGSIITPEW 
TKHKDIALMK 
XVIiLIBTQRC 
W3SGCAKAY2 



51 
1 

OTRPRVLTQA 
SKCSltfSGIEC 
VOQDDMNBNY 
CSSKAWSLR 
ZVTAAHCVEK 
LOKPliTFKDL 
NSRYVyDNl»Z 
PGVYGKVMVP 



A1S4 DMA SEQUENCS 
Gene name: Homo sapiens 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession it : 
Coding sequence: 



FW13501 f£s, clone PLACB1O09039 
Hs. 129179 
AI694767 
AI694767 

130-1086 (underlined sequences 



1 11 21 31 41 51 

I t 1 I 1 1 

CAGAGAGGCT GTATTTCAGT GCAGCCTGOC AGACCTCTTC TGGAGGAAGA CTGGACAAAG 
GGGGTC2VCAC ATTCCTTCCA TACQGTTGAG CCTCTACCTG CCTGGTGCTG GTCACAGTTC 
AGCTTCTT CA TGA TQGTGGA TCCCAATGGC AATGAATCCA GTGCTACATA CTTCATCCTA 
AXAGGCCrCC CIGGTTTAGA AGAGGCTCTAG TTCTGGTTGG CCTTCCCATT GTGCTCCCTC 
TACCTTATTG CTGTGCTAGG TAACTTGACA AT CATCT ACA TTGTOG3GAC TGAGCACAGC 
CTGCATGAGC CCATGTATAT ATTTOTTTGC ATGCTTTCAG GCATTGACTaT CCTCATCTCC 
ACCTCATCCA TGCCCAAAAT GCTGGCCATC TTCTGGTTCA ATTCCACTAC CATCCAGTTT 
GATGCTTGTC TGCTACAGAT GTTTGCCATG CACTCCTTAT CTGGCATGGA ATCGACAGTG 
CTGCTOGCCA T3GCTTTTGA CCGCTAXGTG GCCATCTGTC ACCCACTGCG CCATGGCACA 
GTACTTACGT TGCCTOGTGx CACCAAAATT GGTGTGGCTG CTGTGGTGCG GGGGGCTGCA 

ctgatggcac cccttcctgt cttcatcaag cagctgccct tctqccgctc CAAfATCCIT 

TCCCATTCCT ACTGCCTACA CCAAGATGTC ATGAAGCTGG CCTGTGATGA TATCOGGGXC 
AATGTCGTCT ATGGCCTTAT CGTCATCATC TCCGCCATTG GCCTGGACTC ACTTCTCATC 
TOCTTCTCAT ATCTGCTTAT TCTTAAGACT GTGTTGGGCT TGACACQTT3A AOCCCAQ3CC 
AAGGCAXT*JCG GCACTTGCGT CTCTCATGTG TGTGCTGTGT TCATATTCTA TGTACCTTTC 
ATTGGAXTGT CCATGGTGCA TCGCTTTAGC AAGCGGOGTG ACTCTOCACT GCCCGTC&TC 
TTGGCCAATA TCTATCTGCT GGTTCCTCCT GTK5CTCAACC CTAATTGTCTA TGGAGTGAAG 
ACAAAGGAGA TTCGACAGCG CATCCTTCGA CTTTTCCATG TOGCCACACA CGCTTCAGAG 
OCCraGGTGT CAGTGATCAA ACTTCTTTTC CATTCAGAGT CCTCTGATTC AGATTTTAAT 
GTTAACATTT TGGAAGACAG TATTCAGAAA AAAAATTTCC TTAATAAAAA TACAACTCAG 
ATCCTTCAAA TATGAAACTG GTTGGGGAAT CTOCATTTTT TCAATATTAT TTTCTTCTTT 
GTTTTCTTGC TACATATAAT TATTAATACC CTGACTAQGT TGTGGTTGGA GGGTTATTAC 
TlTTCATTTT ACCATGCAGT CCAAATCTAA ACTGCTTCTA CTGATGGTTT ACAGCATTCT 
GAGATAAGAA TGGTACATCT AC3AGAACATT TGCCAAAGGC CTAAGCACAQ CAAAGGAAAA 
TAAACAJCAGA ATATRATAAA ATGAGATAAT CTAGCTTAAA ACTArAACTT CCTCTTCAGA 
ACTCCCAACC ACATTGGATC TCAGAAAAAT ACTGTCTTCA AAATGACTTC TAC3U3AGAAG 
AAATAATTTT TCCTCXGGAC ACTAGCACTT AAGGGGAAGA TTGGAAGTAA AJGCCTTGAAA 
AGAGTACATT TACCTAOGTT AATGAAAGTT GACACACTGT TCTGAGAGTT TTCACRGCAT 

ATGGACOCTG tttttcchat ttaattttct tatcaaccct ttaattaggc aaagataita 

TTAGTACCCT CATTGTAGCC ATGGGAAAAT TGATGTTCAG TGGGGATCAG TC3AATTAAAT 

GGGGTCATAC aagtataaaa attaaaaaaa aaagacttca tgcccaatct catatgatgt 

GGAAGAACTG TTAAAGAGAC CAACAGGGTA GTGGGTTAGA GATTTCCAGA QTCTTACATT 
TTCXARAGGA GGTATTTAAT T T CTTCTCAC TCATCCAQTG TTGTATTTAG GAATTTCCTG 
GCAACAGAAC TCATGGCTTT AATCCCACTA GCTATTGCTT ATTGTTGCTQG TCCAATTGCC 
AATTACCTGT GTCXTGGAAG AAGTGATTTC TAGGTTCTACC ATTATGGAAG ATTCTTATTC 
ABAAAGTCTG crtagggctt ATAGCAAGTT atttattttt aaaagttcca taggtgtttc 
TGATAGGCAG TGAGGTTAGG GAGCCACCAG TTATGATGGG AASTATGGAA TGGCAGGTGT 
TGAAGATAAC ATTGGC3CTTT TGAGTGTGAC TCGTAGCTGG AAAGTGAGGG AATCTTCAGG 



60 
120 
1B0 
240 
300 
360 
420 
480 



orrespond to start and stop cod one } 



60 
120 
180 
240 
200 
360 
420 
480 
540 
600 
660 
72 O 
7 B0 
840 
900 
960 
1020 
1000 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1G80 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
22B0 



904 



WO 03/042661 



PCT/US02/36810 



ACCATGCTTT ATTTGGGGCT TXGTGCAGTA TGOAACAOGG ACTTTGAGAC CGGGAAAGCA 2340 

ATCTGACTTA OGCATGGGAA TCAGGCATTT TTGCTTCTGA GOQGCTATTA CCAAGGGTTA 2400 

ATAGGTTTCA TCTTCAACAG GATATGACAA CAGTCTTAAC CAAGAAACTC AAATTACATA 2460 

TACTAAAACA TGTGATCATA TATGTGGTAA GTTTCATTTT CTTTTTCAAT CCTCAGGTTC 2520 

CCTGATATGG ATTCCTATKA CATGCTTTCA TCCCCTTTTG TAATGGATAT CATATTTGGA 2580 

AATGCCTATT TAATACTTGT ATTTGCTGCT 6GACTGTAAG CCCATGAGGG CACTGTTTAT 2640 

TATTGAATGT CATCTCTGTT CATCA1TGAC TGCTCTTTGC TCATCATTGA ATCCCCCAGC 2700 

AAAGTGCCTA GAACATAATA GTGCTTATGC TTGACACCGG TTATTTTTCA TCAAACCTGA 2760 

TTCCTTCTGT GCTGAACACA TAGCCAQGCA ATTTTCCAGC CTTCTTTGAG TTGGGTATTA 2820 

TTAAATTTTA GCCATTACTT CCAATGTGAG TGGAAGTGAC ATGTGCAATT TTTATACCTG 2 8B0 

GCTCATAAAA CCCTCCCATG TGCAGCCTTT CATGTTGACA TTAAATGTGA CTTGGGAAGC 2940 

TATGTGTTAC ACAGAOTTAA TTAACCM3AA AGGCCTGGNA ATTTTTTGHN AAKHAAACTG 3000 

TGGCCKtTGAG GCCCNCAACC CTTTTTNKNA ATTTGGCAAN NTCCCACTTT GTAMTTTGGT ^ 060 
AAGGAGGCCA GTTGGATAAG TGAAAAATAA AG TACT ATT G TGTC 



A155 PROTEIN SEQUENCE 

Gene name: Homo sapiens cDNA FLJ13S81 fis, clone PLACE1 009039 

Unigene number; Hs. 129179 

Protein Accession #x not available 

Signal sequence: none found 

Pfam domain x 7tm_l [43-293] 

Transmembrane domains; 29-51, 57-79. B2-104, 203-225, 239-261, 273-295 

Cellular Localization i not determined 



1 11 21 31 41 51 

! I 1 1 I I 

MTOFNGNESS ATYFILIGLP GI/EEAQFWLA FPLCSLYL1A VLGKLTIIYI VRTSESLHEP 60 
MYIFLCMLSG ITJILISTSSM PKMLAIPHPN STTIQFDACL LQMFAIHSLS GMESTVLLAM 120 
AFDRYVATCH PXKHATVUTIj PKVTKIGVAA WRGAALMAP laPVFX KQliPP CRfiNILSHSY 180 
CLHQDVMKLA CDDIRVNWY GLIVIISAIG LDSLLISFSY LLILKTVLGL TRBAQAKAFG 240 
TCVSHVCAVP IFYVPFIGLS MVHRFSKRRD SFLPVILANI YLLVPPVLNP IVYGV5TKEI 300 
RQRILRLFHV ATHASEP 

A156 DHA SEQUENCE 

Gene namei vasoactive intestinal peptide receptor 1 

Unigene number: Ha. 19 872 6 

Probeaet Accession #s X77777 

Nucleic Acid Accession ft: nm_oo 4624,1 

Coding sequencer 57~1544 (underlined sequences correspond, to start and stop codons) 



TCGGAGCCTG CGGAGGGTGG TGGTGGTGGT GGTGGTGGCC CTCGCCCGCC TCACT CATGC 60 

CTCCTCCTCC TCTGCTCTCG CTCAGGCGCC TCGGTGGCGG TFGGTCBGCG GTTACGCGGC 120 

TGGTGGTCGC GGCGGCCGGG GCTCGCTCTC GGGGAGGCCG GGGCGGATCT CGCGGCGCAG ISO 

GCGGCGGCGG CCGAGGTGGG GTCGCGCGGC GGAGGCGGCT CGAGCTTCGT GCTGCGCGCT 240 

CGCTCTTGGG CTCCTCGCTG CAGGAGGAGT GTGACTATGT GCAGATGATC GAGGTGCAGC 300 

ACAAGCAGTG CCTGGAGGAG GCCCAGCTGG AGAATGA0AC AATAGGCTGC AGCAAGATGT 360 

GGGACAACCT CACCTGCTGG CCAGCCACCC CTCGGGGCCA GGTAGTTGTC TTGGCCTGTC 420 

CCCTCATCTT CAAGCTCTTC TCCTCCATTC AAGGCCnCAA TGTAAGCCGC AGCTGCACCG 480 

ACGAAGGCTG GACGCACCTG GAGCCTGGCC CGTACCCCAT TGCCTGTGGT TTGGATGACA 540 

AGGCAGCGAG TTTGGATOAG CAGCAGACCA TGTTCTACGG TTCTGTGAAG ACCGGCTACA 600 

CCATTGGCTA CGGCCTGTCC CTCGCCACCC TTCTGGTCGC CACAGCTATC CTGAQCCTGT 6SD 

TCAGGAAGCT CCACTGCACG CGGAACTACA TCCACATGCA CCTCTTCATA TCCTTCATGG 720 

TGAGGGCTGC CGCTGTCTTC ATCAAAGACT TGGCCCTCTT CGACAGOGGG GAGTGGGACC 780 

AGTGCTCCGA GGGCTCGGTG GGCTGTAAGG CAGCCATGGT CTTTTTCCAA TATTGTGTCA 840 

TGGCTAACTT CTTCTGGCTG CTGGTGGAGG GCCTCTACCT GTACACCCTG CTUGCDGTjCT 900 

CCTTCTTCTC TGAGCGGAAG TACTTCTGGG GGTACATACT CATCGGCTGG GGGGTACCCA 960 

GCACATTCAC CATGGTGTGG ACCATCGCCA GGATCCATTT TGAGGATTAT GGTCTGCTCA 1020 

OGTGCTGOGA CACCATCAAC TCCTCACTGT GGTGGATCAT AAAGGGCCCC ATCCTCACCT 1080 

CCATCTTGGT AAACTTCATC CTGTTTATTT GCATCATCCG AATCCTGCTT CAGAAACTGC 1140 

GGCCCCCAGA TATCAGGAAG AGTGACAGCA GTCCATACTC AAGGCTAGCC AGGTCCACAC 1200 

TCCTGCTGAT OCCCCTGTTT GOAGTACACT ACATCATGTT GGCCTTCTTT CCGGACAATT 1260 

TTAAGCCTGA AGTGAAGATG GTCTTTGAGC TCGTCSTGGG GTCTTTCCAG GGTTTTGTGG 1320 

TGGCTATCCT CTACTGCTTC CTCAATGGTG AGGTGCAGOC GGAGCTGAGG CGGAAGTGGC 1380 

GGCGCTGGCA CCTGCAGGGC GTCCTGQGCT GGAACCCCAA ATACCGGCAC CCGTOTGGAG 1440 

GCAGCAACGG CGCCACGTQC AGCACGCAGG TTTCCATGCT GACCCGCGTC AGCCCAGGTG 1500 

CCCGCCGCTC CTCCAGCTTC CAAGCCGAAG TCTCCCTGGT CTGACCACCA GGATCCCAGC 1560 

CCAAGCGGCC CCXCCCGCCC CTTCCCACTC GCAGCAGACQ CCGGGGACAG AGGCCTGCCC 1620 

GGGCGCGCCA GCCCCGGCCC TGGGCTCGGA GGCTGCCCCC GGCCCCCTGG TCTC3GGTCC 1680 

GGACACTCCT AGAGAACGCA GCCCTAGAGC CTGCCTGOAG CGTTTCTAGC AAGTGAGAGA 1740 

GATGGGAGCT CCTCTCCTGG AGGATGCAGG TGGAACTCAG TCATTAGACT CCTCCTCCAA lflOO 

AGGCCCCCTA CGCCAATCAA GGGCAAAAAG TCTACATACT TTCATCCTGA CTCTGCCCCC I860 

TGCTGGCTCT TCXGCCCAAT TGGAGGAAAG CAACCGGTGG ATCCTCAAAC AACACTGGTG 1920 

TGACCTGAGG GCAGAAAGGT TCTGCCCGGG AAGGTCACCA GCACCAACAC CACGGTAGTG 1980 

CCTGAAATTT CACCATTGCT GTCAAGTTCC TTTGGGTTAA GCATTACCAC TCAGGCATTT 2040 

GACTGAAGAT GCAGCTCACT ACCCTATTCT CTCTTTACGC TTAGTTATCA G CT TT TT AAA 2100 

GTGGGTTATT CTGGAGTTTT TGTTTGGAGA GCACACCTAT CTTAGTQGTT CCCCACCGAA 2160 

QTGGACTGGC ccctgggtca gtctggtggg aggaoggtgc AACCCAAGGA CTGAGGGACT 2220 

CTGAAGCCTC TGGGAAATGA GAAGGCAGCC ACCAGCGAAT GCTAGGTCTC GGACTAAGCC 2 2 B0 

TACCTGCTCT CCAAGTCTCA GTGGCXTCAT CTGTCAAGTG GGACTCTGTC ACACCAGCCA 2340 

TTCTTATCTC TCTGTGCTGT GGAAGCAACA GGAATCAAGA GACTGCCCTC CTTGTCCACC 2400 

CACCTATGTG CCAACTGTTG TAACTAGGCT CAGAGATGTG CACCCATGGG CTCTGACAGA 2460 



905 



WO 03/042661 



PCT/US02/36810 



AAGCAGATCC 
GTGAAAGCAC 
TTATTTGTTT 
COCTCCCTGG 
CTGGTCACAG 
CCTCTGCCftG 
GGAAAAAAAA 



TCACCCTQCT 
GGACTCTTAC 
ACCACT TOT A 
AGTGTGGCTG 
CCTCCTCTQT 
AAGATCCCCT 
AAAA 



ACACATACAG GATTTGAACT CAGATCTGTC TGATAGGAAT 252 0 

TGCTAACTTT TGTGTATCST AACCAGCCAG ATCCTCTTGG 2580 

TTATTAATGC CATTATCCCT GAATTCCCCT TGCCACCCCA 2640 

AGGAGGCCTC CATCTCA1GT ATCATCIGGA TAGGAGCCTG 270O 

CTGCCCTTCA CCCCAGTGGC CACTCAGCTT CCTACCCACA 2760 

CAGGACTGCA ACAGGCTTGT GCAACAATAA ATGTTGGCTT 2820 



A157 Protein sequence; 
10 Gene name » 

Unigene number; 

Probeseb Accession 41; 

Protein Accession #t 

Signal sequence : 
15 Transmembrane domains; 

Cellular- Localization! 



vasoactive intestinal peptide receptor 1 

HS.15B726 

X77777 

JC2195 

none found 

181-202, 214-236, 255-277, 290-311, 332-354, 377-399, 408-430 
plasma, membrane 



20 
25 
30 1 
35 



i 
I 

MPPPPLLBLR 
RSIiLGSSLQB 
CPLIFKLFSS 
YTIBYGliSIA 
DQC3EGSVGC 
PSTETMVWTI 

WAILYCFLN 



11 
I 

SLGGGWSAVT 
ECDYVQMIEV 
IQGRNVSRSC 
TIiLVATAXiiS 
KAAMVFPQyC 
ARIKFEDYGL 
SSPYSRLARS 
GEVQASLRKK 
BVSLV 



21 
1 

RLWAAAGAR 
QHJCQCLSEAQ 
TDEGWTHLEP 
LFRKLHCTRN 
VWAX3FFWLX.V 
LHCWDTINSS 
TLXiIjIKLFGV 
WRRWHLQGVL 



31 
I 

LENETIGCSK 
GPYPIACGUD 
YIHMHLFIfiF 

eglylytua 

LWWIIKGPIL 
HYLMFAFFFD 
GWNPKYRHPS 



41 
I 

AGGGGRGGVA 
MWDHLTCWPA 
DKAASUJEQQ 
ILRAAAVFIK 
VSFFSKRKYF 
T9ILVNFII»F 
NFKPEVKMVF 
GGSNGATCST 



51 

1 

RRRRTiTTTiRAA 60 

TPRGQVWLA 120 

TMFYGSVKTG 180 

DLALFDSGES 240 

WGYILIGWGV 300 

ICIIRIiiLQK 36Q 

BLWGSFQGF 420 

QVSMLTRVSP 4 B0 



A153 DNA 5EQUEHCE 

Gene name: ESTs 
Unigene number: Hs.29383 
Probeset Accession #: AW207206 
Nucleic Acid Accession # r AL133619 

Coding sequence i 1-2070 (underlined sequences correspond to start and stop codons) 



40 
45 
50 
55 
60 
65 
70 
75 
80 



1 11 21 31 41 51 

1 \ I I I I 

ATGAGCGGTG OGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 

CGGOGCCGGC GCCAGGGGCC CTCXGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 

CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 

CAGCAGCAGC ACTCGGAGAT GCTGGGCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240 

GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGQCCC TGCCTCCCCA GGCACACTCA 300 

ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 

GGGGGAACAC AGGACGGGGA GOCOCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

CCTGTATGCC AACCCAGTGG CTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 

AGCCQTOGCT GGAC5ATGTT ATGCA0GCAA GCACAGCACG TGCXGCTCTC GGGAAGCCCA 540 

GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 

CCAAGTAGAG CTGAAATGQO AAGQAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 

CCTCAGATTG CTGCTGIGGC CAGGCCCAGG ATTTCCAGCC CTATGOCTCT GAGTCCTCAG 720 

ATGCTQGGQG CCCAGGGQAT ATGGACACAC TCCATCCAGG GATGCCTTCC TGCCATCTGG 7 BO 

GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA. CTTGTGCAAO 840 

GCACTTCCGC ATCCFGACAG GOGCCCCCAC CCAGCCGAGG ATCCTGGGCT GTGGTCTCAA 900 

GCTCACTTCC GATTATCTTT GGGGCTGGGG CTGACATCAG GAGQACATCT GACTGGTGGA 960 

TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

GACATGGAGA AGGGGGTTQA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 

CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC OCTGCAGTGC TGGGGACGCT 1X40 

GACAGOACAC □GGAAGAGOC CATC3CTTTCC CTQGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 

CCCTCCTGCT TTCCAGATGG OCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 

GGCGCTCGCT GGGTCTGCAr CAAjCGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 

AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GGAAGGGTGG GCGTCTTGCG 1380 

GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 

TCTGTCAAGT GCATCTCTAA TTCAGOCAAC TCTCAAGGCA AGGCCAQGCC CCAGCCCGGC 1500 

TCCTTCAACA AGCAAGATTC AAAAGCTOAC GTCTCCCAGA AGGGGGACCT GGAAGAGGAG 1560 

CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGQCA GGCCAOAAAG 1620 

GAGAAAGCAG AGGCCTCTAA TGCAGGAQCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 

AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 

ACCACACTTA GGCAGTGOGA AGTGCTCA.TC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800 

ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAQAGGCC CCAGGCAGCC 1860 

CCGGAGGAAG CTAGCTTTCC CAGGOACCAA GAAGCCACGC ATtTCCCCAA GGTCTCCACC 1920 

AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 

CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAAOGGCGCC TGCATCGCTC AGTGCTTTGA 



A159 Protein sec-genre i 



Unigene number: 
Probeset Accession r 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



ESTS 

Hs. 29383 
AW207206 
T43457 

none found 
303-322 

not determined 



906 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



1 

! 

MSGAGVAAGT 
QQQESEKhML 
QGTQDGfiPIiQ 
GPEVIAGRQV 
MLGAQGIWTH 
AHFPI>SLGLG 
LFW AKCGPS R 
GARWVCINGV 
&VKSI SUSAN 
EKAEASNAGA 
TQEljUHLKSIi 
LKQTPKNNFA 



11 21 

1 I 
KPPSSPTPGS RRRRQRPSVG 



TVLAHLAALA 
ATGCSPDLPP 
SIQGSLPAIW 
LTSGGHLTGG 
QPQPC9AGDA 
WVEPGGESPA 
SQGKKRPQPG 
ACWGMSQHQG 
IiEGSQRPQAA 
ERQKRLQAMQ 



PVCQPGGYEP 
PSJIAEMGRNP 
AATMGIKGGS 
WSQPGNIAAG 



SFNKQDSKAD 
RQMGAGAKPP 
PEEASFPRDQ 
KRHLHRSVI) 



31 
I 

VQSLftPQSPQ 
RPALPPQAHS 
WGTHTDAATS 
WDSPCPARSI* 
R VU? PCKL SK 
AVPRALPSQG 
LOTCC9MCPK 
RPGGKRGRLA 
VSQKADLKEE 
MILPLPLRKP 
EATHFPKVST 



TLPLPQHRNT 
BRGWTKLCSQ 
PQIAAVARPR 
ALPKPDSGPH 
DMEKGVEGGP 



51 
i 

LDliKKSLOPL 
AINSSTRLGS 
AQHVLLSGSP 
I6SPMALSPH 
PAQDPGLWSQ 
FPSRCGNSBE 
NHLSRASAPL 



GGSADTVRSP 
PLiflNSKLDK 
TTLRQCFVIiI 
KSLSKKCLSP 



VPGVQGQARK 
RELKNTNIjIiQ 
PVAERAILPA 



60 
120 
180 
2-10 
30O 
360 
420 
480 
540 
60O 
660 



A160 DMA SEQUENCE 

Gene name: LIV-1 protein, 

Unigene numbers 

Probeset Accession #; 

Nucleic Acid Accession fti 

Coding sequences 



11 



estrogen regulated 
Hs. 79136 
U41060 
NM_012319.2 

138-2405 (underlined sequences corn 



eopond to start and stop codons} 



51 



21 31 41 

I I I I I I 

CTGGTGCCGA ATTCGGCACG AGACCGCGTG TTCGOGCCTG GTAGAGATTT CCCGAAGACA 
CCAGTGGGCC OGTGTGGAAC CAAACCTGCG CGCGTGGCCG GQCCGTGGGA CAACGAGGCC 
GCGGAGAOGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCXTTGCCC 
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 
AAATTAGTCC GAATTQGGAA TCTGGCATTA ATGTTGACTT GGCAATTTOC ACACGGCAAT 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATflA TTCTrTGTCA GTTGAAGGGT 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 
ACCACGACCA TCACTCAGAC CAOGAGCRTC ACTCAGACCA TGAGCGTCAG TCAGACCATG 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 
CTGCTTCTGG TAAAAATAAG CQAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAQAACAT GCCAGTGGTA 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAA3TGAC CTCAACTGTG TAjCAACACTG 
TCTCTGAAGG AACTCACITT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 
CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGOdGG QTGAGCCQGC 
TGGCTGGTAG GAAAACAAAT G AAT CTGTGA GXGAGCCCCG AAAAGGCTTT ATGTATTCCA 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACT3 ACATCTCATG 
GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 
TCAACCAAAT TQATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAOAAG GCTOAAATCC 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 
TCA3TTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 
1ACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAfl TCATAGCGAT GAAGAACCAG 
CAATGGAAAT GAAAAGAGGA CCJACTTTTCA GTCATCTGTC TTCTCAAAAG ATAGAAGAAA 
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 
AGAAGAAACC TGAAAATGAT GATGATGTGQ AGATTAAGAA GCAGTTGTCC AAGTATGAAT 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 
GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCR GCCTGCAGTC TTGGAAGAAG 
AAGAOGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT OGGCCAGTCA GACGATCTGA 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 
CTCACAGTGA CAGCCAGOGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 
TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATIT CAGCGATGGC CTAGCAATTG 
GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 
ATGAGTTGCC TCATOAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGlXA 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 
GAATTITCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 
GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 
GTGACCATGG ATGTAGCCGC TBGOOGTATT TCTTTTTACA GAATGCTGGG ATGCTXTTGG 
GTTTTGGAAT TATGTTACTT ATT TCCATAT TTGAACATAA AATCGTGTTT OGTATAAATT 
T CTAGT TAAQ GTTTAAAXGC TAGAOTAGCX TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGGGTTTAAA GTTAGTGGGT TTTGTGATTT 
TTQTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTEA ATA'llTAAGT 
TATTCTATCT TGGAGATAAA ATCTGXATGT GCAATTCACC GGTATTACCA GTTTATTATG 
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 
TXTTCAAGAA CTAACACAGT TATTCCTATA CXGGATTTTA GGTCTCTGAA GAACTGCTGG 
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 
AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 
OTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTBTATAAT ACZRGAAATCT 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 
TTCGTGCX3GG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 
CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTOT 

907 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84Q 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1B0O 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 



WO 03/042661 



PCTAJS02/36810 



GAGCAATTGT CTTTATATAC GGTACTGTAG CCA.TACTAGG 
QATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGC1T G 



GCATTCTCTA 3420 



A161 PHQTSIN SEQUENCE 



Unigene number: 
Protein Accession II: 
Signal ee<juence: 
Pfara domain i 
Tran a membrane domains i 
Cellular Localii&tiom 



estrogen regulated 



Liv-l protein, 
Hs. 79136 

NPJ>36451 

1-21 

Zip [591 -743] 

330-346, 352-368, 427-444. 
plasn 



663-679, 688-703, 730-745 



1 
I 

MARKLSVIL1 
FYRYGENNSIj 



SVSAfiSVTST 
KESTV3EFRKG 
RS CLTHTfi EK 
IiVAI*AVGTL3 
TWKGLTALGG 
EEKVDTDDRT 
HSHFHDTLGQ 
MGDGLKMFSD 
NALSAMLAYI-, 
RWGYFFLQKA 



11 
1 

LTFAI+SVTNP 
SVBGFRKIjIiQ 
EHMHAASGKN 
VYNTV9EGTH 
FMYSRNTNEN 
KAE1PPXTYS 
GDAFLHUjPH 
LYFMFLVSHV 
EGYUUVDSQE 



21 
I 

LHELKAAAFP 
WIGIDKIKRI 
KRKALCPDHD 



31 

! 

QTTEKISPNTfl 



41 
I 



51 
I 



PQECFNASKL 
LQIAWVGGFI 



ItTLlKQFKDK 
PSHFOSQQPA 



SDSSGKDPRN 
GKI, FPKTfVS 9 
LTSHGMGIQV 
AISIXSFIiSli 
HEEPAMEMKR 
KKKNQKKPEN 
VLEEEEVMIA 



GIiAIGAAFTE GLSSGLSTSV AVFCHELPHE 
GMATGIPIGH YAENVSMWIF ALTAGLPMYV 
GMLliGFGXMIj LISIFEHKIV FRINF 



BSGINVDLAI 


STRQYHLQQL 


6D 


DEEHHSDHER 


HSDHEHHBDH 


120 


SQGKGAHRPE 


HAEGRRNVKD 


1BD 


STPPSVTJ3KS 


KVSRLAGRKT 


240 


PLNATEFMYL 


CPAIINQIDA 


300 


LGVILVPLMJ 


RVFFKFLLSF 


360 


GPLFSELSSQ 


NIEESAYFDS 


420 


BDDVSIKKQL 


SKYESQLSTH 


480 


HAHPQEVYNE 


YVPRGCKWKC 


540 


RYSHE EltKD A 


GVATIAWMVI 


600 


LGDFAVliLKA 


GMTVKQAVL.Y 


660 


ALVDMVPEML 


HTJDASDHGCS 


720 




A162 SNA sequence 
Gene: name: 
unigene numbers 
Probeset Accession #s 
Nucleic Acid Accession Hi 
coding sequence * 



1 
1 

CGCGGGGCGC 
GAGGACGCGG 
GTGAAAGGAA 
CKPAACCATT 
TGCCATAAGT 
AATGTGGGCA 
TTGCGTTGTA 
GAOGGATATT 
GGTTGCCTAG 
AGAAGATCAA 
CTGCCTCCAT 
ATATCTGTGA 
TATAAAAGAC 
ATTCCTCCTG 
TCAGGCCTCC 
ATTGGAAAAG 
GTGAAAGTGT 
ACAGTGTTGA 
GGGTCCTGOA 
TATCTGAAGT 
AGTGGCTTAT 
CATCGAGATC 
GACCTGGGCC 
ACTOGAQTTO 
AATCACTTCC 
GTTGCTAGGA 



bone morphogenetlc protein receptor IB (ALK.-6) 
Hs. 87223 
AA250737 
HM_0 01203 

274-1782 (underlined sequences correspond to start and stop codona) 



CGCCCCTCAT 
ATGACAGAAT 
ACACTTGCCA 
CATCTCTGCA 
TAAGCATCCA 
CTTTCAGGGA 
TCTGTTTGTA 



AGGAAGATCA 
TGGCTCTGAG 
GAGAAGCAAA 
CCAAGAAAGA 
AATGCCACCA 
GTTTCACGAT 
GACTAGAAGG 
TTQAATGCTG 
TGAAAAACAG 
CTGTCTGTAG 
AAGAAACCAG 
GAGAATCCCT 
CTCTGCTGGT 
GTCGCTATGG 
TCTTCACCAC 
TGAGGCATGA 
CCCAGTTGTA 
CCACCACCCT 
GTCATTTACA 
TGAAAAGTAA 
TGGCTGTTAA 
GCACCAAACG 
AGTCTTACAT 
QATGTSTATC 
GTGACCCCTC 
TCCCAAACCG 
GCTGGGCTCA 
AAATGTCAGA 
GAAAGCCAAC 



TTTCATGCCT 
CTATGACAAG 
CTTC3CTTGAT 
GGATGGTGAG 
CCATTGTCCA 
GATAGAAGAG 
CTCRGATOT 
CACAOAAAGG 
AGATTTTGTT 
TTTUCTCTTC? 
ACCTOGATAC 
GAGAQACTTA 
CCAAAGGACT 
GGAAJ3TTTGG 



GGACGCGGGC 
T 

TGTTGATAAA 
AGAGGAAACA 
AACATGCTTT 
AGTACAGCCC 
GAAGACTCAQ 
GATGACTCTG 
CAGTGTCGGG 
AACQAATQTA 
GATGGACCTA 



GCGA CC TGGG 
GGCGGAGAAA 



aaacattttq 
cctaatcaca 
agacgctaaa 
cacagaaatc 
aaacattctg 

ATTTATTAGT 
CTATATGCCT 
CATGGCTGAC 
AGGAGGTATA 
TTATGAGGAC 
GTGGAGCAGT 
CAATCCTGCA 
GTCCCAGGAC 

aggtactctt 

CTTGAACATC 
CAAAGACAGA 



AGCATTGGGT 
ATTGAGCAGT 
ATAGCTAAGC 
ATGGGAAAGX 
AGCTGGTTCA 
GGTTTCATTG 
GACTATCATG 
TCAATGCTGA 
TTTAGTACTC 
GTGAAGAAAA 
GATACAAATG 
CCAGAAGTGT 
A1GTATAG7T 
GTGGAAGAAT 



GATGAGTGTC 
TCAAGGCTGA 
ATTAAACTCT_ 
CTGTTTGTGG 
GTCCTGCTTC 
GAAGCTCCCA 
ACTTGTTCAA 



41 
I 

AGTGCGGAGA 
CAGCCTACTC 
GGTTCAGACT 
AAAAGTTAAA 
MCGAAGTGC 
CCACCCCCCG 
TC3LACAATAT 
GGTTGCCTGT 
ACACTCCCAT 
ATAAAGACCT 
TACACCACAG 
TATTATTTTG 
TAGAACAGGA 
CTCAGAGCTC 
AGATTCAGAT 
GGOGTGGCGA 
GAGAGACAGA 
CTGCAGATA7 
AAAATGGTTC 

agttagccta 
aaggcaaacc 
atggaacttg 

AAGTTGACAr 
TGGACGAGAG 

ttggOctcat 

ACCAGCTTCC 
TTGTGTGCAT 
TAAGGCAGAT 
CAGCCCTGCG 
GATAGGAGAG 
GCAGAGCAAA 
CCAGTGGGTT 



51 
1 

CCGCGGCGCT 
TTTCTTAQAT 
TCTGCTGATT 
CTTACAAGCC 
AGGAAAATTA 
TCCAAAGGTC 
TTGCAGCACA 
GGTCACTTCT 
TCCTCATCAA 
ACACCCTACA 
GGCTTTACTT 
TTACTTCCGG 
TGAAACTTAC 
AGGAAGTGGA 
GGTGAAACAG 
AAAGGTAGCT 
AATATATCAG 
CAAAGGGACA 
CCTTTATGAT 
CTCTTCTGTC 
AGCAATTGCC 
CTGTATTGCT 
ACCACCTAAC 
CTTGAACAGA 
CCTTIGGGAG 
TTATCATGAC 



GAT ATGATG C 



GGGARAACTC 
GGTXAAGAAA 
GAAAAGTAAG 
AGACATCAAA 
CAGACCTCAC 
TTGATCCGTG 
AT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1BO0 
1860 
1920 
1980 



A163 Protein sequence 
Gene name: 
Unigene number: 
Probes et Accession # : 
Protein Accession tt: 
Signal sequence t 
Transmembrane domains z 
PFAM domains; 
Cellular Localization; 



bone morphogenetlc protean receptor IB (auc-6) 

HS. 72472 / Kb. 87223 

AA250737 / V89326 

NP_001194 

1-13 

128-144 

activin_receptor [30-111J , protein kinase [204-491J 
plasma membrane 



908 



WO 03/042661 



PCT/US02/36810 



1 11 21 31 41 51 

1 1 I I I 1 

MLLRSAGKLN "VGTKKEDGE6 TAPTPRPKVL RCKCHHHCPE DSVNNXC5TD GYCFTMIEED 60 

DSGLPWTSO CLGLEGSDFQ CRDTP IPHQR RSIECCTERN ECNKDLHPTX PPLKNRDFVD 120 

GPIHHRALLI fiVTVCSLLLV LIILFCYFRY KRQETRPRYfi IGLEQDETYI PPGESLRDLI 1BQ 

EQSQSBGSGS GkPLLVQRTT AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV XVFPTTEEAS 240 

WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGfiLYDY LKSTXIiDAKS 30D 

MLK1AYSSV8 GLCHLHTEIF STQGKPAIAH RDLKSKHILV KKNGTCCJAD LGLAVKFISD 360 

TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIHAUM Y6FGLILWBV AHHCVSGGIV 420 

EBYQCLPYUDIf VPSDPSYEDM REIVCIKKLK PSFPNRMSSD ECLRQMGKLM TECWAHKPAS 480 
RLTALRVKKT IiAKMSESQDI KL 



M64 DMA sequence 

Gene name: 
Unigene number > 
Probeset Accession ft: 
Nucleic Acid Accession #: 
Coding Sequence: 



ESTe 

He. 157601 

W074S9 

AC005383 

328-27S1 (underlined sequences correspond to start 



and stop codons) 



1 11 21 31 41 51 

1 1 I I I 1 

GACAGTGTTC GCGGCTGCAC CGCTCaOAOG CTGGGTGACC CGCGTAGAAG TGAAOTACTT 6D 

TTTTATTTGC AGACCTGGGC CGATGCCGCT TTAAAAAAGG OGAGGGGCTC TATGCACCTC 120 

CCTGGCGGTA GTTCClCCGA CCTCAGCCGG GTCGGGTCGT GCCGCCCTCT CCCAGGAGAG 180 

ACAAACAGGT GTCCCACGTG GCAGCCGOGC CCCGGGCGCC CCTCCTGTGA TCCOGTAGCG 240 

CCCCCTGGCC CGAGCCGCGC CCGGGTCTGT GAGTAGAGCC GCCCGGGCAC CGAGCGCTGG 300 

GTTTTCCTGT TTTCCAGAGT GCCCCCA.TCT CXCCCTCTCC AGGAAGTCCA TGTAAGCAAA 420 

GAAACCATCG GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTCGGC TGCAGTGGAC 480 

ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGQAAAG GGAGCTTTGA AAGGTCCAAG 540 

CACITTGCCA TCACAGTCTG TGACGGTCTG GACATCAGCC CCGAGAGGGT CAGAGTGGGA 600 

GCATTCCAGT TCAGTTCCAC TCCTCATCTG GAATTCCCCT TGGATTCATT TTCAACCCAA 660 

CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG GAGGGOGCAC GGAGACGGAA 720 

CTTGCTCTGA AATACCTTCT GCACAGAGGG TTGGClGGAG GCAGAAATGC TTCTQTGCCC 7BO 

CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG ATGTGGCACT GCCATCCAAG 840 

CAGCTGAAGG AAAGGGGTGT CAGTGTGTTT GCTGTGGGGG TCAGGTTTCC CAGGTGGGAG 900 

GAGCXGCATQ CACTGGCCAG OGAGCCTAGA GGGCAGCAOG TGCTGTTGGC TGAGCAGGTG 960 

GAGGATGCCA CCAACGGCCT CTTCAGCACC CTCAGCAGCT CGGCCATCTG CTCCAGCGCC 1020 

AGGCCAGACT GCAGGGTCGA GGCTCACCCC TGTGAGCACA GGAGGCTGGA GATGGTCCGG 1080 

GAGTTCGCTG GCAATGGCCC ATGCTGGAGA GGATO30GGC GGACCCTTGC GGTGCTGGCT 1140 

GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CGCACCCTGC CACCTGCTAC 1200 

AGGAOCACCT GCCCftGGCCC CTOTQACTOG CAGCOCTGCC AGAATGGAGG CACATGTGTT 12 GO 

GCAGAAGQAC TGGACGGCTA OCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC 1320 

TGTGCCCTGA ASCTGAGCCT GGAATGCAGG GTCGAOCTCC TCTTCCTGCT GGACAGCTCT 13 BO 

GOGGGCACCA CTCTGGACGG CTTCCTGCGG GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG 1440 

GCCGTGCTGA GOGAGGACTC TCGGGCCCQA GTGGGTGTGG CCACATACAG CAGGQAGCTG 1500 

CTGGTGGCGG TGCCTGTGQG GGAGTACCAG GATGTGCCTG ACCTGGTCTG GAGCCTOGAT IS 60 

GGCATTCCCT TCCGTGGTGG CCOCACCCTG ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG 1620 

CGTGGCTTOG GGAG03CCAC CAGGAGAGGC CAGGACCGGC CACGTAGAGT GGTGGTTTTG 1680 

CTXC3ACTGAGT CACACTCCGA GGATGAGGTT GGGGGCCCAG CGCGTC3VCGC AAGGGOGC5C3A 1740 

GAGCTGCnCC TGCXGGGTGT AGGCAGTGAG GCOGTGCGGG CAGAGCTGGA GGAGATCACA 1800 

GQCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAQG AXCTGTTCAA CCAAATCCCT 1860 

GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG CGGCCAGGGT GCOGGACACA AGCCCTGGAC 1920 

CTCGTCTTCA TGTTGGACAC CTCTGCCTCA GTAGGGCCCG AGAATTTTGC TCAGATGCAG 1980 

AGCTTTGTGA GAAGCT0TGC CCTCCAQTIT GAGGTGAACC CTGACGTGAC ACAGGTCGGC 2040 

GTGG1GGTGT ATGGCAGCCA GGTGCAGACT GCCTTOGGGC TOGACACCAA ACCCACOCGG 2100 

GCTGCGATGC TGCGGGCCAT TAGOCAGGOe CCCTACCTAG GTtSGGGTGGG CTCAGCOGGC 2160 

ACCGCCCTGC TGOVOVTCTA TGACAAAGTG ATGACCGTCC AGAGGGGTGC CCGGCCTGGT 2220 

GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGGG CAGAGGATGC AGCCGTTOCT 2280 

GCCCAGAAGC TGAGGAACAA TGGCATCTCT GTCTTGGTOG TGGGCGTGGG GCCTGTCCTA 2340 

AGTGAGGGTC TGOGGAGGCT TGCAGGTCOC CGGGATTCCC TGATCCACGT GGCA0OTTAC 2400 

GCCGACCTGC GGTACCACCA GQACGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460 

CCAGTCAACC TCTGCAAACC CAGCCGGTGC ATGAATGAGG GCAGClGCGT CCTGCAGAAT 2520 

GGGAGCTACC GCTGC31AGTG TCGGGATG3C TOGGAGGGCC CCCACTGOGA GAACCGTGAG 2580 

TGGAGCTCTT GCTCTGTATG TGTGAGCCAG GGATGGATTC MGAGACGCC CCTGAGGCAC 2640 

AW3GCTC3CCG TGCAGGAGGG CAGCAGCOST ACCCCTOCCA GCAACTACAG AGAAGGCCTG 2700 

GGCACTGAAA TGGTGCCTAC CTTCTGGAAT GTCXGTGCCC CAGGTCC TTA GAATGTCTGC 2760 

TTCCCGOOGT GGCCAGGACC ACTATTCTCA CTGAQGGAGG AGGATGTCCC AACTGCAGCC 2820 

ATGCTGCTTA GAGACAAGAA AGCAGCTGAT GTCACCCACA AACGATGTTG TTGAAAAGTT 2880 

TTGATGTGTA AGTAAAtTACC CACTTTCTGT ACCTGCTGTG CCTTGTTGAG GCTATGTCAT 2940 

CTGCCACCTT TCCCTTGAGG ATAAACAAGG GGICCTGAAG ACTTAAATTT AGCGGCGTGA 30OO 

CGXTCCTTTG CACACAATCA ATGCTCGOCA GAATGTTGTT GACACAGTAA TGCCCAGCAG 3060 

AGGCCTTTAC TAGAGCATCC TTTGGAC3GGC GAAGGCCAOG GOCTTTCAAG ATGGAAAGCA 312 0 

GCAGCTTTTC CACTTCCCCA GAGACATTCT GGATGCATTT GCATTGAGTC TGAAAGGGGG 3180 

CTTGAGGGAC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240 

GGTCtCAGAC TGAATGTGAC CSATTAACCA G CT TGGTTGA TGATGGGGGA GGGGCTGAGT 3300 

TGTGCATOGa CCCAGGTCTG GAGGGCCAOG TAAAATCGTT CTGAGTCGTG AGCAGTGTOC 3360 
ACCTTGAAGG TCTTC 



909 



WO 03/042661 



PCT/US02/36810 



A3. 6 5 Protein sequence 
Gene name: 
tfnigene number * 
Protein Accession #t 
Signal sequence* 
Transmembrane domains: 
VGw domainst 
JBGP domains: 
Cellular Localization : 



ESTs 

Hs. 157601 
none found 
1-17 

none found 

49-223; 341-518; 529-706 
298-333; 715-748 
secreted 



1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVFIiFSRVFF SDPIiQEVHVS KETIGKISAA fiKHMWCSAAV DIMFLiLDGSM 60 

3VGKCSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH IiEFPIiDSFST QQEVXARIKR 120 

MVPKGGRTEX EIiAUCYIjIjHR G&POORNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATWGLF3 TLSSSAICSS ATPDCRVEAH 240 

PCBHRTLEKV REFAGNAPCW RGSRRTLAVI, AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGKA NCALKLBLEC RVDLLFLLDS SAGTTI/DGFL 360 

RAKVFVXRFV RAVLSBDSRA RVQVATYSRB LIiVAVFVGEY QDVPDLVWSL DGlPFRGGPT 420 

liTGSALtaQAA ERGFG8ATRT GQDRPRRWV LLTBSHSEDE VAGPARHARA REW^LGVGS 480 

EAVRAEI.EEI TGSPKHVMVY SDPQDIjFNQI PELQGKLCSR QRPGCRTQAX* DLVFMLETSA 540 

SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTiCPT RAAMURAISQ 600 

APYLGGVGSA GTALLHIYDK VHTVQRGARP GVPKAVWLT GGRGA5DAAV PAQKLRNNGI 660 

SVLWOVOPV LSEGIJtRIiAG PKDSLIHVAA YADLRYHQDV LIEWX.CGEAX QFVHLCKPSP 720 

CNNEGSCVLQ NGSYRCKCRD GWEGPHCEKR EWSSCSVCVS QGVJILETPLR HMAPVQEGSS 780 
RTPPSUYREG U3TEMVFTFW UVCAPGP 



A166 DMA sequence 

Gene name: 
Untgene number: 
Probsset Accession # : 
Nucleic Acid Accession ft: 
Coding Sequence r 



Homo sapiens beta-l adrenergic receptor tnRHA, 3* UTR 

Hs. 37744 
AA011176 
AF272B90 

87-152 0 Cunderllned sequences correspond to start and stop codons} 



1 
I 

TGCTACCOGC 

ccggcccccg 

AGCCCGGTAA 
TGCTGGTGCC 
CGCTGTCTCA 
TCGTGGCGGG 
TCACCAACCT 
TGCCGTTCGG 
AGCTGTGGAC 
TTGCCCTGQA 
GCGCGCGGGC 
TGCCCATCCT 
ACCCCAAGTG 
CCTTCTAOGT 
AGAAGCAGGT 
CGCCCTCGCC 



11 
I 

GCCCGGGCXT 
GCCTCCGCAC3 

cgcgtcgccg 
gcagtgsaca 
caatgtgctg 
cttcatcatg 
ggccaccatc 
ctcagtggac 

CCGCTAC CT C 
GCGGGGCCTC 
CATGCACTGG 
CTGCGACTTC 
GCCCCTGTGC 
GAAGAAGATC 



21 
I 

CTCGGCATGG 
GCCGCACCGC 
CCCGCCTCGT 
GCGOGCATGG 
GTGATCGTGG 



GtGCtGtGCG 
GCCATCACCT 
GTGTGCACCG 



TCTT.CACGCT 
AGCTGGTGGC 
TCAACCCCAT 
GCTGCGCGCG 
CGGGC1GTCT 
ACGACGATGT 
ACGGGGGGGC 
CCTCGGAATC 
GGGAACGAGG 
CCTCGTCTGA 
3TTGGGAAGG 



CGCOGCCACC 
GGCCCTACOC 
CW3CTGGCTG 
CGAOCGCCTC 
CATCTACTGC 
CAGGGCTGCC 
GGCCCGGCCC 
CGTOGGGGCC 
GGOGGCGGAC 
CAAGGTGTAG 



ATCATCCGAG 



GTCACCAACC 
ATCATGGCCT 
GACAGCTGCG 
CCDGTCCCCG 
GOCCCGCTGG 
GAGC&OAAGG 
OCCTTCTTCC 
TTCGTCmCr 
CGCAOCCCCG 
CGCOGGCGCC 
GQACCCCCQC 



AGCGACTCQA 
GGCCCGGCGC 
TTACT TAAG A 
GCAAAGAGAA 
TGGCTTGCTG 



31 

1 

CCCCAACCAC 
GCX3CGGGGQT 
TCCCCGACGG 
TGCTGCCTCC 
GTCTGCTGAT 
CCATCGCCAA 
GCGCCGACCT 

TGACGGCCAG 
CGCCCTTCCG 
TGTGGGGCAT 
AGAGCGACGA 
GGGOCTACGC 
TCGTQTACCT 
AGCGCCGTTT 



CGCTCAAGAC 
TGGCCAACGT 
TCAACTGGCT 
ACTTCCGCAA 
ACGCGACCCA 
CATCGCCCGQ 
CGCGCCTGCT 
GCCTGGACGA 



41 
I 

GGCCCAGCCC 
GCTOGTCCTG 
GGGGGCCACC 
CGCCAGCGAA 
GGCGCTCATC 
GACGCCGCGG 
GGTCATGGGG 
GTACGGCTCC 
CATCGAGACC 
CTACCAGAGC 
CTOGGCGCTG 
GGCGCGCOGC 
CATCGCCTCG 
GCGGGTGTTC 
CCTCGGCGGG 
OCCGCCCGGA 
TGCGGGTAAG 
GCTGGGCATC 
GGTGAAOGCC 
GGGCTACGCC 
GGGCTTCCAQ 



GGAGOCCTGG 
GCCGTGCCGC 
CTCCGGGCAC 



51 

i 

TGCCACACCC 



CCGATAGCAG 
AAGOCA OGGA 
AT^TTCCTTG 



CCGTTGCACA 
TTG 



GCGGCGCGGC 
AGCCCCGAGC 
GTGCTGGTCA 
CTGCAGACGC 



CTGTGTGTCA 
CTGCTGACQC 



TGCTACAACG 
TCCGTAGTCT 
CGCGAGGCCC 
CCAGCGCGGC 
CCCCCGCGCC 
CGGOGGCCCT 
ATCATGGGCG 
TTCCACCGCG 
AACTCQGOCT 
GGACTGCTCT 

GACGACGACG 
GCCGGCTGCA 
CCCGGCTTCG 
GGCTTCCCAG 
AGCCCACAAT 
AAAAGGAAAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16B0 



A167 Protein 



Gene naraei 
Unigene number: 
Protein Accession # = 
Signal sequence: 
Transmembrane domains; 
Pfam domlanr 
Cellular Localization: 



Homo sapiens beta.-l adrenergic receptor mRHA, 
Kb. 37744 
AA011176 
none found 

62-H4, 95-117, 135-157, 177-19B, 226-248 
7tm_l f75-377] 
plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MGAGVIiVlGA SBPGNL8SAA PLPDGAATAA RLLVPASPPA SLLPPA3K3P EPl*SQQWTAG 60 
t«31>LMAI,XVt> LXVAGKVDVI VAIAKTPRLQ TliTNLFlMSL ASADI,VMGI*I» WPPGATIW 120 

910 



WO 03/042661 



PCT/US02/36810 



HGRWEYGfiFF CELWTSVDVL CVTASTETLC VIAUDRYIAI TSPFRYQSLL TRARARGIjVC 180 

TVWAISALVS FLPILMHVIWR AESDEARRCY HDPKCCDFVT NRAYAIASSV VSFYVPLCIM 240 

AFVYIiHVFRE AQKQVKKIDS CERRFLGGPA RPPSPSPSFV PAPAPPPGPP RPAAAAATAP 300 

IiANGRAGKRR PSRLVALREQ KALKTLGUM FUVNWKAFH REL.VPDRLFV 360 

FFNWI/3YAN5 AFNPIIYCRS PDFRKAFQGL LCCARRAARR HHATHGDRPR ABGCLARPGP 420 
PPSPGAASDD DDDDWGATP PARliLBPWAG CNGGAAADSD SSLDEPCRPG FASBSKV 
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Alfrg DMA sequence 
Gene naraei 
Unigeno number; 
Pxobeset Accession #; 
Nucleic Acid Accession #t 
Coding sequences 81-3 080 



CEGP1 
Hs. 222399 
AA25G4B5 
AJ400877 

(underlined sequences correspaod to start and 9 top codons} 



1 
I 

GGCGTCCGCG 
OCGCAACCGC 
CGGTGCTGCT 
CGGGTCGGGG 
ATGACTGCCA 
AGCCTGGCTA 
TCAATGGAGG 
TTQATQOCTT 
AGAACAATGG 
GCAAGOAGGG 
GCCTGAGCTG 
GCAGCGTCQC 
TCTTGACCTG 
GCCCAGAGTG 
AC3CGAGAGGA 
ATAAACGOGT 
AC03CACCTG 
TCCAGTTGGA 
OTGATCATTT 
AATTATTAAC 
GTGACCACAG 
CCCTGTATGQ 
GTCAGCAGGT 
AGCTCCACTS 
CTCCCOGTGT 
GTCACTCTGQ 
AGCTAAATGA 
CAGCACTAOC 
GCAGCTCTGG 
TTATCACTGT 
TtJAQCTGCAT 
AGGCOGXCCA 
AAAA3CCTCC 
CAGAAAACCA 
G CAT TUTATG 
GCCCAAGACC 
GTGGAGGTCT 
GTGCCCTGGG 
GCCTTGCCAC 
GTTCACCTGG 
CATACCAGCC 
ACTTTGATGG 
GAGATTTCAC 
AGTQZAGGTG 
TCTTCCTGCC 
CCAATTCTGT 
CCAGGTCAAA 
TCCAGGTCOC 
GAQATGGCAG 
TCAAGGCTCr 
AGTCOGGAGA 
TTTTQAGACC 

ggttggxggq 
ccotatcagt 
gaacttggtt 
cagcttctca 

T1GGTCABCC 
TGTAGTGGAA 
CO0C3CCCTCT 
CAAGAGGGGA 
ACXCAGTTTC 
AGTTCTAAGC 
AGCACTTCTG 



11 
I 

CACACCTCCC 
TGAGOCATCC 
GCTGCTGCTG 
CCGTGCCGCG 
TGCCGACGCC 
CCAAGGGGAA 
CTGTOTCCAT 
CATGTTGGCT 
OGGCTGCCAG 

CATGAATAAG 
CTC3TGAGTGC 
TAACCATGGG 
CAGCTGCCAT 
CACK5TCCTG 
GAAAOGGCGG 
TAAGGATACT 
TGGQAAGACA 
CTGCAAAAAC 
AGATGAGAAG 
CTGCATCAAC 
CTTCACCCAC 
CTGTGTGAAC 
GAATAAAAAA 
GTCCCTGCAC 
CATT CACCTC 
AGGCAAGTGT 
AGAGAAGCAC 
CAAGOAAGTC 
TGAGTTTGAG 
CGXAAAGOGA 
CAGGGAGCAG 
CAGAACATCT 
ATGTGTCAGT 
TOCAAATGGA 
AGGAAATTCT 
GTGTCAACCT 
CACGTTCUAG 
GAAACATCAG 

acatttctac 
tgaatttgga 

CTCCACAAAC 
TGGGTACATT 
GACCATCAAC 
CATAOAGGAC 
GACAACATAT 
GAAGCTGTOG 
ATACGTGACA 
GCTCTATOCA. 
GTTTGATGTC 

gatgtttcca 
ttacaa atga 
acaoagctgt 
gactcattag 

TTTCTTTCCC 
CTGCTGTGGO 
TAGGTGAGAC 
AGGAGGCCAC 
CTAAGGGAGC 
GGGAAGGAGA 
TCCACAQCCT 
AGTGCTOGTG 
GAGACAT 



21 

i 

CGCGCCGCCG 
ATGGGGGTCG 
CTGCTGCCQC 

CTGTGTCAGA 
GGCAQGCAGT 
GACTGTTTGA 
CATGACGGTC 
CATAOCTGTG 
AGTGACAATC 
GATCACGGCT 
AGGCCTGGTT 
AACGGTGGGT 
CCACAGTACA 
GAGGTGACAG 
CTGCTCATGG 
TCOACAGGTG 
TGTAAAGATA 
ATCGTGGGCA 
TCTTGCCAAG 
CACCCTGGCA 
TGTGGAGACA 
ACAGTGGGCA 
GACTGIGTGG 
TGCGGTAAGA 
TCXXCAGATG 
AGTTTGAAAA 
AGCTCAGTAA 
CCAGGAGCCC 
CTTGAAACTA 
ACCGAGAAGC 
TTTCACCTCC 
GAACGCCAGG 



31 
I 

CCQCCACCQC 
CGGGCCGCAA 
CACTGCTGCT 
AGGATGTAGA 
ACAC3VCCCAC 
GTGAGGACAT 
ATATTCCAGG 
ataattgtct 
TCAACGTCAT 
AGCACACCTG 
GTAGTCACAT 
TTGAGCTGGC 
GCCAGCACTC 



AGAGCAACAC 
AAAOQTGTGC 
TCCACTGCAG 
TTGATGAGTQ 
GTTTTGACTG 
ATGTGGATOA 
CATTTGCTTG 
CCAATGAGTG 
GCTATGAATG 
AAGTGAAGGG 
GTGGTGGAGG 
TCACJCA.CCAT 
ATGClGAGCT 
AAGAGAGCTT 
CTGGCCGACC 
ACCAAAAGGA 
GGCTCCGTAA 
A3CTCTCAGG 
CAGAGTCCTG 



41 
I 

CCGCACTCCG 
CCGTOCOGGG 
GCTGGCGGGG 
TGAGTGTGCC 
CTCCTACAAG 
CGATGAATGT 
CAATTATCQT 
TGATGTGGAC 
GGGGAGCTAT 
CATTCACCGC 
CTGCAAGGAG 
CAAGAACCAG 
CTGTGACGAT 
AGATGGGAGG 
CACATGAGTG 
TGTCAACAAT 
TTGTCCTGTT 
CCAGAGCOGC 
CGGCTGCAAG 
GTSCTCTTTG 
TGCTTGCAAC 
CAGCATCAAC 
CCAGTGCCAC 
GCXCCTGCCC 



CAGGACAAGT 
GTTTCCCGAQ 
CCGCTACGTA 
AAGCACOCCT 
GGTGACAGCr 
AGCCATCCGC 
CATGAACCTC 
TGGAGTGGGC 



ACCTTCCAAA 
GGGGCCCTGA 
GGTQAATATT 
CCTGAAGCTG 
GQAGCTACXT 
AACACCACCA 
AAAAATAATT 
ATAACCCAGT 
GAATCCCCAA 

cavccccocA 

GACTGTGGGG 
GAAACCTGCC 
ATTCAGTTCA 
TATGATGAGG 



CTGGCCCATC 
AGATCGTTCA 
CTCAGCCCAC 
CTTCCTTCTG 
AGTTCAATTT 
AGCATCGTGG 
OGGATGTCTT 
TCACCTGTCC 
AGAATAAGCT 
CCTCTGCACT 
GCCCTGCAGG 
TCTCCAGCCT 
AAAAAAAAAA 



AGACCCCAGA 
CTGCAGATGG 
GTCGAACTTC 
CCTTTCAGGA 
CTCAOCGATG 
GTX3TTTCTTG 
GTAAAAACAG 
ACTACCCAGG 
AGCGCCGCAT 
ACTATCTQGT 
AGAOCTAOGA 
AGTCCRATQA 
ACTACCAGGA 
ATCAGGAAAT 
CCCAGAACTA 
TCCGATTQCT 
GTGCCACTCA 
CATGTCAGCA 
TTATAGATAA 



AGCTTGGAAT 
CTTTGCACCT 
CTGCTTCCOC 



TATTCGTTt3C 
OCCAGGAAAT 
AAGATGTGGA 
CAATTACCCA 
CCTGATCGTG 
GATGCGQAAA 
AOGCGCCATC 
AGGGAACAGC 
ACTCATTGAA 



51 
1 

CCGCCTCTGC 
GCSGCCTGOG 
GCCGTCCCGC 
CAAGGGCTAG 
TGCTCCTGCA 
GGAAATGAGC 
TGCACTTGTT 
GAGTGCCTGG 
GAGTGCTOCT 
TCGGAAGAGG 
GOaCCAAGGG 
AGAGACTGCA 
ACAGCCGAIG 
AGCTGCCTTG 
GTGGATGGGG 
GGAGGCTGTG 
GGATTCACTC 
AATGGAGGTT 
AAAGGATTXA 
GATAGGACCT 
GGAGGGTACA 
AAOGGAGGCT 
CCTGGGTACA 
ACAAGTGTGT 
TXCCTCAGAT 
GTAACCTTTA 
GGTCTGGGAC 
AACCTTACAT 
AAGGAAATGT 
TCTTGTGACC 
ACGOTCAGAA 
GACGTGGCTA 
CAGGGrCATG 
CGAGAACGCT 
TGTGAACCAT 
ATGTCTGAAT 
TGCGAGCTCT 
TGTGGAGGAG 
AGAGTTCAAT 
CCAGTGGGAA 
ACTAOGACTG 
GGGGAGCTGG 
GCCAACACCG 



GGATAGATCA 
TTCTGGGGTC 
GCTTATTCTG 



CTCCCTCCAC 

gtgtgataca 



TTTCAAGTAC 

acgttccaaa 
atacaaatgt 
cagtcgggta 

TAC3VGATCATT 
AGAATGGCTT 
CGGGCTGGCT 
TTACTCCTOC 
AAACXTCAGC 
CTCTGACCAG 
CCACCTTGAG 
AGTTTGATCC 
TTAGAAATAA 



ACCTCTTCAT 
GCCTTCACCT 
GCIAGAGGGT 
GAGATAGITC 
AAGAAACTTA 
ACAGCCCAGG 
GTGTCCAGGT 
TCTGCEATAG 

TTGGTAAATT 
TGAGTGGCAT 
GAGCIGGACT 
TCAAGGAGTC 
irCCTCTAJSC 
GCAGAACAGG 
ACCTGGGAGG 
CAGGAACTTG 
ATAAAAACTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2S80 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 



A169 Protein sequence 

Gene name: 

Oal gene number i 



CEGPl 
Hb. 222399 



911 



WO 03/042661 



PCT/US02/36810 



Probe set Accession fti AA256485 
Protein Accession ff; CAB92285 
Signal aequeace: i-3l 
Tra rug membrane domains ; none 

PFAM domains; EGF-like__domain6 [49-84,132-167,177-213 ,286-321 ,407-442] CUB_domain [809-9181 

Cellular localization: may be secreted 



1 11 21 31 41 51 

I I I 1 I I 

MGVAGRNRPG fiAHA V LIiLLIj KL.PPLU.UMS AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 

LCQNTPXSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH EKZLJTIPGNYR CTCFDGFMLA 120 

HDQHNCL33VD ECLENNGGCQ HTCVNVMGSY ECCCKBGFFL BDNQHTClHR SBEGLSCMHK 180 

DHGCSKICKE APRGSVACEC RPGFEliAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 

PQYKMETDGR BCLEREDTVL BVTE3NTTSV VDGDKRVKRR LLMETCAVMtf GGCDRTCKDT 300 

3TGVHCSCPV GFTLQLDGKT CKD1DECQTR NGGCDHFCKN IVGSFDCGCK XGFJOJ^TDEK 36Q 

SCQDVDECSI. DRTCDHSCIK HPGTPACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 

tvgsyecqch fbyklhwnkk dcvevkgllp tsvsprvslh cgk9ggqdgc fisrchsgihl 480 

ssnvttirts vtfki^iexskc slknaeiiffe gr.rpat.pekh ssvkesfryv nltcssgkqv 540 

pgapgrfstp kemfitvefe letnqkevta scdlscivkr tekrlrkair tlrkavhrsq 600 

ehi^lsgmnl dvakkpprt9 erqaescgvg qghaenqcvs cragtyydga rsrcliicphg 660 

tfqneegqmt cepcprpgns galktpeawn hsecgglcqp geysadgfap cqlcalgtfq 72 d 

peagrt9cfp cgggioatkhq gatsfqdcet rvqcspghpy nttthrcirc pvgtyqpefg 780 

KNNCVSCPGN TTTDFDGSTN ITQCKMRRCG GSLGDFXGYI ESPKYPQHYP ANTECTWTIN 84 Q 

PPPKRRTLIV VPEIFLPIED DCGDYIjVMRX T3SSNSVTTY ETCQTYERPI AFTSRSKKIjH 900 

IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENEQEIiKD KKL I KALrFD V 960 
LAHPQHYFKY TAQESREHFP RSFIRLLRSK VSRFLRPYK 



A170 DMA sequence 

Gene naraet deme-6 protein (KIAA0452) 

Unigene number r Hs .12S783 

Probeset Accession fci AL039402 
Nucleic Acid Accession #: AF007170 

Coding sequence: 1-1725 [underlined sequences correspond to stop codon) 

1 11 21 31 41 51 

I I I I I 1 

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 

CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 

AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGGCACCATC 180 

CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 

ATGAAGGAGG CACAGATGCT GTOTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 

TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 

OAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCGT GCAGGAGGAG 420 

AACATGOTGA GCTTCATCAA AGQCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 

GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA DCCGCACTTT 540 

GAAGGAGGAB TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCXACT 600 

AGGATCCTGA GGCTGTTGGA OTTTGTGGGG TTTTCAGGAA ACAAGGAOTA TGGGCTGCTO 660 

CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCQCTCTO TGCTCTGTGT CATGCTCCTG 720 

CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATGGAGGAG 780 

GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 

•PTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTC33AG 900 

OAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTOCTA CTGGGAGCTG 960 

ATGTGOTGCT TCACCTACAA GOGCCA0TGG AAOATGTCCT ACTTCTACGC CGACCTGCTC 1020 

AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCOBC CTAOCTCAGC 1080 

ATGTTTGGGA AGGAGGACCA CAAQCCQTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 

OTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAMCICTAC CCACAGAOAA GTTTGCCATC 1200 

CGOAAGTCCC GGGGCXACTT CPCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 

ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCQAAACT CACGGATGGG 1320 

AlACTTGAGA TTATCACTAA GGGTGAAGA3 ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 

TCAGTGGATG AOGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 

CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTQ CCAATOAAAA GAAGATTAAA 1S00 

TATQACCACT ACTTGATCCC AAADGCCCXG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 

GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATClGCCA AGCAAAACTA CAAGAATTAC 1620 

TCCATGGAGT CAAGGACACA CTrTCGAATC CAGGCAGOCA CACTCCAAGC CAAGTCTTCC 1680 

CTAGAGAACA GCAGCAOATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TOCAGCAGTT 1740 

CCGGGCXGGA AGACAGAGAC ASCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 

CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGQATGG CACAACATAG I860 

TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTT1GCCAAG 1920 

GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 19BO 

GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 

CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 

TATCATCTGG AGGTAAATCT CTTXATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 

TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 

AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 

CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 

AAjGCACQTCT TrTTAGTOAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 

AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 

AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 

CTGAAACCAC TOGOAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 

CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACT3TTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 



A171 Protein sequence 



912 



WO 03/042661 



PCT/US02/3681© 



Gene name: 
"Dai. gene number: 
Probeeet Accession ft? 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular Localization: 



DEME-6 protein (KIAA0452) 

Hs. 125783 

AB033402 

AAC39582 

none 

210-226 

plasma membrane 



1 11 21 31 41 51 

I I ] I I I 

MT ALDLP LTN QFSEAL9YIiK PRTKESMYHS LTYATILEKQ AMMTPDPQDI LLAGMMMKEA 60 

QMIjCQRHRRK SSVTDSPSSli VNRPTLGQFT KEEIKAEVCY AECIXUQRAAIj TFLQDENWVS 120 

FIKGGIKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTHlLR 180 

LIiEFVGFSGN KDYGLIiQI/EE GASGHSFRSV IiCVMT.TiTiCYH TFLTFVLGTG NVNIEEAEXL 240 

LKPYLNRYPK GAI FLFFAGR lEVIXGHIDA AIREFBECCE AQQHWKQFHH MCYWELM&OF 300 

TYKOQWKWSY FYADLLSKEK? CWSKATYIYM KAAYLSMFGK EDKKFFODDE VELFRAVPGL 360 

KLKIAGKSLP TEKFAIRKSR RYF3SWP I SI* PVPALEMMYI WNGYAVIGKQ. PKLTEGILEI 420 

ITKAEEMLEK GPKNEYSVDD ECLVKLLKGli CLKYLGRVQE AEBNFRSISA NEKKIKXDHY 480 

liISMALI^EA LDLMEQDRNE EAIKLLESAX QNYKNYSMES RTHFRIQAAT LQAXSSLENS 540 
SRSMVSSVSI- 



A172 SKA sequence 
Gene namei EST 
Uuigene number: Ha. 2 00102 

Probeset Acceaaicm #:. AI.117406 
Nucleic Acid Accession #r none found 

Coding sequence j 1-4044 (underlined sequences correspond to start and stop codone) 

1 11 21 31 41 SI 

) I 1 I I I 

ATGA CTAGGA AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCQTGQC 60 
ATCGACATAG GCGATGACAT GGTTTCAGGR CTTATTTATA AAADCTATAC TCTCCAAGAT 120 
GGCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCQ 180 

TOOC3GGAAGT ATGATGCTGC CTTGAGAACC ATQATTCCCT TCCGTCCCAA GCCGAGGTTT 240 
CCTGGCCCCC AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC 300 

ACCCCGCTCA TGATCCAAAG CTTAC3GGAGT CGCTTAGATQ AGAACACCAT CCCTCCACTG 360 

TCAGTCCATG ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA 420 
GTCTCAAGGC GAGGGATTGA AAAAGCTTCA OTQCTTCTGQ TGATGCTGAG GTTCCAGAGA 480 

ACAAGGTTGA TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG 540 
CCAATATTGA TTATACCAAA GATCCTGCIAA TATTCAQAAG AQCAGTTOGG GAATGTTGTC 600 
CATGGAGTGG GACTCTGCTT TGCOCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC 660 
TCCTCCAGTT GGATCATCAA CCAAOGCACA GCCATCAGGT XCOGAGCAGC TGTTTCCTCC 720 
TTTGCCTTTQ AGRAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAOAGGCC 780 
ATCAGCTTCT TCACCGGTQA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA 840 
GTACTGATCA CCTGCGCATC GCTGGTCATC TGCAGCATM CTTCCTACTT CATTATTQGA 900 
TACACTGCAT TTATTGOCAT CTTATOCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG 960 

ACAAGAATGG CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCOT 1020 

GTGACOWJTG AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA 1080 

TTTGCAAAAA TCATTGAAGG TATGGAAAGT CTGACTTTCT GCTCCAAACC TGGTGATGGC 1140 

ATGGCCTTCA GCATGCTGGC CTCCTTGAAT CTCCTTCGGC TGTCAGTGTT CTTTGTGCCT 1200 

ATTGCAGTCA AAGGTCTCAC GAATTCCAAG TCTGCAGTGA TGAGGTTCAA GAAGTTTTTC 1260 

CTCCAGGAGA GOCCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAQCAA AGCTCTGGTC 1320 

TTT GAGGA GG CCAGCTTQTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 1380 

GAGCTGGAGA GGAACGGGCA TGCTTCTGAG GGGATGACCA GGCCTAGAOA TQCCCTCGGG 1440 

CCAGAGGAAG AAQGOAACAG CCTGGGCCCA GAGTTGCACA AGATCAACCT GGTGGTGTCC 1500 

AAGGGGATGA TGTTAGGGGT CTGCGGCAAC ACGGGGAGTG GTAAGAGCAG CCTGTTGTCA 1560 

GCCATCCTGG AGGAGATGCA CTTGCTCGAG GGCTCGGTGG GGGTGCAGGG AAGCCTGGCC 1620 

TATGTCCCCC AGCAGGCCTG GATCGTCAGC GGGAACATCA GGGAGAACAr CCTCATGGOA 1680 

GGCGCATATG ACAAGGCCCG ATTACCTCCAG GTGCTCCACT GCTGCTCCCT GAATCGGGAC 1740 

CTGGAACTTC TGCCCTTTGQ AGACATGACA GAGATTGGAG AGCGGGGCCT CAACCTCTCT 1800 

GGGGGOCAOA AACAGAGGAT CAGCCTGGCC GGCGCCGTCT ATTCCGACCG TCAGATCTAC I860 

CTGCTGGACG ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAG 1920 

TGCATTAAOA AGACACTCAG GGGGAAGACG GTCGTCCTGG TQAjCCCACCA OCTGCAGTAC 1980 

TTAGAATTTT GTGGCCAGAT CATTTTGTTG GAAAATGGGA AAATCTGTGA AAATGGAACT 2040 

CACAGTGAGT TAATGCAGAA AAAGGGGAAA TATGCCCAAC TTATCCAGAA GATQCACAAG 2100 

GAAGCCACTT CGGACATGTT GCAGGACACA GCAAAGATAG CAGAGAAGCC AAAGGTAGAA 2160 

AGTCAGGCTC TGGCCACCTC CCTGGAAGAG TCTCTC3VACG QAAATGCTGT GCCGGAGCAT 2220 

CAGCTCACAC AGGAGGAGGA GATGGAAGAA GGCTCCTTGA GTTGGAGGGT CTACCACCAC 2280 

TACATCCAGG CAGCTGGAGG TTACATGQTC TCTTGCATAA TXTTCTTCTT CGTGGTGCTG 2340 

ATCGTCTTCT TAACGATCTT CAGCTTCTGG TGGCTGAGCT ACTGGTTGGA GCAGGQCTCG 2400 

GGGACCAATA GCAGCCGAGA GAGCAATGGA ACCATGGCAG ACCTGGGCAA CATTGCAGAC 2460 

AATCCTCAAC TGTCCTXCTA CCAGCTGGTG TACGGGCTCA ACGCCCTGCT CCTCATCTGT 2520 

GTGGGGGTCT GCTCCTCAGG GATTTTCACC AAAGTCACGA GGAAGGCATC CACGGCCCTG 2560 

CACAACAAGC TCTTCAACAA GGTTTTCCGC TGCCCCATGA GTTtCTTTGA CACCATCCCA 2640 

ATAGGCCaOC TTTTGAACXG CTTCTCAGGG QACTTGOAAC AGCTGGACCA GCTCTTGCCC 2700 

ATCTTTTCAG AGCAGTTCCT GGTCCTGTCC TTAATGGTGA TCGCCGTCCT GTTGATTQTC 2760 

AGTGTGCTGT CTCCATATAT CCTGTTAATG GOAGCCATAA TCATG3TTAT TTGCTTCATT 2620 

TATTATATGA TGTTCAAGAA GGCCATCGGT GTGTTCAAOA GACTGGAGAA CTATAGCCGG 2880 

TCTCCTTTAT TCTCCCACAT CCTCAATTCT CTGCAAGGCC TGAGCTCCAT CCATGTCTAT 2940 

GGAAAAACTG AAGACTTCAT CAGCCAGTTT AAGAGGCTQA CTGATGCGCA GAATAACTAC 3000 

CTGCXGTTGT TTCTATCTTC CACACGATGG ATGGCATTGA GGCTGGAGAT CATGACCAAC 3060 

CTTOTGACCT TGGCTGTTGC GCTGTTCGTG GCTTTTGGCA TTTCCTCCAC CCCCIACTCC 3120 



913 



WO 03/042661 



PCT/US02/36810 



TTTAAAGTCA TGGCTGTCAA CATCGTGCTG CAGCTGGCGT CCAGCTTCCA GGCCACTGCC 318D 

CGGATTGGCT TQ3AOACAQA GGCACAGTTC ACGGCTGTAG AGAGQATACT GCAGTACATG 3240 

AAGATGTGTG TCTCGGAAGC TCCTTTACAC ATGGAAGGCA CAAGTTGTCC CCAGGGGTGG 3300 

CCACAGCATG GGGAAAICAT ATTTCAGGAT TJVTCACATGA AATACAGAGA CAACACACCC 33 GO 

ACCGTOCTTC ACGGCATCAA CCTGACCATC CGCGGCCACG AAGTGGTGGG CATCGTGGGA 3420 

AGGACGGGCT CTGGGAAOTC CTCCTTGGGC ATGGCTCTCT TCCGCCTGGT GGAGCCCATG 34B0 

GCAGGCCGGA TTCTCATTGA CGGCGTGGAC ATTTGCAGCA TCGGCCTGGA GGACTTGCGG 3540 

TOCAAGCTCT CAGTGATCCC TCAAGATCCA GTGCTGCTCT CAGGAACCAT CAGATICAAC 3G00 

CTAGATOOCT TTGACCGTCA CACTGACCAG CAGATCTGGG ATGCCTTGGA GAGGACATTC 3660 

CTGACCAAGG CCATCTCAAA GTTCCCCAAA AAGCTGCATA C3U3ATGTGGT GGAAAACGGT 3720 

GGAAACTTCT CTGTGGGGGA GAGGCAGCTG CtCTGCATTG CCAGGGCTGT GCTTCGCAAC 3730 

TCCAAGATCA TCCTT AT CG A TGAAGCCACA GCCTCCATTG ACATGGAOAC AGACACCCTG 3840 

ATCCAGGGCA CAATCCGTGA AGCCTTCCAJS GGCTGCACCG TGCTCGTCAT TGCCCACCGT 3900 

GTCACCACTG TGCTGAACTG TGACCACATC CTGGTTATGG GCAATGGGAA GGTGGTAGAA 3960 

TTTGATOGGC CGGAGGTACT GCGGAAOAAG CCTGGGTCAT TGTTCGCAGC CCTCATGGCC 4020 

ACAGCCACTT CTTCACTGAG ATAAG GAGAT GTGGAGAC7TT CATGGAGGCT QGCAGCTGAG 4080 

erCAGAGGTT CACACAGGTG CAGCTTCGAG GCCCACAGTC TGCGACCTTC TTGTTTGGAG 4140 

ATGAGAACTT CTCCTGGAAG CAGGGGTAAA TGTAGGGGGG QTGGGGATTG CTGGATGGAA 4200 

ACCCTGGAAT AGGCTACTTG ATGGCTCXCA AGACCTTAGA ACCCCAGAAC CATCTAAGAC 4260 

ATGGGATTCA GTGATCATGT GGTTC TCCTT TTAACTTACA TGCTGAATAA TTTTATAATA 4320 

AGGTAAAAGC TTATAGTTTT CTGATCTGTG TTAGAAGTGY TGCAAATGCT GTACTGACTT 4380 
TGTAAAATAT AAAACTAAGG AAAACTCAAA AAAAAAAAAA AAAAAAA 



A173 Protein sequence 
Gene name: 
Unioene number: 
Probeaet Accession #s 
Protein Accession #7 
Signal sequence: 
Transmembrane domains: 

943,101.8-1034 

PPAM domains i 
Cellular Localization: 



EST 

Hb. 200102 
ALU 74 06 
none found 
none found 

169-185, 199-215, 275-291, 304-320, 387-403, 770-786, 829-845, 907-923, 927- 

ABC_transporter [502-673], ABC_membrane_regioa [163-432, 771-iogo] 
ATP-binding_domaina E508-51G, 1139-1147T 
plasma membrane 



1 11 21 31 41 51 

I I I I I I 

KTRKRTYWVP NSSGGLVNRG IDIGDDMVSG LIYKTYTLQD GPW8QQEHNP EAPGHAAVPP 60 

KGKYDAAIiRT MIPFRPKPRF PAPQPLDNAG laPSYIiTVSWL. TPLMIQSLRS RLDEETIPPL 120 

SVHDASDKNV QRLHRLWEEE VSRRGISKAS VUjVMtRPQR THJb I FD ALLG ICFCTASVLG 1B0 

PIIiTlPKILB YSEEQLGJIW HGVGLCFALF LSECVKSLSF SSgWimORT AIRFRAAVSS 240 

FAFEKLIQFK SVIHlTfiGEA ISFFTGDVfiJY LFSGVCYGPL VLITCASLV1 CSISSYFI1G 300 

YTAFIAILCY LLVFPXjftVFH TRMAVKAQHH TSBVSDQRIR VTSBVLTCIK JjIKMYTWEKP 360 

FAKIIEGMES LTFCSKPGDG MAFSMIASLH LL&LSVFFVP 1AVKGLTMSK SAVMRFKKFF 420 

LCESPVFYVQ TLQDPSKALV FEEATLSWQQ TCPGIVNGAI. ELERNGHABE GMTRPRDALG 430 

PBEEGNSliGP ELHKISLVVS KQMHK3VCQN TOSOKSSLLS AI&EEMHDIjE GSVOVQGSIiA 540 

YVPQQAHIVS GNXREKILMG GAYBKARYLQ VLHCCSLNRD LELLPFGDMT EIGERGLNLS 600 

GGQKQRISLA RAVYSDRQIY LLDDPL9AVD AHVflKHIFEE CXKKTIiRGKT VVLVTHQLQY 660 

CiEFCGQlXItl* ENGKICENGT HSELMQKKGK YAQLIQKMHK EATSDMLQDT AKIAEKPKVE 720 

SQALATSLEE BLNG&AVPEH OLTQHEEMEE GSLGWRVYHH. YIQAAGGYMV fiCIlFFFWL 780 

IVFLTIPSPW WL^X^LEQGS GTKSSRESHG TMADLGNIAD NPQLSFYQLV YGfcNALL&IC 840 

WVCSSGIFT KVTHKASTAli HHKLFNKVFR CPMSFFDTIP IGRLLNCFAG DLEQLDQLIjP 900 

IFSBQFLVLS IMVIAVLliIV STrtiSPyiLIM GAIIKVICFI XYMMFRKAXG VFKRltENYSR 960 

SPLPSHXLHS IiQGLSSIHVY GKTEDFISQF KRLTDAQNNY LLI/FLSSTRW MAIsRLEIMTN 1020 

IiVTTAVALFV AFGlSQTFYS FJCVMAVNIVL QIiASSFQATA RJOIrETEAQF TAVBRXLQYM 10 BO 

KMCVBEAPIS HEGTSCPQGW PQHGBIIFQD YHMKYRDNTP TVLHGINLTI RGHEWGIVG 1140 

RTGSGKSBLG HAIiFRLVEPM AGE I Ij HX3 VD ICSXOIiEDLIL SKIiBVXPQDP VLIiSGTlRFK 1200 

ItDPFDRHTDQ QIWDALERTF LTKAISKFPK KLHTDWENG GMFSVGERQL LCIARAVLRN 1260 

SKIXLIDKAT AS1DMETDTL IQRTIEEAFQ GCTVLVIAHR VTTVLKCDHI LVH3NQKWE 1320 
FDRFEVIiRKK PGSLFAALMA TATSSLR 



A174 DBPV, oequence 
Gene name : 
Unigene number: 
Probeaet Accession #i 
Nucleic Acid Accession *s 
Coding sequences 



Hs. 128399 

AA9S3251 

AA983251 

1-1749 {under lined sequences correspond to start and atop codons) 



1 
I 

ATGCTGTCTG 
GGAAAGAAAC 
GACCGOGAGA 



CGCCTGCGTC 
TCCGGACGAC 
ACTCATAAGG 
GCTCCTGGAC 
CCGCGCGGAA 
CTTCTCGGAG 
TGTGGGGCGC 
TGCTGCTGGC 



11 
I 

GCTTCTTGAT 
TTCCGTGGGA 
GGGAGAGCCG 
AGAAGGGGAA 
CGCCAGCTGG 
CTGGACGTTC 
AGCCCCGGGG 
CAGTOCCTAA 
CTAGGGCCCG 
AGCGCCGCGG 
ACGGTCCTGC 
TCGCCGCTCG 
TGCGCTGCTG 



21 
I 

GAGTCCCAGT 
GGCTTCCATC 
GCCGGAGGCT 
CCGGQGCGAG 
GCAGGCTCCC 



GCCTTCTGAC 
GGGGACCOGG 
GCGTQGTCGC 
GACAGTCAGT 

TCCGTCTCCT 
GCGGCGGGGG 



31 
I 

ACCCAGCACA 
GGTGCGCACA 
GCCGGGCTCC 
CCGCCCGCCT 
GGGACTGCGG 
GTCCGGTTGC 
TGCATCCCGA 
CCACCGGCTG 
CTCCTGGGCG 
GACGAGGCCC 
GACGCGCTGT 
CATCCTGGAA 
CGAGGGCCCA 



41 
I 

GAGCACAGTA 
CCTCCCGAGG 
TGTGGGACCG 
GGATCCGCGC 
CTGGGGGCGC 
CAGTGAAACC 
GATTTCCATC 
AGGACGGGGA 



51 
I 

CACTCCCGGA 
GCGAQGCAGC 
CGCTGCAGCC 
CCAGCAGCAG 
GCAGGACCCT 
TCCAGAGGCT 
AGCGAGTGCA 
TGGCTTAGGA 



GGGGGTCGCC GGGGCCACGA 
CCGCGOCCAQ QGTGGTGCCA 
CGCCGCTTCG CTCCTGCAGC 
GCGGCGAGTA CTXSCCACGaC 



6D 
120 
180 
240 
3D0 
360 
420 
480 
540 
600 
660 
720 
7B0 



914 
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TGG CTGGACG CGCAGGGCGT 
GGOGACGCCA CCA.TC7TGCTG 
GCGCGCCTGG ACCAGGGCGG 
OGGGCGGACA AAGACGGGCC 
GGAGACBGGG AGGGTGCGCC 
TCCCCGAAAG GAAGGCAGCT 
CGCOTATTCC CATCXXCTCC 

axcxacgxgc cgttcctcat 

CTGGTGGCAG CCTGTTGCTG 
GOCCCAGGQQ GTAACOGCTT 
CGGGGGTCGX CCICACGCCA 
GGGGCCCGGG CGCCCCCAAC 
AACAACGTGT ATGTCAACAT 
CAGATTGXGC CACATCAAGG 
CACGACTCTG XGCOCAXGAC 
AGGCAGATTC AGXCCCCCTT 
ACTGTATAAC CGAGAGTCAC 
GATTCTCGAG GTGGAAGXCC 
GCTTCATTTG CCCCOAGACX 
CATCCAGGGT ATCAT TGATT 
TTGCTGATGG GTGTATAACA 
AAGAATTTTA TAAACTGATA 
XXGXXGTTGA CTGCACAGGA 
TTTTTTTTTT TTTTTTTTAA 
TGGTGCGATC TCGGCTCACT 
AGCCTCCCAC GTGGCTGGGA 
TGTAGAQATG GGGTTTCACC 
TCTGCCTGTC TCAGCCTOCC 
TGAQCCTTTT TTTTTTTCTA 
ATTCTAAAAG GAAACCTGTT 
GGCACAOCTT AATTTCATTG 
GGGCCTATXT XGIGCTXXXT 
TTAAAAAGAO ACTGAAATAA 
TGGGATGATT TTAATCTTGA 
T TGATTGT AT CTTAATTTTC 
TATATTGAAA TCATAAACTA 
TGGTTATGGT TTGGCGTTTC 
GAGTGCAAGr AATTAC TATA 
AAAAATTXTC TTGTCTTAGT 
TAGCTCCTCA GCCAXAACCT 
GCAQATCATA AGGCTTTTTA 
GATTGTTAAG AGAAAAGCTT 
ATTTCCTCTC TCTTTCAGGC 
CAAATTCAAG XGAATTTATT 
ATOTGCAAGT ACATTTCCAA 
GXATGGTAXG GTCTTCTAGA 
TGXTXCAXGA CTGAATAATG 
TTTTTTAATA ACCAAAGGCA 
TGAAAAAAAA AAAAAAAAAA 



CTGGCGCAXC GGCTTCCAGT GXCCCGAGCG CTTCGACGGC 840 

CGGCAQCTGC GCGXTGCGCT ACTGCTGCTC CAGCGCCGAG 9O0 

CXGCGACAAX GACCGCCAGC AGQGCGCXGG CGAGCCTGGC 960 

OCGACQGCTG GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 

CCCACCCGTG AGGGCCTGGC AGCGGTGCXC CCCTGAAGGC 1080 

CCTCAGGQCT TTCCCGGGGC TGCTGCCCCG XGCCAGACGC 1140 

ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCGC 1200 

TGTTGGCTCC QTGTTTQTCG CCTTTATCAT CfTGGQQTCC 1260 

CAGATGTCTC CGGCCTAAGC AGGATOCOCA GCAGAGCCGA 1320 

GATGGAGACC AXCCCCATGA TCCCCAGTGC CAGCACCTCC 13 B0 

GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGOCAACICC 1440 

AAGGTCACAG ACCAACTGTT GCTTGCCQGA AGGGACCATG 1500 

GCCCACGAAI TTCTCTGTGC TGAACXGXCA GCAGGCCACC 1S60 

GCAGTATCTG CAXCCCCCAX ACGTGGGGXA CACGGTGCAG 1620 

AGCTGXGCCA OCXTXCAXGG ACGGCCTGCA GCCTGGCTAC 1680 

CCCTCACACC AACAGTGAAC AQAAGATGTA CCCAGCGGTG 1740 

IGGTGGGTXC CTTTACTGAA GGGAGAGGAA GGCAGGGGTG 1800 

GCACATGTCG GTGGTATTTA TGGCACGATT CCXXXGGATG I860 

GXATGAAAAC ATCTCCGAAT TAGCATXICI GGAXATGXTT 1920 

TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTOTQATG I960 

AATGCTTGAG TCOGAAGTGC CCXTGAGATA TGGTTGACGA 2040 

AATTAAGGAT XXXTAXXATG TTGTTATTAT TATTTCTTTT 2100 

TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

TCAQACAGGG TCTTGCTCTG TTGCCCAGGC XGGAGTGCAG 2220 

GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

TTACAGGTGC CTGCCCCCAT GQCTAATTTT TTGTAIXTTT 2340 

ATGTTGGCXG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

TGAACTCTGT GAGAXCAGXC ATCAGTCTCA GTATTCCACA 25 B0 

TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 

TACCTTATGT AGAGATCTTA TTACAAAGXG AXIXICXACA 2700 

ATXGTAXAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

AGXAGTAGGX GGXATAGTCA TAAAACCATX CATCCCCXTC 2820 

TGGCTTTAAG GIGACATCTG AGAGGTAATG CATTCTTTTT 2BB0 

TCACCCGCTG CXTCTCXGAG TTACIXTXAA TITTGCCTXG 2940 

CTXCTGXTTG GTXTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

CXXGTAAATG AAGATCAGTA TTTCTGGCXA GATGTGATAA 3060 

TAXAAAAATT CAAAGAAATG TGXTACAAAG AXACXXAGXA 3120 

GAGACTXGGG AXOAAATTTA AACCAGATAC GATXTACTTI 3180 

TACTCTXGXT ATCAAAATGG CTTATTXXTC AGGGACXAAG 3240 

TTCAACGAAG GATTGCCTTT CTXCICCCAC ACTGTTCTTG 3300 

CXCAACAGGC ACTGXATXGA TTGCCAATGT XCCAAATTAT 3360 

TGTGTGXTCT T TACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

CTGCTAGCAC AACCAGTATT TXGTAATTAA ACAAAXCGCX 3480 

CAT T TATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 

TAAAACCAGX GTTGGCAATT GGTATCATCA ATGAXACTCA 3600 

GGGGAAAAXC ATTTTACTXA TTAAXAAAXA TXXXATGATG 3660 
AAAAAAAAAA 



A3. 7 5 Protein sequence 
Gene namei 
TJnigene number: 
Protein Accession. #: 
Signal sequence; 



Cellular localization: 



EBTe 

Hs»l2BB99 

none found 

l-ll 

402-424 

not determined 



1 
I 

NDuSGFLMSPS 



11 
I 

TQHHAQYTPG 
PPAKIRAQQQ 
CIPRFPSASA 



CCWIjRCWRPJG 
ARLDQGGCDK 
fiPKGRQLIiRA 
IiVAACCCSCL 
GAHAPPIJISQ 
HDSVPHTAVP 



RGPSGEYCHG 
DRQCjGAGEPG 
FPGLLPRARR 
RPKQDPQQSR 



21 
1 

GKKLFWBASI 
PRPPPAGQAP 
THKAVPKGTG 

ItLDAQGVHRI 
RADKDGPRRIi 
KGE*PQSPRGG 
APGGNRLMET 



31 
I 

GAHTSRGHGS 
GTAAGGAQDP 
PPABDGDGLG 
DALBAPRWP 



41 



GRASCLRGXQ 
PSPIiQRPAliP 
IPMIPSASXS 
PSVIUCQQAT 
PPMEGLQPGY RQIQSPFPHT WSBQKMYPAV 



RLRPGRSRGR 
APGPRA&RRfi 
CGAIAARPSP 
GDATICCGSC 
GDGEGAPPPV 
1YVPFI.1VGS 



SI 
1 

AJSUliVJDRAAA 
VRIiPVKPPBA 
LLGVAAEGSG 
EPGXPIiRSCS 
ALKYCCSSAE 
RAWQRCSFBG 
VFVAFIIliGS 



QIVPEOGQYL 
XV 



HPPYVGYTVQ 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 



A176 DNA SEQOEHCB 

Gene names ESTe, Weakly similar to CGHU7Ii collagen alpha l(iri) chain precursor [H. sapiens] 
Unigene numbers Hs , 15322 

Probeset Accession #: AA0BB458 
KUclaic Acid Accession #t AA088458 

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

l 1 I I ) l 

OCCCTTGGAC ACTGACfcTQG ACTOAAOGAG TAGAATGGAQ CACXJAGQACA CTGACATGGA 



60 



915 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CTGAAQAAAA 
GCaCGGGGCC 
CTGGGCCAGA 
CGGCTACTGC 
TGTGCCAGCC 
ACCTCACCCC 
CTCACCCAGG 
GCGCTCATTA 
GATTCCACCT 
AGCCCTTCGA 
GCCCAGGCAC 
GCCTGCCCOC 
ACATGGGCTG 
TGOACAGTGG 
GGTCCCATCT 
AGAGGGCGOG 
CAGGACGAGG 
GTAAGCGGGG 
CTGGCCAAGG 
GGCCTGCATG 
TTGCCCCACG 
GACAGCTCCC 
CTGGGGTCCT 
GAGAGGCCAC 
GGCAGGTCCC 
GGAGTXCGCA 
GAACCAGGGG 
TCAGTGTGTG 
CCCGATGCGG 
ACACTGTCCC 
CCTTCCGGAG 
TGCTGCACCT 
GCOCTCCTAC 
ACCTCCTGGG 
AGGTGGACTG 



TGGOQGATCC 
GGTGACTTCA 
OAGACAGGCT 
AAAGAAATAG 
CACGAGGGGA 
GCAGACCCTG 
QAGCAGCGTC 
GCGXGCACAC 
CAOAAGTGTC 
TTTTGTGTTG 
CTGQAATCCC 
CCCCATCTCT 
TCATAAACAC 
TAGACCCAQA 
AGAAATAAAA 



AGSAGCTGGA 
GCGACTGGTA 
GCAGAGCCAG 
CCAAGGTACA 
GGQCCCTGCC 
GGGTCTGGCA 
AGGTGACCGA 
AGCAGCTGTT 
TCATCTAGTC 
GGGTGGGCGC 



GGCTGGTCCC 
QGGGCTCTCT 
GGTACCCCTC 
TCAGQOAAAG 
GGGOGGCTCC 
TGGCTQTAGC 
GGTGCCTGCC 
CTGAGGGACC 
TGCCTCCCAC 
TTGAGTCCCA 
AGGCACGTCA 
GCTCACCCCC 
CTCCCTCAGC 
CTTGGGTQTC 
CTGGTGGGGG 
CACGGCAACA 
TQOGGCGCAG 
GGTCAGTGCG 
ACAAGGCACC 
CCCAGCrCCA 
GOTCTG CAGG 
CCTGAAGATG 
CAGGAAAGGG 
CAGOGCAGTG 
TGCCCACCAG 
TGGCATCTTT 
TCAGGAOACC 
GGCACCTCCG 
GTOCTCCCAG 
GAATTTAAAG 
CCTGGAGCCT 
CCTGGGCTCT 
TGTGATGACA 
CCCAGTTGAO 
ATCAAQTTCC 
AGCACTTGAG 
ACAAHAAAAA 
CACAAGGAAA 
TACTAGAATT 
GAGATTTCTG 



GCAGOAGAAG 
CCAGCAGCAG 
CGCCGACTTT 
AOAGGTGGCC 
CCCGTCCTCC 
GCAGCAGACC 
GAAGAGTGAG 
TGAGGCCCOC 
CTTGTGGGCC 
CCCATCGCAC 
TGGGCGOCTT 
CGCACCQAGC 
TGRGTCOGCA 
CA1GA GTTAG 
GCACTGCCCA 
GACGCGGGTC 
TCGGACQGAC 
TGGCTGOGQA 
CTGGCTGCAG 
AGACCCIQGG 
CACAACATCC 
TAGGCAAAGC 
CTTTGCTCTC 
CAAGGAAAAC 
ACTCDCTCAG 
GGCCCTGCTC 
GCATCGATQQ 
GGCCTCCGAT 
TGGGGGGOGC 
TGTCTCAGAG 
TGCTAACCTG 
GGTGTCCCAG 
GGAGTGGQCT 
TGCAGGTOCT 
GGXGGGCCAG 
GOCCTCCCCA 
ACTGGACTGG 
GCCCACATAG 
GAAAAACTGC 
TTTACAGCTT 
GOCCCGGCTG 
GCCCTAGGAC 
ATOCGCGAGG 
CCCGGAAATG 
AATCTGCCCC 
AAGGAAAAGG 
GCCAGGAGTT 
AAAAAGAAAG 
CAATACACTA 
ATCAGAGAGA 
GAAACATGAA 



GAGGTGCTGC 
CTGCAACGAG 
GGGGCTGCAG 
CGGTGCCTGG 
TCCGGGCCCC 
ATCCTCATGC 
CGCATCACGC 
GCCCTGAGCC 
GCGTGGGCCC 
CCACCCTCTC 
CCTGCCGCCC 
GCT TQACTCC 
TAGTCCGCAG 
CGTCCCCCCG 
CGCCAGGCTG 
CAAGGGCAGC 
GGAAGTAGAT 
GCCCCAGGGA 
CGGATCGGCA 
QTGATGGCCT 
TGTGAGCCTG 



ACGCCCAGCC 
GAGAACCCCC 
CCCCTGCCCA 
AGCGCAACeC 
GTTCTGC^GC 
GCGGGGTCAG 
AGGGCCCCCT 
GAGGGGCCCT 
CCCACAGCAA 
GACAGGCCCA 
TTCCAOGGQA 
GAGGGCCTGT 
TGGCAGCCAG 
CGTCTGCCTT 
AAGCAGGAOA 
AGCTGGACCC 
CTTTCAGCCT 
GAAATCAGGC 



GCTGGGCGGG 
TGCCAGTAGC 
TCTCAGGATG 
AGAGGAACAC 
AACATCTCAG 
CCAGAGCAGC 
AAAGAAAATG 
TGAGACCCAG 
ATA7AAAGTA 
AAAAAA 



TGCAGGGTTT 
TGCAGGAGCG 
GGAGCCCCCG 
GGGAGCTGCT 
CCTGCCCTGC 
TGAAOGAGCA 
AGCTGGAGCA 
AGCAGGACGG 
CCAGGGCCAG 
TGGCTGGAGA 
TTGCCAGATG 
GTTTKGGCTC 
CTACTACTGG 
TTTCCAGCGG 
CACTTCCAAC 

GGAGGGGGTG 
TAGCGGTCGG 
CGCCGGGTGG 
TCCCOCTCTT 
GCTCCCCAGG 
CGACTCAOGA 
TGTCCCCAGG 
AGGGTACAGG 
GGCCCACTCC 
GGAGGGTCCC 
CCAGGGCCCC 
TGCGTGGGGG 
CGTGTCCAGG 

CCCCACAGAG 
AGTCAGCCCA 
CATAAG3ATG 
GCCCCACAGC 
GGAQAAGCCC 
TGAGGGTGCC 
CAGAAC3VGTG 
OGCAGCTGAA 
TGGTGTTCCG 
TAGTGAGTGG 
GGTGGCTGGC 
TCAGTCTCCG 
GTGTGCAGGT 
1TGAAATGTG 
ACCCACACCA 
CGGGGCGTGG 
CTGGGCAACG 
AGAGATCCAG 
CAGAAGCAAC 
ACAGTGTTTT 



GGAGATGATG 
CCA5CGCCX3C 
CCCACTGGGG 
GGCTGCAGCC 
CCTGACGTCC 
QAACGGACTTC 
GGAGAAGTOG 
GGGACCTCTG 
CCTGGCACTC 
CCCCCGGCAG 
GGCTCCCCAG 
CTGGTTGYTG 
CCGCTGTCAG 
TGCCGCCCTG 
AACGGGCAGC 

GGGACGGCCT 
ACTTCAGGTT* 
GCGAGAGClT 
GGCCGGGACG 
AGGGCCCCCA 
TTTCCAAGGC 
TTTCAGCTGG 
AGGAOQCTGG 
CGCTGGTGCT 
AGTGTCACCA 
CGATGOGGGG 
GOGCAGGGGC 
GCACXTTGGT 
GTGGCAACTC 
CCACATTCCC 
GCATGCAGCT 
TCAGGCCTGG 
CCCAGCACCC 
CCCGTCAQCA 
TGCCATGCCC 



GCGGAAATGT 
TGCAAGGTGA 
CCCTGGAGAC 
AGAGGCACAT 
TGCAGGATGT 
ACATACACGT 
TCCTTGGGGQ 
GGCCTCAGGA 



CAGTGAGAGA 
GTTTAAAAAT 
AGATTGACTC 
ATATATCTAA 



12 0 
ISO 
240 
300 
360 
420 
460 
S4D 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1B6D 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2B20 
28B0 
2940 
3000 
3060 



A177 DMA SEQUENCE 
Gene name: 
Itaigene number: 
Probeeet Accession #i 
nucleic Acid Accession # = 
Coding sequences 



Cadherln 3, P-cadherin (placental) 

2s. 2877 

X63629 

X63629 

54-2543 (start and stop codons are underlined) 



1 
I 

OOGGAACACC 
TCCCTCGTGG 
CCTCCGAGCC 
CGGAGCAGGA 
CAGCTCTGTT 
AAAGAAGGTC 
GAAGACACAA 
CCTTCCCCCA 
ACAGCATCAC 
AGACAGGCTG 
TCTTTGGCOi. 
TCATCQTGAC 
GTGTCTTAGA 
ATGATGCCAT 
AGGACCCACA 
CCAOTGGCCT 
TGGAtCOGOGA 
ACAATGCTCC 
GCCATGAGGT 
GTBCXZACCTA 
CTGAGAGCAA 
AGCACACCCT 
CCACAGOGAC 



11 

1 

GGCCCGCCGT 
ACCTCTCGCG 
GTGCCGGGOG 
GCCCGGCCAG 
TAGCACTGAT 
ACTGAAGGAA 
QAGAGATTGG 
GAGACTGAAT 
G3GGCCGGGG 
GTTGTTGTTG 
CGCTGTGTCA 
OSACCAGAAT 
GGGAGTCCTA 
CTACACCTAC 
OGACCTCATG 
GGACCGGGAA 
OGGCTCCACC 
CATGTTTGAC 
GCAGAGGCTG 
OCTTATCATQ 
CXZAGGGCATC 
GTACGTTGAA 
CATAGTGGTC 



21 
1 

CGCBOCAGCT 
TCTCTCCTCC 
GTCTTCAOGG 
GCGCTGGGGA 
AATGATGACT 
AGQAATCCAT 
GTGGXTGCTC 
CRGCTCAAGT 
GCAGACAGCC 
AATAAGCCAC 
GAGAATGGTG 
GACCACAAGC 
CCAGQTACTT 
AATGGGGTGG 
TTCACAATTC 
AAAGTCCCTG 
ACCACGGCAG 
CCCCAGAAGT 
ACGGTCACTG 
GGOOQTGACa 
CTGACAACCA 
GTGACCAACG 



31 
I 

GCTTCAOCCC 
TTCTCCAGGT 
AGGCTGAAGT 
AAGTATTCAT 
TCACTCTGCG 
TGAAGATCTT 
CAATATCTOT 
CTAATAAAGA 
CCOCTGAGGG 
TGGACCGGGA 
CCTCAGTGGA 
CCAAGTTTAC 
CTGTGATGCA 
TTGCTTACTC 



AGTACACACT 
TGGCAGTAGT 
AOGAGGGCCA 
ATCTGGACGC 
AGGGGGAOCA 
GGAAGGGTTT 
AGGCCCCTTT 
ATGTGAATGA 



41 
I 

TCTCTCTGCA 
TTGCTGGCTG 
GACCJTGGAG 
GGGCTGOCCT 
GAATGGCGAG 
CCCATCCAAA 
OCCTT3AAAAT 
TAGAGACACC 
TGTCTTCGCT 
GGAGATTGCC 
GGACCCCATG 
CCAGGACACC 
GGTGACAGCC 
CATCCATAQC 
AGGCACCATC 
GACCATCCAG 
GGAGATCCTT 
TGTGCCTGAG 
OOCCAACTCA 
TTTIACCATC 
GGATTTTGAS 
TGTGCTGAAG 
GGCACCTGTG 



51 

I 

GCCATGGGGC 
CAGTGCGCGQ 
GCGGGAGGCG 
GGGCAAGAGC 
ACAGTCCAGG 
CGXATCTTAC 
GGCAAGGGTC 
AAGATTXTCT 
GTAGAGAAGG 
AAGTATGAGC 
AACATCTCCA 
ITCOGAGGGA 
ACAGATGAGG 
CAAORACC3VA 
AGOGTCATCT 
GCGACASAOV 
GATGCCAATG 
AA1GCAGTGG 
CCAGCGTGGC 
ACCACCCACC 



CTCCCAACCT 
TTTGTCCCAC 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
700 
840 
90O 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 B0 



916 



WO 03/042661 



PCT/US02/36810 



CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAQ 1500 

CAGGGTGGCT AGCCATGGAC CCAGACAGrG GGCAGGTCAC AQCTGTGGGC ACCCTCGACC 1560 

OTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 16 2 O 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCIGAG CCCCGTCAQA TCAGCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 

TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT SACACAGTGG 1860 

TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCQAC TGCCATGGCC 1980 

ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 

TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 

TCAAC3GAQCC CCTCCTACTC CCAGAAGATG ACACOCGTGA CAACGTCTTC TACTATGGCG 2160 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT AT3ACATCAC CCAGCTCCAC CGAGGTCTGG 2220 

AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 

TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 

AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 

AGGGCAGCGG CTCOGAOGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 

ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCaGaCaIGT 1 2S20 

ACGGTGGCGG GGAGGACGAC TAGGCQGCCT GCCTGCAOOG CTGGGGACCA AACGTCAGGC 258 0 

CACAGAGCAT CTCCAAGGGG TCTCAGfTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 

GGAAGTGGCC GTAGCAACTT GGCGGAQACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

•TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACX1TCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGGTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3080 

TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG QACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 



A178 Protein sequence: 
Qene names 
Unigene number: 
Probeset Access ion 
Protein Accession #: 
Signal sequences 
Transmembrane domain; 



Cadherin 3, 

He, 2 877 

X63629 

CAA45177 

1-24 

659-675 



P- cadherin 



atal> 



Cellular localization s 



plasma 



1 11 

I I 
MGbPRGPLAS LLLLQVCHIjQ 

QEPAiFSTDN ddftvrngbt 

KGPFPQRLNQ LKSNKDTtDTK 



21 
I 

CAASEPCRAV 



DEDDAIYTxN 
TDMDGDGS1T 
AWRATyiilMG 
PTGTATIWH 



VNDHGPVPEP 
TWLSLKKPI* 
GAVLALLFLL 
QUBARPEWIj 
DYEGSGSDAA 



GWAYSIESQ 
T^AVAWBIIJ? 
GDDGDHFTIT 
VEDVUEAPVF 
D3QQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLLLLVRKK 
RMDVAPTIIP 
SLSSLTSSAS 



IFYSITGPGA 
ISIXVTDCJTO 
BPKDPHDLMF 
AHEffJAPMFDP 
THPESNQGIIj 
VPPSKOTEVQ 
U3RBDBQFVR 
VRKVLN1TDK 



31 
I 

FREAEVTUEA 
NPLKIFPSKR 
DSPPEGVFAV 
ItKPKFTQDTF 
TIHRSTGTIS 
QKYEAHVPEN 
TTMCGLDFSA 
EGIPTGEPVC 
NNIYEVMVLA 
DLSFHTSPFQ 
TV1RATVCDC 



41 
I 

GGAEQEPGQA 
ILRRHKRDW 



RGSVLEGVX»P 
VI3SGLDRBK 
AVGKKVQRTjT 
KNQHTLYVEV 
VYTAKDPDKE 
MDNGSPPTTG 
AQLTDDSDrY 
HQHVBTCPGP 



RKIKEPLLJjP 
tphyrprpan 
DQDQDYDYUff 



FDEIGNFIIB ULKAANTDPT 
BWGSHFKKIiA DHYGGQBDD 



51 
I 

IiGKVFMGCPG 
VAPISVP2HG 
KPLDREEIAK 
GT9VMCFVTAT 
VPEYTLTIQA 
VTDLDAIWSP 
TNEAPFVLKL 

TGTLrLLTLID 
WTAEVUEEGD 
WKGGFIIiFVI* 
OUlDITQLIHR 
APPYDTUiVF 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A179 DMA SBQUBHCS 
Gene names 



Probe set Accession # : 
Nucleic Acid Accession $i 
Coding sequence: 



endothelial differentiation., lyaopboBphatiaic acid G-protein- coupled receptor , 7 

H9.25B5B3 

NM_012152 

WM~012152 

43-1104 (underlined sequences correspond, to start and stop codons} 



1 11 21 31 41 51 

1 i 1111 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA C AATGA ATGA GTGTCACTAT 60 

GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120 

GGAACAAAGC TTGTGATTGT TTTGTGXGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 180 

TCTAATTCTC TGGTCATGGC BGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 300 

ATOTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCGGTCAG 360 

□GGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420 

AGGCACATGT CAAXCATGAG GA1GCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 4 BO 

CTGCTCATTT TGCTTGTCTG GGCCATOQCC ATTTT TATGG OGGCGGTCCC CACACTGGGC 540 

TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CRGCAGGAGT 600 

TACCTTGTTT TCTGGACAGT OTCCAACCTC ATGGCX3TCC TCATCATGGT TGTGGTGTAC 660 

CTGCGGATCT AEGTCTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAOTGOG 720 

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780 



917 



WO 03/042661 



PCTAJS02/36810 



GCGTTTGTGG TATGCTGGAC COCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGflACTGC B40 

AGGCAGTGTG GCGTGCAGCA TGTGAAARGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TOTATQGCRC CATGAAGAAG 960 

ATOATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAOTATTAG CCAAGGTGCA 1080 

GTCTQCAATA AAAGCACTTC CTAAA CTCTG GATGCCTCTC GGOCCACCCA GGTGATGACT 1140 
GTCTTAGG 



A180 Protein, sequence ; 



Unigene number: 
Probes et Accession tt; 
Protein Accession # = 
signal sequence s 
Transmembrane domains? 
Cellular localization; 



endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor. 

He -258583 

NM_0l2152 

NP_0362B4 

none found 

31-53, 66-BB, 150-172, 190-211, 239-261, 277-295 
plasma membrane 



1 11 21 31 41 51 

I I I II 1 

MNBCHYDKHM DFFYNRSNTD TVpDWTGTXL, VIVLCVGTFF CLFZFFSNSL VIAAV1KNRK 60 
FHFPFYYLIA JODAAADPFAG lAYVFLMFNT GFVfiKTLTVN RWFLRQGLLD SSLTASDTNIi 120 
LVIAVKRRMS tMSMRVHSNli TKKKVTLIiIl. UVWAIA1FMG AVPTLGWNCL CNI SACS SLA 180 
PIYSRSYLVF HTVSNLMAFL IMVWYLRIY VYVKRKTNVL SPBTSGSISR HRTPMKLMICr 240 
VMTVLGAFW CWTPGIATVLL IiDGLNCRQCG VQHVXRWFLL LAMJISVVNP IIYSYKDEDM 300 
YGT MKKM2 C C FSQENPERRP SRIPSTVLSR SDTGSQYIKD SISQGAVCNK STS 

A181 PHA 



Gene name; estb 
"Unigene mimben EB, 162859 

Frobeeet Accession tt: AA569531 
Nucleic Acid Accession #: AA569533 

Coding sequence* 1-504 (underlined sequences correspond to start and stop codons) 

l 11 21 31 41 51 

I 1 I I i I 

ATGACCTACA GTTACTCATT TTTCAGGCCT GftGTTGATOG TTAATCATCT TAATTATGTT 60 

CATTCTGAAG CCAACAGGAG AACCAAGACC AAAACTTTAT TGTCTCTGCT TTCATTTCTT 120 

GATGAAACCT CTGGACTAAG CACACATCTT CCTTGTTTAT CTCTCTCAAA GGAQTGTGGA 180 

GTGCTTCATC T0GACATCCA CGGGAAGAAG GAAGACATGA GAATCACCCA ACAGTCTTCC 24D 

CAGCTATACC TGTGGGACAT GGGTGGTTTT ACAATATTTA AGAACCTGTG GATGAGCCTC 300 

ATACCCAOAG GGAACAAACG CTCCCCAAAA AGAGTTACAG AAACCATCCT GAGAGATTTT 360 

AAGCAGAAGC AAAGTTCAAA GATCCAAGAG GAGAGACGAA GAGAGTCTGC AGGACCAAAC 420 

CTCTCTTCAT TCTGGTTTGr GGGGAATGCT GGAAGAGGAG ACAGGCCCCA GATTTGOGCA 480 

GGAAGTAAAC AGTTTTCAGG OTQAGGCCAA TCTGAGCAGG AACATTCCAA TATTTCTTCA 540 

GCTACGTTGT CCCAGCACTT CACTGGTTAA CCTTTTATGT CCACCATTTG TGGATTTCAC 600 

AGCTACTTGT CAATGGTGAA TAT TGAT CAT CATCATTATC TACTGAGCTG CTACCATATC 660 

CCAGCTACTC CXTGCATGTT GTTCATTATT TTCTCAACAC TCAGCATATT TGCAATATGT 720 

TATQTAATAT CACAGACAAG GAAACTQAAC GCAGAAATGT TTTATTTCTT GCCAAACATC 730 

ACATGAGGAT GAACAATGAA ACCGATTTGA AACCAGGATT GTCTGATTCC AACATCTCTD 840 
CACTCTGATA TGCTOCAATT AAAAAGCCAT TTCTAAGACT GT 



A182 Protein sequence: 
Gene name: 
Unlgone number i 
Probeset Accession #: 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains z 
Cellular Localization: 



Hs. 162859 
AAS69S31 
none found 
1-46 

none found 
not determined 



1 11 21 31 41 51 

1 1 I I I 1 

MTYSYSFFRP ELrVHHLKYV HSEANRRTKT KTItf*3IiIiSFIi DETSGLSTHL PCLSLSKECG 
VLHLD1HGXK EDMRITQQSS QLYLWDMGGF TIFKMLWMSL IPSGNKRSPK RVTETJIiRDF 



60 
120 



A183 UNA SEQUENCE 
Gene names BSTs 
Unigene number: Hb. 179809 

Probeset Accession ft: N95796 
Nucleic Acid Accession fr: XM_D50197 

Coding sequence: 310-1971 (underlined sequences co: 

41 



r respond to start and stop codons) 



1 11 21 31 41 51 

I 1 I I 1 1 

TCACACGTQC CAAGGGGCTG GCTCAGCGGA AOCAGCCTGC ACGCGCTGGC TCCGGGTGAC 60 

AGCCGCGCGC CTCGGCCAGG ATCTGAGTGA TGAGACGTGT CCCCACTGAG GTGCCCCACA 120 

GCAGCAGGTG TTGAGCATGG GCTGAGAAGC TGGACCGGCA CCAAAGGGCT GGCAGAAATG 180 

GGCGCCTGGC TGATTCCTAG GCAGTTGGCG GCAGCAAGGA GGAGAOGC03 CAGCTTCTOG 240 

AGCAGAGCCG AGACGAAGCA GTTCTGGAGT GCCTGAACGG CCCCCTGAGC CCTACCCGCC 300 

918 



WO 03/042661 



TGGCCCACT A TGGTCCAGAG GCTGTGGGTG AGCCGCCTGC TGCGGCACCG GAAAGCCCAG 360 

CTCTTGCTGG TCAACCTGCT AACCTTTGGC CTGGAOGTGT GTTTGQCGGC AGGCATCACC 420 

TATGTGCCGC CTCTGCTGCT GGAAGTGGGG GTAGAGGAGA AGTTCATGAC CATGGTDCTG 480 

GGCATTGGTC CAGTGCTGGG CCTGGTCTGT GTCCCGCTCC TAGGCTCAGC CAGTGACCAC S40 

TGSCGTGGAC GCTATGGCCG CCGCCGGCCC TTCATCTGQG CACTGTCCTT GGGCATCCTG 600 

CTGAGCCTCT TTCTCATCCC AAGGGCCGGC TGGCTAGCAG GGCTGCTGTG CCCGGATCCC 660 

AGGCCCCTGG AGCTGOCACT GCTCAXCCTG OGCGTGGGGC TGCTGGACTT CTGTGGCCAG 720 

GTQTG CTTCA CTCCACTGGA GGCCCTGCTC TCTGACCTCT TCCGGGACCC GGACCACTGT 780 

CGCCAGGCCT ACTCTGTCTA TGCCFTCATG ATCftGTCTTG GGGGCTGCCT GGGCTACCTC 640 

CTGCCTGCCA TTGACTGGGA CACCAGTGCC CTGGCCCCCT ACCTGGGCAC CCAGGAGGAG 900 

TGCCTCTTTG GCCTGCTCAC CCTCATCTTC CTCACCTGCG TAGCRGCCAC ACTGCTGGTG 960 

GCTGAGGAGG CAGCGCTGGG CCCCACCGAG CCAGCAGAAG GGCTGTCGGC CCCCTCCTTG 1020 

TCGCCCCACT GCTGTCCATG CCGGGCCCGC TTGGCTTTCC GGAACCTGGG CGCCCTGCTT 1O80 

CCCCGGCTGC ACCAGCTGTG CTGCCGCATG CCCCGCACCC TGCGCCGGCT CTTCGTGGCT 1140 

GAGCTGTGCA GCTGGATGGC ACTCATGACC TTCACGCTGT TTTACACGGA TTTCGTGGGC 1200 

GAGGGGCTGT ACCAGGGCGT GCCCAGAGCT GAGCCGGGCA CCGAGGCCCG GAGACACTAT 1260 

GATGAAGGCG TTOQGATGGG CAGCCTGGGG CTGTTCCTGC AGTGCGCCAT .CTCCCTGGTC 1320 

TTCTCTCTGG TCATGGACCG GCTGGTGCAG CGATTCGGCA CTCGAGCAGT CTATTTGGCC 1380 

AGTGTGGCM CTTTCCCTGT GGCTGCOGGT QCCACATGCC TGTCCCACAG TGTGGCCGTG 1440 

GTGACAGCTT CAGCCGCCCT CACCGGGTTC ACCTTCTCAG CCCTGCAGAT CCTGCCCTAC ISOO 

ACACTGGCCT COCTCTACCA CCGGGAGAAG CAGGTGTTCC TGCCCAAATA CCGAGGGGAC 1560 

ACTGGAGGTG CTAGCAGTGA GGACAGOCTG ATGACCAGCT TCCTGCCAGG CCCTAAGCCT 1620 

GGAGCTCCCT TCOCTAATGG ACACGTGGGT GCTGGAGGCA GTGGCCTGCT CCCACCTCCA lfiflO 

CCCGCGCTCT GCGGGGCCTC TGOCTGTGAT GTCTCCOTAC GTGTGGTtSGT GGGTGAGCCC 1740 

ACCGAGGCCA GGGTGGT TCC GGGCCGGGGC ATCTGCCTGG ACCTCGCCAT CCTGGATAGT 1B0O 

GCCTTCCTGG TGTCCCAGGT GGCCCCATCC CTOTTTATGG GCTCCATTGT CCAGCTCAGC 1860 

CAGTCTGTCA CTGCCTATAT GGTGTCTGCC GCAGGCCTGG GTCTGOTCGC CATTTACTTT 1920 

GCTAGACAGG TAGTATTTGA CAAGAGCGAC TTGGCCAAAT ACTCAGC GTA GA AAACTTOC 1980 

AGCACATTGG GGTGGAGGGC C1GCCTCACT GGGTCCCAGC TCCCCGCTCC TGTTAGCCDC 2040 

ATGGGGCTGC CGGGCTGGCC GCCAGTTTCT GTTGCTGCCA AAGTAATGTG GCTCPCTGCT 2100 

GOCACCCTGT GCTGCTGAGG TGCGTAGCTG CACAGCTGGG GGCTGGGGOS TCCCTCTCCT 2160 

CTCTCCCCAG TCTCTAOGGC TGCCTGACTG GAGGCCTTCC AAGGGGGTTT CAGTCTGGAC 2220 

TTATACAGGG AGGCGAGAAG GGCTOCATGC ACTGGAATGC GGGGACTCTG CAGGTGGATT 2280 

ACCCAGGCTC AGQGTTAACA GCTAGCCTCC TAGTTGAGAC ACACCTAGAG AAGGGTTTTT 234 0 

GGGAGCTGAA TAAACTCAGT CACCTGGTTT CCCATCTCTA AGCCCCTTAA CCTGCAGCTT 2400 

CGTTTAATGT AGCTCTTGCA TGGGAGTTTC TAGGATGAAA CACTCCTCCA TGGGATTTGA 2460 

ACATATGAAA GTTATTTGTA GGGGAAGAGT CCTGAGGGGC AACACACAAG AACCAGGTCC 2520 

CCTCRGCCCC ACAGGCACTG GTCTTTTTTG CTNGANTCCA CCCCCCCCCT CTTTACCCTT 2580 



A18 4 Protein sequence: 

Gene name: ~ BSre 

Wiigene number: Ha -179809 

Proneset Accession #i 6195796 

Protein Accession #• XP_050197 

Signal sequence: none 

Tranflraenibrane domains: 21-43, 53-75, 90-112, 125-147, 160-1B2, 199-221* 323-345, 350-372, 

Cellular Localization: plasma membrane 



1 11 21 31 41 51 

I 1 I I I I 

HVQHIiWVSRL LRHRKAQLIi VHU.TFGLEV CIiAAGlTYVF PLLLEVGVEE KFMTHVLGIG 60 

FVT..GI.VCVPL LGSASDHWHG RYGRRRPFIW ALSLGILLrSfc FMPRAGWLA GLLCPDPRPD 120 

SLALLUGVG LLDFCGQVCF TPX.EA1J.5DIj FHDFDHCRQA YSVYAFMIBL GGCLGYLLPA 180 

IDWOTSAIAP YLGTQBECDF GIxT.iTLIFL.TC VAATLIA/ABB AAIiGFTEFAE GLSAPSLSPH 240 

CCPCRAHIAF BNLGALLPfclt RQLCCRMFRT I^KLFVAELC SWMALMTFTIj FTIDFVGEGXi 300 

YQGVPRAKPG TSAKRHYDEQ VRMGSLGLFIi QCAISLVFSIf VMDRIATQRFG TRAVYLABVA 3 60 

AFPVAAGATC LSHSVAWTA SAALTGFTFS ALQILPYTLA SLTfHREKQVF IiPKYRGDTGG 420 

ASSED9LHT3 PLiPGPKPGAP FPNGHVGAGG SGliliFPPPAli CGASACDVSV RVWGEPTEA 480 

RWPGRGICL DLAILDSAFL LSQVAPSLFM GSIVQLSQSV TAYMV3AAGI* GLVAXYFATQ 540 
WPDKSDLAK Y9A 

A185 DMA SBQDBBICE 
Gene namat ESTs 
tJhigene number i Hs. 11260 

Probeeet Accession #: R73640 
Nucleic Acid Accession AK002126 

Coding sequence; 1-1593 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 I I I 1 I 

ATflQ TTCGCC GGGGGCTGCT TGCGTGGATT TCCCOGGTGG TGGTTXTGCT GGTGCTCCXC 60 

TGCTGTGCTA TCrCTGTCCT GTACATGTTG GCCTGCACCC CAAAAGGTGA CGAGGAGCAG 120 

CTGGCACTGC CCAGGGCCAA CAGCCCCACG GGGAAGGAGG GQTACCAGGC CGTCCrTCAG 180 

GAGTGGGAGG AGCAGCAOCG CAACTACGTG AGCAGCCTGA AGCGGCWJAT CGCACAGCTC 240 

AAGGAGGAGC TGCAGGAGAG GAGTGAGCAQ CTCAGGAATG GGCAGTACCA AGGCAGCGAT 300 

GCTGCTGGCC TGGGTCTGGA CAGGAGCCCC OCAGAGAAAA CCCAGGCCGA CCTCCTGGCC 360 

TTCCTGCACT CGCAGGTGGA CAAGGCAGAG GTGAATGCTG GCGtCAAGCT GGCCACAGAG 420 

TATGCAGCAG TGCCTTTCGA TAGCTTTACT CTACAGAAGG TOTACCAGCT GGAGACTGGC 400 

CTXACCCGCC ACCOCGAGGA GAAGCCTGTG AGGAAGGACA AGCGGGATGA GTTGGTGGAA 540 

GCCATTGAAx CftGCCTTGGA GACCCTGAAC AATCCTGCAG AGAACAGCCC CAATCACCGT 600 



919 



WO 03/042661 



PCT/US02/36810 



CCTTACACGG CCTCTGATTT CATAGAAGOQ ATCTACCGAA CAGAAAGGGA CAAAGGGACA 660 

TTGTATGAGC TCACCTTCAA AGGGGACCAC AAACACGAAT TCAAACGGCT CATCTTATTT 720 

CGACCATTCG GCCCCATCAT GAAAGTGAAA AATGAAAAGC TCAACATGGC CAACACGCTT 7 BO 

ATCAATGTTA TCOTGCCTCT AGCAAAAAGG GTGGACAAGT TCCGGCAGTT CATGCAGAAT 64 0 

TTCAGGGAGA TGTGCATTGA GCAGGATGGG AGAGTCCATC TCACTGTTGT TTACTTTGGG 900 

AAAGAAGAAA TAAATGAAGT CAAAGGAATA CTTGAAAACA CTTCCAAAGC TGCCAACTTC 960 

AGGAACTTTA CCTTCATCCA GCTGAATGGA GAATTTTCTC GGGGAAAGGG ACTTGATGTT 1020 

GGAGCCOGCT TCTGGAAGGG AAGCAAGGTC CTTCTCTTTT TCTGTGATGT GOACATCTAC 1080 

TTCACA1CTG AATTCCTCAA TACGTGTAGG CTGAATACAC AGCCAGGGAA GAAGGTATTT 1140 

TATCCAGTTC TTTTCAGTCA GTACAATCCT GGCATAATAT ACGGCCACCA TGATGCAGTC 1200 

CCTCCCTTGG AACAGCAGCT GGTCATAAAG AAGGAAACTG GATTTTGGAG AGACTTTGGA 1260 

TTTGGGATGA CGTGT CAGT A TCGGTCAGAC TTCATCAATA TAGGTGGGTT TGATCTGGAC 1320 

ATCAAAGGCT GGGGCGGAGA GGATGTGCAC CTTTATCGCA AOTATCTCCA CAGCAACCTC 1380 

ATAGTGGTAC GGACGCCTGT GCGAGGACTC TTCCACCTCT GGCATGAGAA GCGCTGCATG 1440 

GACGAGCTGA CCCCCGAGCA GTACAAGATG TGCATGCAGT GCAAGGCCAX GAACGAGGCA 1500 

TCCCACGGCC AGCTGGGCAT GCTGGTGTTC AGGCACGAGA TAGAGGCTCA CCTTCGCAAA 1S60 
CAGAAACAGA AGACAAOTAG CAAAAAAACA TgA 



A3, a 6 Protein sequence; 

Gene name; ESTs 

Unigene number: Hs.112 50 

Probeset Accession #t R73640 

Protein Accession ft: KP_0 60841 

Signal sequences 1-2 6 

Trans Tuenibr-ane domains: none found 

Cellular Localisations not determined 

1 11 21 3L 41 51 

I 1 I i I l 

HVKRGLXiAWI SRVWLLVLL CCAISVLYMIi ACTPKGDEEQ LAkPHANSPT GKEGYQAVLQ 60 

EHEEQHEKYV SSIiKRQIAQIt KBELQBR8BQ LHNGQfTOASD AAGLGLDRSP PEKTQADI.I1A 120 

FLH8QVDKAE VMAGVKLATE fcAAVPFDSFT LQKVYQLETG I/TRHPEEKPV RKDKRDKLrVK 180 

AIE9ALBTLN WFAEN9PNHR PYTASDFIEG IYRTERDKGT LiYELT FKGDH KHEFKRliILF 240 

RPFGPIMKVK NEKLNMANTL INVIVPLAKR VDKFRQFMQN FREMCIEQDG RVHLTWYFG 30 0 

KEEIUEVKGI LEMTSKAANF RNFTF3QLNG EFS RGKGLiDV GARFWKGSNV LLFPCDVDIY 360 

FTSEPLWTCR LNTQPGKKVF YPVLFSQYNP GIIYGHHDAV PPLEQQLVIK KETGFWRDFG 420 

FGMTCQYRSD FINIGGFBIJ> 1KGWGGEDVH IiYRKlflJaSIIIi IWRTPVRGla FHLWHEKRCM 480 
DELTPSQYKM CMQSKAMNEA SHGQLGMLVF EHEIEAHLRK QKQKTSSKKT 

A197 DMA SBQUEKGB 

Gene name: ATPase, Ca++ transporting, type 2C, member 1 

Unigene number* Hs, 10677a 

Probeset Accession $t N51919 
Nucleic Acid Accession #1 AF1B9723 

Coding sequence: 1-2712 (underlined sequences correspond to start and stop cooons) 



1 11 21 31 41 51 

111111 

ATGATTCCTG TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTGA AGTTGCAAGC 60 

ATTCTCCAAG CTGATCXTCA GAATGGTCTA AACAAATGTG AAGTTAGTCA TAGGCGAGCC 120 

TTTCATGGCT GGAATGAGTT TGATATTAGT QAAGATGAOG CACTGTBGAA GAAGTATATT 2.80 

TCrCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCAGTCAT CAGTGTTTTA 240 

ATGCAT CAGT TTGATGATGC OGTCAGTAIPC ACTGTGGCAA TACTTATCGT TGTTAjCAGTT 300 

GCCTTTGTTC AGGAATATOG TTCAGAAAAA TCTCTTGAAG AATTGAGTAA ACTTGTGCCA 360 

CCAGAATGOC ATTGTGTGOG TGAAGGAAAA TTGGAGCATA CACfTGCCOG AGACTTGGTT 420 

CCAGGTGATA CAGTTTBCCT TTCTGTTGGG GATAGAGTTC CTGCTGACTT ACGCTTGTTT 480 

GAGGCTGTGG ATCTTTCCAT TGATGAGTCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 540 

AAGGTGACAG CTCCTCAGCC AGCTGCAACT AATGOAGATC TTQCATCGAG AAOTAACATT GOO 

GCCTTTATGG GAACACTGGT CAGATGTGGC AAAGCAAAGG GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAGAAGAGGC ACCAAAAACC 720 

CCTCTGCAQA AGAGCATGGA CCTCTTAGQA AAACAACTTT CCTTTTACTC CTTTGGTATA 780 

ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCC1GGA AATGTTTACT 840 

ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG 900 

ACGCTAGCTC TTGGTGTTAT GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 960 

ATTGTTGAAA CTCTGGGCTG CTGTAATGTG ATTTGTTCAG ATAAAACTGG AACACTGACG 1020 

AAGAATGAAA TGACTGTTAC TCACATATTT ACTTCAGATG QTCMCATGC TGAGGTTACT 1080 

GGAGTTGGCT ATAATCAATT TGGGGAAGTG ATTGTTGATG GTGATGTTGT TCATGGATTC 1140 

TATAACCCAG CTGTTAGCAG AATTGTTGAG GCGGGCTGTG TGTGCAAXGA TGCTGTAATT 1200 

AGAAACAATA CTCTAATGGG GAAGCCAACA GAAGGGGCCT TAATTGCTCT TGCAATGAAG 1260 

ATGGGTCTTG AK3GACTTCA ACAAQACTAC ATCAGAAAAG CTGAATACCC TTTTAGCTCT 1320 

GAGCAAAAGT GGATGGCTGT TAAGTGTGTA CAOCGAACAC AGCAGGACAG ACCAGAGATT 1380 

TGTTTTATGA AAGGTGCTTA CGAACAAGTA ATTAAGTACT GTACTACATA CCAGAGCAAA 1440 

GGGCAGACCT TGACACTTAC TCAGCAGCAG AGAGATGTGT ACCAACAAGA GAAGGCACGC 1500 

ATGGGCTCAQ CGGGACTCAG AGTTCTTGCT TTGGCTTCTG GTCCTGAACT GGGACAGCTG 1560 

ACATTTCTTG GCTTGGTGGG AATCATTGAT CCACCTAGAA CXGGTGTGAA AGAAGCTGTT 1620 

ACAACACTCA TTGGCTCAGG AGTATCAATA AAAATGATTA CTGGAGATTC ACAQQAGACT 1680 

GCAGTTGCAA TCGCCAGTCG TCTGGGATTG TATTCCAAAA CTTCCCAGTC AGTCTCAGGA 1740 

GAAGAAATAG ATGCAATGQA TGTTCAGCAG CTTTCACAAA TAOTACCAAA GGTTGCAGTA 1800 

TTTTACAGAG CTAGCCCAAG GCACAAGATQ AAAATTATTA AGTCGCTACA QAAGAACGGT I860 

TCAGTTGTAG CCATGACAGG AGATGGAGTA AATGATGCAQ TTGCTCTGAA GGCTGCAGAC 1920 

AT1GGAGTTG CGATGGGCCA GACTGGTACA GATGTTTGCA AAGAGGCAGC AGACATGATC 1980 

CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGGGTAA AGGGATTTAT 2040 
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AATAACATTA AAAATTTCGT TAGATTCCAG CTGAGCACGA GTATAGCAGC ATTAACTTTA 2100 

ATCTCATTGG CTACATTAAT GAACTITCCT AATOCTCTCA ATGCCATGCA GATTTTGTGG 2160 

ATCAATATTA TTATGGATGG ACCCCCAGCT CAGAGCCTTG GAGTAGAACC AGTGGATAAA 2220 

GATOTCATTC GTAAACCTCC TCGCAACTGG AAAGACAGCA TTTTGACTAA AAACTTGATA 2280 

CTTAAAATAC TTGTTTCATC AATAATCATT GTTTGTGGGA CTTTGTTTGT CTTCIGGOGT 2340 

GAGCTACGAG ACAATGTGAT TACACCTCGA GACACAACAA TGACCTTCAC ATGCTTTGTG 2400 

TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATCCCfiGA CCAAGTCTGT GTTTGAGATT 24G0 

GGACTCTGCA GTAATAGAAT GTTTTGCTAT GCAGTTCTTG GATCCATCAT GGGACAATTA 2S20 

CTAGTTATTT ACTTTCCTCC GCTTCAGAAG GTTTTTCAGA CTGAQAGCCT AAGCATACttS 2580 

GATCTGTTQT TTCTTTTGGG TCTCACCTCA TCAGTGTGCA TAGTGGCAGA AATTATAAAG 26 40 

AAGGTTGAAA GGAGCAGGGA AAAGATCCAG AAGCA.TGTTA GTTOGACATC ATCATCTTTT 2700 
CTTGAAGTAT OA 



A1B8 Protein 



sequence ; 



Gene name: 
Ilnigone number: 
Proneset Accession ft: 
Protein Accession, fr: 
Signal sequence i 
Transmembrane domains : 
Pf am domains: 
£744-889] 

Cellular Localization f 



ATPase, Ca++ transporting, type 2C, member 1 

HS. 106778 

N51919 

AAF27613 

none found 

€0-82, 86-1DB, 252-274, 282-304, 757-779, 827-849, 856-878 

Cation ATPhbs n 16-83], E1-E2 ATPase (89-324] , Hydrolase f32B~649] , Cation ATPase C 
not determined 



1 11 21 

1 I t 

MIPVXTSKXA SELPVSEVAS ILQADDQWGIi 
SQFKNPLIML LLA9AVXSVI, MHQFDDAVS1 
PECHCVREGK LEHTLARDLV PGDTVCIiSVG 
KVTAPQPAAT NGDIiASRSNI AFMGTIiVRCG 
PLQK8MDLIX3 KQLSFYSFGI IG2IMDVGWL 
TliALGVMRMV KKHAXVKKIiP IVETLGCOJV 
GVGYHQFGEV XVDGDWHGF YNPAVSEIVE 
MGXJDQLQQDY 1RKAKYPFS3 EQKWMAVKCV 
GQTLTLTQQQ RDVYQQEKAR MGSAQEiRVIA 
TTLIA3GVSI KMITGEEQET AVAIASRLGL 
FYRASPJlHKH KIIKSLQKNG SWAMTGD0V 
L.VDDDFQTIM SAIEEGKGIY NHIKNFVRFQ 
IJSIIMDGPPA QSLGVBPVDK BVIRKFFRNW 
ELRDNVITPR DTTWTFTCPV FFUMENALSS 
LVrxFPPLQK VFQTESiSIIi DUiFLLGLTS 
IiEV 



31 41 51 

i ) 1 

NKCEVSHRHA FHGWNEFDIS EDEPLWKKYI 60 

TVAILIWTV AFVQEYRSEK SLEELSKLVP 120 

DRVPADLRLF EAVD1*SIDES SLTGETTPCS 180 

KAKGWIGTG ENSBFGEVFK MMQAEEAPKT 240 

LGKDILEMFT ISVSIAVAAJ PEGItPIWTV SOD 

1C3DKTGTLT KNEMTVTEIP TSDGIjHAEVT 360 

AGCVCHDAVI RUNTLMGKPT EGALIAtAMK 420 

ERTQQDHPEJ CFMKGAYEQV IKYCTTYQSX 480 

XASGPELGQli TFLGLVGIID PPRTGVKEAV S40 

YfiKTSQSVSG EEIDAMDVQQ L8QIVPKVAV 600 

NDAVALKAAD IGVAMGQTGT TJVCKEAADMT 660 

LSTSIAAMT, ISIATLMNFP MPLNAMQILW 720 

KDSILTKMLI LKXIiVSSIII VCGTBFVPWR 780 

RSQTKSVPEI GLCSHHMFCY AVLGSIMGQL 84 0 

SVCIVABIIK KVBRSREKIQ KHVBSTSSSF 90 0 



A189 DNA SEQUENCE 

Gene name: ESTs 

Unigena nutiber* Hb. 293185 

Probeset Accession fti N62096 

Kudeic Acid Accession #; N6209G 

Coding sequences 1-1284 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

i I I I i I 

ATGGGCTACC AGAGQCAGGA GCCTGTCATC CCGCCGCAGA GAGGATTGCC TTATTCAATG 60 

AAGCAAGCTG GGTTTCCTTT QGGAATATTG CTTTTATTCT GGGTTTCATA TGTTACAGAC 120 

TTTTCCCXTG TTTTATTGAT AAAAGGAGGG GGCCTCTCTG GAACAGATAC CTACCAGTCT 180 

TTGGTCAATA AAACTTTCGG CTTTCCAGGG TATCTGCTCC TCTCXSTTCT TCAGTTTTTG 240 

TATCCTTTTA TAGCAATGAT AAGTTAjCAAT ATAATAGCTG GAGATACTTT GAGCAAAGTT 308 

TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AAO&TGTTTA TTGGTCGCCA CTTCATTATT 360 

GGACTTTCCA CAGTTACCTT TACTCTGCCT TTATCCTTOT ACCGAAATAT AGCAAAGCTT 42 0 

GGAAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 480 

AGGGCAATTT CACTGGGTCC ACACATACCA AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 540 

CCC AATGCCA T TCAAGCGGT CGGGGITATG TCTTTTGCAT TJA TTTOC CR CCATAACTCC 600 

TTCrrTAGTTT ACAGTTCTCT AQAAGAACCC ACAGTAGCTA AGTGGTCCCG CCTtATCCAT 660 

ATOTCCATCG TGATTTCTGT ATTTATCrGT ATATXCTTTG CTACATGTGG ATACTTGACA 720 

TTTACIGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATGA TGACCTGGTA 780 

ACATTTGGAA GATTTTGTTA TGGTGTCACT GTCATTTTGA CATACCCJAT GGAATGCTTT 840 

GTGACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GGTTTTCCAC 900 

ATTGTTGIAA CAGTGATGGT CATCACTGTA GCCACGCTTG TGTCATTGCT GATTGATTGC 560 

CrCQGGATAG TTCTAGAACT CAATGGTGTG CTCTOTGCAA CTCCCCTCAT TITTATCATT 1020 

OCATCAGCCT GTTATCTGAA ACTGTCTQAA QAACCAAGGA CACACTCCQA TAAGATTATG 1080 

TC27TGTGTCA TGCTTCCCAT TGGTGCTGTG GTGATGGTTT TTGGATTCGT CATGGCEATT 1140 

ACAAATACTC AAGACTGCAC CCATOGGCAG GAAATGTTCT ACTGCTTTCC TGACAATTTC 1200 

TCTCTCACAA ATACCTCAGA GTCTCATGTT CAGCAGACAA CACAACTTTC TACTTTAAAT 1260 
ATTAGTATCT TTCAACTOGA GTAA 



A19Q Protein sequence; 

Gene name: " estb 

Unigene number: Hs. 2 93 18 5 

Probeset Accession #: N62096 

Protein Accession 4: none found 

Signal sequence ; none found 

transmembrane domains 1 28-50, 6B-88-112-134, 142-164, 217-239, 260-282, 298-320, 327-348, 359-381 
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Cellular Localization: plasma membrane 

1 11 21 31 41 51 

1 I i 1 I 1 

MGYQRQBPVI PPQRGLPYSM KQAGFPLGII, DLFWVStfVTD FSLVLLIKGG ALSGTDTYQS 60 
LVNKTFGFPG TTLKLSVIiQFTi YPFIAMISYN IIAGDTLSKV FQRIPGVDPE NVFIGRHFII 120 
GLSTVTFTLP LSLYRNIAKL GKVSBISTGL TTLILG1VMA KAISLGPHIP KTBDAWVPAK 180 
PNAIQAVGVM SFAFICEHNS FLVYSSIaEHP TVAKW8RLIH HSIVISVFIC IFFATCQYLT 240 
FTGFTQGDLF ENYCRNDDLV TFGRFCYGVT VILTYPMECF VTREVIANVF FGGNLSSVFH 300 
IWTVMVITV ATLVSLTjIDC UGIVLBtiNHV IiCATPLIFII PSACYLKLSE EPRTHSDKIM 360 
SCVMLPIGAV VMVFGFVMAI TNTQDCTHGQ EMFYCFFDNF 9L.TWTSESHV QQTTQLSTLN 42 0 
ISIFQZiB 

A191 ON A SEQUENCE 
G-one name: EBTs 
Unigene number: Hh. 2 93 IS 5 

Probaset Accession #: N62D96 
Nucleic Acid Accession Jh N62096 

Coding sequence: 1-1203 (underlined sequences correspond to start and stop codons) 

l 11 21 31 41 51 

111)11 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGT TTTGCCTTGT TTTATTGATA 60 

AAAGGAGGGG CCCTCTCTGG AACAGATACC TACCAGTCTT TGGTCAATAA AACTTTCGGC 120 

TTTCCAGGGT ATCTGCTCCT CTCTGTTGTT CAGlTTTTGT ATCCTTTTAT AGCAATGATA 180 

AGTTACAATA TAATAGCTGG AGATACTTTG AGCAAAGTTT TTCAAAOAAT CCCAGGAGTT 240 

GATCCTOAAA ACGTGTTTAT TGGTCGCCAC TTCATTATTG GACTTTCCAC AGTTACCTTT 300 

ACTCTGCCTT rATCCTTGTA CCGAAATATA GCAAAGCTTG GAAAGGTCTC CCTCATCTCT 360 

ACAOOTTTAA CAACTCTGAT TCTTGGAATT GTAATGGCAA GGGCAATTTC ACTGGGTCCA 420 

CACAIACCAA AAACAGAAGA GGCTTGGGTA TTTGCAAAGC CCAATGCCAT TCAAGCGQTC 480 

GGGGTTATGT CTTTTQCATT TATTTGCCAC CATAACTCCT TCTTAGTTTA CAGTTCTCTA 540 

GAAGAACCCA CAGTAGCTAA GTGGTCCCGC CTTATCCATA TGTCCATCGT GATTTCTGTA 600 

TTTATCTGTA TATTCTTTGC TACATGTGGA TACTTGACAT TTACTGGCTT CACOCAAGGG 660 

GACTTATTTG AAAATTACTG CAGAAATGAT GACCTGGTAA CATTTGGAAG ATTTTGTTAT 720 

GGTQTCACTG TCATTTTGAC ATACCCTATG GAATGCTTTG TGACAAGAGA GGTAATTGCC 780 

AATGTGTTTT TTGGTGGGAA TCTTTCATCG GTTTTCCACA TTGTTGTAAC AGTGATGGTC 840 

ATCACTGTAG CCACGCTTGT GTCATTGCTG ATTGATTGCC TCGGGATAGT TCTAGAACTC 300 

AATGGTGTGC TCTGTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCTG TTATCTGAAA 960 

CTGTCTGAAG AACCAAGGAC ACACTCCGAT AAGATTATGT CTTGTGTCAT GCTTCCCATT 1020 

GGTG CTGTGG TGAXGGTTTT TGGATTGGTC ATGGCTATTA CAAATACTCA ACACTGCACC 1080 

CATGGGCWK3 AAATGTTCTA CTGCXTTCCT GACAATTTCT CTCTCACAAA TACCTCAGAG 1140 

TCTCATGTTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT TCAACTCGAG 1200 
TAA 

A192 Protein sequence: 
Gene names ESTs 
TJnigene number: Hs. 293185 

Probeset Accession #: H62096 
Protein. Accession #x none Cound 

Signal sequence* 1-26 

Transmembrane domains: 45-61, 92-108, 117 -132 , 191-207, 274-290, 297-313, 335-351 

Cellular Localizations plasma membrane 

1 11 21 31 41 51 

I I I 1 I 1 

KGYQRQEPV3 PPQFSKVIiIiI KGQAIiSGTDT YQSLWKTFG FPGYUiLSVZ. CFliYFFIAMI 60 

SYNIIAGDTli BKVFQRIPGV DPENVFIGRH FIIGLSTVTF TLPLSLTRNX AKLGKVSIiIS 120 

TGiTTLILGI VMARAISLGP HIPKTEDAWV FAKPNAIQAV GVHBFAFICH HEJSFLVY88L 160 

EEPTVAKWSR LIHM8TVISV FICIFFATCG YLTFTGFTQG DLFEMYCEND DIiVTPGRFCY 240 

BVTVIKTYPM ECWTREVIA NVPFGGNLSS VFHIWTVMV ITVATIjVSM, IDCLGIVLBL 300 

NGVLCATPItl FXIP6AGYLK LSEEPRT3SD XIMSCVMIsPI GAWMVFGFV MAITHTQDCT 360 
HGQEMFYCFP ENPSI/TNTSE SHVQQTTQL»S TliNlSIFQLiE 

A193 DMA SEQPHWCE 

Gens name: — — - ebts 

Unigene numbers Hs. 29 3 185 

Probe set Accession tts N62096 

Nucleic Acid Accession ft: N62 096 

Coding sequence i 1-1140 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

\ I t I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGQCTTT 60 

CCAGGGTATC TGCTCCTCTC TGTTCTTCftJG TTTTTGTATC CTTTTATAGC AATGATAAGT 120 

TACAATATAA TAGCTGGAGA TACTTTGAGC AAAGTTTTTC AAAGAATCCC AGGAGTTOAT ISO 

CCTGAAAACB TGTTTATTGG TOGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 240 

CTGCCTTTAT CCTTOTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 300 

GQTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT GGGTQCACAC 360 

ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA AGCGGTCGGG 420 



922 



WO 03/042661 



PCT/US02/36810 



10 



GTTATGTCTT 
GAACCCACAG 
ATCTGTATAT 
TTATTTGAAA 
GTCACTGTCA 
GTGTTTrTTG 
ACTGTAGCCA 
GGTGTGCTCT 
TCTGAAGAAC 
GcrrCJTGGTGA 
GGGCAGGAAA 
CATGTTCAGC 



TTGCAT TTAT 
TAGCTAAGTG 
TCTTTOCTAC 
ATTACTGCAG 
TTTTGACATA 
GTGGGAATCT 



GTGCAACTCC 
CAAGGACACA 
TGGTTTTTGG 
TGTTCTACTG 
AGACAACACA 



TTGCCACCAT 



ATGTGGATAC 
AAATGATGAC 
CGCTATGGAA 
TTCATCGGTT 
ATTGCTGATT 
CCTCATT T X T 
CTCCGATAAG 
ATTCGTCATG 
CTTTCCTGAC 
ACTTTCTACT 



AACTCCTTCT 
ATCCATATGT 
TTGACATTTA 
CTGGTAACAT 
TGCTTTGTGA 
TTCCACATTG 
GATTGCCTCG 
ATCATTCCAT 
ATTATGTCTT 
GCTATTACAA 
AATTTCTCTC 
TTAAATATTA 



TAGTTTACAG 
CCATCGTGAT 
CTGGCTTCAC 
TTGGAAGATT 
CAAGAGAGGT 
TTGTAACAGT 
GQATAGTTCT 
CAGCCTGTTA 
GTGTCATGCT 
ATACTCAAGA 
TCACAAATAC 
GTATCTTTCA 



TTCTCTAGAA 
TTCTGTATTT 
CCAAGGGGAC 
TTGTTATGGT 
AATTGCCAAT 
GATGGTCATC 
AGAACTCAAT 
TCTGAAACTG 
TCCCATTGGT 
CTGCACCCAT 
CTCAGAGTCT 
ACTCGAGTAA 



480 
540 
600 
660 
720 
7 BO 
840 
900 
960 
1020 

ioao 



15 
20 



A194 Protein sequence: 
Gene name: 
Uhigene number: 
Probesat Accession #: 
Protein Accession #i 
Signal sequence: 
Transmembrane domains ; 
Cellular Local izat ion? 



Hs, 293185 
NG209S 

none found 
none found 

24-40, 70-86. 95-111, 171-186, 253-269, 27G-292, 314-330 
plasma membrane 



25 
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35 
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MGYQRQBPVI 
PENVFIGSHF 
IPKTEDAWVP 
ICIPPATCGY 
VFPGGNBSSV 



HVQQTTQLST 



11 
I 

PPQVNKTFGP 
IIOLSTVTFT 
AKPHAIQAVG 
I/TPTGPTQQD 
FHIWTVMVI 
IMSCVMIiPIG 
UfflSIFOLE 



21 
I 

PGYLTiliSVLQ 
LPfcSLYKNIA 
VMSFAFICHH 
LFENYCRKDD 
TVATLVSLLI 
AWMVFGFVM 



A195 DMA SEQUBKCSi 

Gene name: 
thigene numbers 
Probe set Accession 

Nucleic Acid Accession ftt 

Coding sequences 



31 41 51 

1 I 1 

FLYPFIAMIS YNIIAGDTLS K7PQRIPOVD 60 

KLGKVSLIST GliTTLILGIV MARA1SLGPH 120 

NSFLVYSSLE EPTVAKWSRI* IHMSIVISVF 180 

LVTFGRFCYG VTVII/TYPMB CFVTHEVIAN 240 

DCLGIVLELSr GVLCATPLIF IIPSACYIiKL 300 

AITNTQDCTH GQEHFYCPPD MF6LTNTSBS 360 



Hs. 293185 

■N62096 

K62096 

1-1389 {underlined sequences correspond to start and stop codone) 



1 
i 

ATGGGCTACC 
ACCCTTGTTT 
GTTGTCAACT 
GGGTTT CCTT 
GTTTTATTGA 
AAAACTTTCG 
ATAGCAATGA 
ATCCCAGGAG 
ACAGTTACCT 
TCCCrCATCT 
TCAC7FGGGTC 
ATTCAAGCGG 
TACAGTTCTC 
GTGATTTCTG 
TTCACCCAAG 
AGATTTTGTT 
GAQGTAATTG 
ACAGTGATGG 
GTTCTAGAAC 
TGTTATCTGA 
ATGCTTCCCA 
CAAGACTGCA 
AATACCTCA3 
TTTCAATGA 



11 
I 

AGAGGCAGGA 
CTGAACftTGA 
CGATTATAGG 
TGGGAATATT 
TAAAAGGAGG 
GCTTTCCAGG 
TAAGTTACAA 
TTGATCCTGA 
TTACTCTGCC 
CTACAGGTTT 
CACACATACC 
TCGGGGTTAT 
TAGAAOAACC 
TATTTATCTG 
GGGACTTATT 
ATGGTGTCAC 
CCAATGTGTT 
TCATCACTGT 
TCAATGGTGT 
AACTGrCTGA 
TTGGTGCTGT 
CCCATGGGCA 
AGTCTCATGT 



21 
1 

GCCTGTCATC 
GTATAAAGAG 
ATCTGGTATA 
GCT T TT ATT C 
GGCCCTCTCT 
GTATCTGCTC 
TATAATAGCT 
AAACGTGTTT 
TTTATCCTTG 
AACAACTCTG 
AAAAACAGAA 
GTCTTTTGCA 
CACAGTAGCT 
TATATTCTTT 
TGAAAATTAG 
TGTCATTTTG 
TTTTGGTGGG 
AGCCACGCTT 
GCTCTGTGCA 
AGAACCAAGG 
GGTGATGGTT 
GGAAATGTTC 
TCAGCAGACA 



31 
I 

GCGCCGCAGA 
AAAACCTGTC 
ATAGGATTGC 
TGGGTTTCAT 
GGAACAGATA 
CTCTCTGTTC 
GGAGATACTT 
ATTGGTCGCC 
TACCGAAATA 
ATTCTTGGAA 
GACGCTTGGG 
TTTATTTGCC 
AAGTGGTCCC 
GCTACATGTG 
TGCAGAAATG 
ACATACCCTA 
AATCTTTCAT 
GTGTCATTGC 
ACTCCCCTCA 
ACACACTCCX3 
TTTGGATTCG 
TACTGCTTTC 
ACACAACITT 



41 
1 

GAGATTTAGA 
AGTCTGCTGC 
CTTATTCAAT 
ATGTTACAGA 
CCTACCAGTC 
TTCAGTTTTT 
TGAGCAAAGr 
ACTTCAT TAT 
TAGCAAAGCT 
TTGTAATGGC 
TATTTGCAAA 
ACCATAACTC 
GCCTTATCCA 
GATACTTGAC 
ATQACCTOOT 
TGGAATGCTT 
CGGTTTTCCA 
TGATTGATTG 
TTTTTATCAT 
ATAAGATTAT 
TCATGGCTAT 
CTGACAATTT 



51 
1 

TGACAGAGAA 
TCTTTTTAAT 
QAAGCAAGCT 
CTTTTCCCTT 
TTTGQTCAAT 
GTATCCTXTT 
TITTCAAAGA 
TGGACXTTCC 
TGGAAAGGTC 
AAGGGCAATT 
GCCCAATGCC 
CTTCTTAGTT 
TATGTCCRTC 
ATTTACTGGC 
AACATTTGGA 



CATTGTTGTA 
CCTCGGGATA 
TCCATCAGCC 
GTCTTGTGTC 
TACAAATACT 
CTCTCTCACA 
TATTAGTATC 



60 
120 

iao 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



75 



80 



A196 Protein sequence* 
Gene names 
Unigenc number: 
Probe Ee t Accession # : 
Protein Accession #s 
Signal sequence : 
Transmembrane domains i 
Cellular Localization: 



BSTs 

HS. 293185 
N62096 
none found 
none found 

37-53, 66-82, 109-125, 155-172, 180-196, 255-271, 338-354, 361-377, 399-415 
plasma membrane 



1 11 21 31 41 51 

111111 

MGYQRQEPVI PPQRDLDmffi TLVSEHEYKB KTCQSAALEH WN&IIGSGX IGLPYSMKQA 

923 



WO 03/042661 



PCT/US02/36810 



GFPIiGIXJjIjF 
IAMISYNIIA 
S3JISTGLTTL 
YSS1±EEPTVA 
RFCYGVTVIL 
VliEtNGVfcCA 
QDCTHGQHMF 



WVSYVTDFSL 
GDTbSKVFQR 
ILGIVMARAI 
KWSRLTHMSI 
TYPHECFVTR 
TPLIFIIPSA 
YCFPDMFgt/T 



VLLIKGGAJbS 
IPGVDPENVF 
SLGPHIPKTE 
V2SVFICIFF 
EVIANVFFGG 
CYLKLSEEPR 
UTSESHVQQT 



GTDTYQSLVN 
IGRHFIIGIiS 
DAWVFAKPNA 
AT CGYliTFTG 
HLSSVFHIW 
THSOKIMSCV 
TQLSTLiNTSI 



KTFGFPGYLL 
TVTFTIiPIjSIi 
IQAVGVMSFA 
FTQGDIiFEWY 
TVMVITVATL 
MLPIGAWMV 
FQ 



LSVLOFLYPF 120 

YRNIAKIxGKV 180 

FICHHNSFLV 240 

CRNDDLVTFQ 300 

VSLLIDCLGI 3 60 

FGFVMAlTNT 42 0 



JU.97 DNA SEQUENCE 
Gene name i 
Unigene number: 
Pxrobeset Accession #x 
Nucleic Acid Accession #: 
Coding sequence: 



ESTe 

Hs, 31608 

H1B836 

HM_017636 

1-3501 (underlined sequences correspond to start and stop cottons) 



11 
1 



21 
I 



31 



41 



51 
I 



ATGGAGGATG CCTTCGGGGC AGCCGIGGTG ACCGTGTGGG ACAGCGATGC ACACACCACG 60 

GAGAAGCCCA CCGATGCCTA CGGAGAGCTG GACTTCACGG GGGCCGGCCG GAAGCACAGC 120 

AATTTCGTCC GGCTCTCTGA COGAACGGAT CCAGCTGCAG TTTATAGTCT GGTCACACGC 180 

ACATGGGGCT TCOGT0CCCC OAACCTGGTG GTGTCAGTGC TGGGGGGATC GGGGGGCCCC 240 

GTCCTCCAGA CCTGGCTGCA GGACCTGCTG CGTCGTGGGC TGGTQCGGGC TGCCCAGAGC 300 

ACAGGAGCCT OOATTGTCAC TGGGGGTCTG CACACGGGCA TCGGOCGGCA TGTTGGTGTG 360 

GCTGTACGGG ACCATCAGAT GGCCAGCACT GGGGGCACCA AGGTGGTGGC CATGGGTGTG 420 

GCCCCCTGGG GTGTGGTCOG GAATAGAGAC ACCCTCATCA ACCCCAAGGG CTCGTTCCCT 480 

GCGAGGTACC GGTGGCGOGG TGACCCGGAG OACGGGGTCC AGTTTCCCCT GGACTACAAC 540 

TACTCGGCCT TCTTCCtGGT GGACGACGGC ACACACGGCT GCCTGGGGGG CGAGAACCGC 600 

TTCCGCTTGC GCCTGGAGTC CTACATCTCA. CAGCAGAA3A CGGGCGTGGG AGGGACTGGA 660 

ATTGACATOC CTGTCCTGCT CCTCCTGATT GATGGTGATG AGAAGATGTT GACGOGAATA 720 

GAGAAOGCCA CCCAGGCTCA GCTCCCATGT CTCCTCGTGG CTGGCTCAGG GGGAGCTGCG 7 B0 

GACTGCCTGa CGGAGACCCT GGAAGACACT GTGGCCCCAG GGAGTGGGGG AGCCAGGCAA B40 

GGCGAAGCCC GAGATCGAAT CAGGCGTTTC TTTCCCAAAG GGGACCTTGA GGTCCTGCAG 900 

GCCCAGGTGG AGftGGATTAT GACCCQGAAG GAGCTCCTGA CAGTCTATTC TTCTGAGGAT 960 

GGGTCTGAGG AATTCGAGAC CATAGTTTT0 AAGGCCCTTG TGAAGGCCTG TGGGAGCTCG 1020 

GAGGCCTCAG OCTACCTGGA TGAGCTGCGT TTGGCTGTGG CTTGGAAOOG CGTGOACATT 1080 

GCCCAGAGTG AACTCTTTCG GGGGGACATC CAATGQCGGT CCTTCCATCT CGAAGCTTOC 1140 

CTCATGGACG CCCTGCTGAA TGACCGGCCT GAGTTCGTGC GCTTGCTCAT TTCCCAOGOC 1200 

CTCAGCCTGG GCCRCTTCCT GACCCOGATG CGCCTGGCCC AACTCTACAG CGCGGOGCCC 1260 

TCCAACTGGC TCATCCGCAA CCTTTTGGAC CAGGOGTCCC ACAGCGCAGG CAOCAAAGCC 1320 

CCAGCCCTAA AAGGGGGAGC TGOGGAGCTC CGGOCCCCTG ACGTGGGGCA TGTGCTGAGG 1380 

ATGCTGCTOa GGAAGATQTG CGCGCCGAGG TACCCCTCCG GGGGCGCCTG GGACCCTCAC 1440 

CCAGGCCAGG GCTTCGGGGA GAGCATGTAT CTGCTCTCGG ACAAGGCCAC CTCGCeGCTC 1500 

TOGCTGGATG CTGGCCTCGG GCAGGCCCCC TGGAGCGACC TGCTTCTTTG GGCACTGTTG 1560 

CTGAACAGGG CACAGATGGC CATGTACTTC TGGGAGATGG GTTCCAATGC AGTTTCCTCA 1620 

GCTCTTQGGa CCTGTTTGCT GCTCCGGGTG ATGGCACGOC TGGAGCCTGA CGCTGAGGAG 1680 

GCAGCACGGA GGAAAGACCT GGCGTTCAAG TTTGAGGGGA TGGGOGTTGA CCTCTTTGGC 1740 

GAGTGCTATC GCAGCAGTGA GGTGAGGGCT GCCCGCCTOC TCCMCGTCG CTGCCOGCTC 1800 

TGGGGGGATG CCACTTGCCT CCAGCTGGCC ATGCAAGCTG ACGCCCGTQC CTTCTTTGCC I860 

CAGGATGGGG TACAQTCTCT GCTGACACAG AAGTGGTGGG GAGATATGGC CAGCACTACA 1920 

CCCATCTGGG COCTGGTTCT OGCCTTClTl TGGCCTCCAC TCATCTACAC CCGCCTCATC 1980 

AOCTTCAGOA AATCAGAAGA GGAGCCCACA OGGGAGGAGC TAGAGTTTGA CATGGATAGT 2040 

GTCATTAATG GGGAAGGGCC TGTCGGGACG GOGGACCCAG CCGAGAAGAC GCCGCTGGGG 2100 

GTGCCGCGCC AGTDGGGCCG TCCGGGTTGC TGCGGGGGCC GGTGCGGGGG GCGCCGGTGC 2160 

CTAOGCCGCT GGTTCCACTT CTGGGGCGGG CGGGTGAGCA TCTTCATGGG CAACGTGGTC 2220 

AGCTACCTGC TGTTCCTGCT GCTTTTCTC3 CGGQTGCTGC TCG1GGATTT OCAGCOGGCG 2280 

CCGCCCGGCT CCCTGGAGCT GCTGCTCTAT TTCTGGGCTT TCACQCTGCT GTGCGAGGAA 2340 

CTGCGCCAGG GCCTGAGCQG AGGCGGGGGC AGCCTCGCCA GCGGGGQCCC CGGGCCTGGC 2400 

CATGCCTCAC TGAGCCAGCG CCTGCGCCTC TACCTCGCCG ACAGCTG6AA CCAGIGCGAC 2460 

CTAGTGGCTC TCACCTGCTT CCTCCTGGGC GTGGGCTGCC GGCTGACCCC GGGTTTGTAC 2520 

CACCTGGGCC GCACTGTCCT CTCCATCGAC TTCATGGTTT TCACGGTGCG GCTGCTTCAC 2580 

ATCTTCACG6 TCAACAAACA GCTGGGGCOC AAGATCGTCA TCGTGAGCAA GATGATGAAG 2640 

GACGTGTTCT TCTTOCTCTT CTTCCTOGGC GTGTGGCTGG TAGCCTATGG CGTGGCCACG 2700 

GAGGGGCTCC TGAGGCCACG GGACAGTGAC TTCCCAAGTA TCCTGOGOCG OOTCTTCTAC 27fiO 

CGTCCCTACC TGCAGATCTT CGGGCAGATT CCCCAGGAGG ACATGGAOGT GGCCCTCATG 2820 

GAGCACAGCA ACTGCTCGTC GGAGCCCGGC TTCTGGGCAC AGCCTGCTGG GGCCCAGGCG 2880 

GGCACCTGCG TCTCCCAGTA TGCCAACTQG CTGGTGGTGC TGCTCCTCGT CATCTTCCTG 2940 

CICGIGGCGA ACATCCTGCT GGTCAACTTG CTCATTGCCA TGTTCAGTTA CACATTCGGC7 3000 

AAAGTACAGG GCAACAGCGA TCTCTACTGG AAG9CGCAGC GTTACCGCCT CATCGGGGAA 3060 

TTGCACTCTC GGCCC3GCGCT GGCCCCGCCC TTTATCGTCA TCTCCCACTT GOGCCTCCT3 3120 

CTCAGGCAAT TGTGCAGGCG ACCCOGGAGC CCCCAGCCGT CCTCCCCGGC CCTCGAGCAT 3180 

TTCCGGGTTT ACCTTTCTAA GGAAGCCGAG CGGAAGCTGC TAACGTGGGA ATCGGTGCAT 3240 

AAGGAGAACT TTCTGCTQGC ACGCGCXAGG GACAAGCGGG AGAGCGACTC OGAGGGTCTa 3300 

AAGCGCACGT CCCAGAAiGGT GGACTTGGCA CTGAAACAGC TGGGACACAT CCGCGAGTAC 3360 

GAACAGCGCC TGAAAGTGCT GGAGCGGGAG GTCCAGCAGT GTAGCCGCGT CCTGGGGTGG 3420 

GTGGCCDAGG OCCTGAGOCG CTCTGCCTTG CTGCCCCCAG GTGGGCCGCC ACCCCCTGAC 3490 
CTGCCTGGGT CCAAAGACTG A 



A19B Protein sequence: 

Gene name: ests 

tfnigene numbers Hs. 31608 

Probeset Accession ttr m.8836 

Protein Accession #= none found 



924 



WO 03/042661 



PCT/US02/36810 



5 



15 
20 
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30 



35 



40 



45 



50 



70 



80 



Signal sequence? none found 

Transmembrane doming: 214-230, 537-556, 642-662, 730-752, 760-782, 81S-B37, B42-BS4, 877-899, 973-995 

Cellular Localization: plasma membrane 



1 11 21 31 41 51 

S I I I I ) 

MEDAPGAAW TVWDSDAHTT EKPTDAYGEIj DFTGAGRKHS NFLRLSDRTD PAAVYSLVTR 60 

TWGFRAPNLV VSVLGGSGGP VLQTWLQDLL RRGLVRAAQS TGAWIVTGGL KTGIGRHVGV 120 

10 AVRDKQMAST GGTKWAMGV APWGWRNRD TLINFKGSFP ARYRWRGDPE POVQFPLDTN 180 

YSAFFLVDDQ THGCLGGENR FRLRLESYIS QQKXGVGGTG JTMPVLLLLI DGD£KML,TRl 240 

ENATQAQLPC LLVAG SGGAA DCLAETLEDT LAPGSGGARQ GEARDRIRRF FPKGDLEVLQ 3 DO 

AQVERIMTRK ELLTVYSSED QSKEFETIVL KALVKACGSS EASAYU>E1>R LAVAWNRVDI 360 

AQ8ELFRGDI QWRfiFHLEAS LMDALLNDRP EFVRLLISHG I^SLGHFLTPM RLAQLYSAAP 420 

SMSLIRHLliD QASHSAGTKA PALKGGAAKti RPPDVGHVLR MLLGKMCAPR YFSGGAWDPB 480 

PGOGFGESMY LLSDKATSPL SLDAGLGQAP WSDLLLWALL LNRAQMAMtYF WEMGSNAVSB 540 

ALGACLLLRV MARLEPDAEE AARRKDLAFX FEGMGVDLFG ECYRSSEVRA AET.T.T.RRCPL 600 

WGDATCLQLA MQADARAFFA QDGVQSLLTQ KWHGDMA5TT PIWALVLAFF CPPLIYTRLI 660 

TFRKSBBEPT REELEPDMDS VIHQEGPVGT ADPAEKTPLG VPRQSGRPGC GGGRGGGRRC 720 

LRRWFHFWGA PVTIFMGNW SYLLFLLLFS KVLLTOFQPA PPQST.KT.T.T.Y FWAFTLLCEE 780 

LRQGL9GGGG SIASGGFGPG HASLSQRLRL YLADSMNQCD LVALTCFLLG VGCRLTPGLY 840 

KLGRTVLCID FMVFTVRLLH IFTVNKQLGP KXVIVSKMMK DVFFFLFFLG VWLVAYGVAT 900 

HGLIjRPRDSD FPSIKRRVFY RPYLQXFGQI PQEDMDVALM ERSNCSSEPG FWAHPPGAQA 960 

GTCVSQYANW LWLLLVIFL LVANI LLV2JL LIAMFSYTFG KVQGNSDLYW KAQRYRLIRE 1020 

FHSRPALAFF FIVISHURLL 1iRQIiCKRPR& PQPSSPALEH FEVYLSKEAB RKLLTWESVH 1080 

KENFLLARAR EKRESDSERL KRTSQKVD1A LKQLGHIREY EQRLKVLERB VQQCSRVLGW 1140 
VABAX4SRSAI1 LPPGGPPPPD LPGSXD 



A199 DMA SEQUENCE 
Gene name: ESTb 
Unigene number? HS.40B0B 
Probeset Accession W~ AA054237 
Hucleic Acid Accession #* AA054237 

Coding sequences 1-B94 (underlined sequences correspond to start and atop codonu) 

1 11 21 31 41 51 

I I I I I 1 

ATGGAGCCGC GGGCGCTCGT CACGGCGCTC AGCCTCGGCC TCAGCCTGTG CTCCCTGGGG 60 

CTGCTCGTCA CGGCCATCTT CACCGACCAC TGGTAG3AGA CCGACCCCCG GCGCCACAAG 120 

GAGAGCTGCG AGCGCAGCCG CGCGGGCGCC GAGCCCCGGG ACCAGAAGAA CCGCCTGATG 180 

CCGCTGTCGC ACCTGCCGCT GCGGGACTCG CCCOCQCTGG GGOGOCGGCT GCTCCCGGGC 240 

GGCCCGGGGC GCGCCGACCC CGAGTCCTGG CGCTOHCTCC TGGGGCTCGG CGGGCTGGAC 300 

GCCGAGTGCG QCCOGCCCCT CTTCGCCACC TACTCGGGOC TCTGGAGGAA GTGCTACTTC 360 

CTGGGCATCG ACCGGGACAT CGACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCACG 420 

GCCATCAAGT ACCACTTTTC TCAGCCCATC CGCTTGCGAA ACATTCCTTT TAATTTAACC 480 

AAGACCATAC AGCAAGATGA GTGGCACCTG CTTCATTTAA GAAGAATCAC TGCTGGCTTC 540 

CIC3GGCATGG OCGTAGCCGT CCTTCTCTGC GGCTGCATTG TGGCCACAGT CAGTTTCTTC 600 

TGGGAGGAGA GCTTGACCCA GCACGTGGCT GGACTCCTGT TGCTCATGAC AGGGATATTT 660 

TGCACCATTT COCTCTGTAC TTATGCCGCC AGTATCTCGT ATGATTTGAA CCGGCTCCCA 720 

AAGCTAATTT ATAGCCTGCC TGCTGATGTG GAACATGOTT ACAGCTGGTC CATCTTTTGC 780 

GCCTGGTGCA GTTTAGGCTT TATTG1GGCA GCTGGAGGTC TCTGCATCGC TTATCCGTTT 840 
ATTAGCCGGA CCAAGATTGC ACAGCTAAAG TCTGGCAGAG ACTCCACGGT ATGA 



55 A200 Protein, sequence; 



Unigene mimbert Hs. 40808 

Probes et Accession #s AAOS4237 

Protein Accession frs none found 

OU Signal sequencer 1-18 

Transmembrane domains: 179-201, 209-231, 257-279 

Cellular Localization: plasma membrane 

^_ 1 11 21 31 41 51 

65 1 1 1 1 I ) 

' MEPRALVTAL SLGL9LC6LG LLVIAIFTDH WYETDPRRHK ESCERSRABA DPPDQKNRIjM 60 

PLSHLPIiRDS PPLGRRLLPG GPGRADPESW RSLLGLGGLD AfiCGRPtiFAT YSGLWRKCYF 120 

LGIURDIDTL ILBGIAQRCT AlKYHFSQPI RLRNIPFNLT KT1QOBEWHL LHLRRITAGF 180 

LGMAVAVLLC GCIVATVSFF WEESLTQHVA GLLFLMTGIF CTIfiLCTYAA SISYDLNRLP 240 
KL1YSLPADV EHGYSWS1FC AWCSLGFIVA AGGLCIAYPF I8RTKIAQLK SGHDSTV 



A2Q1 DNA SEQUENCE 
Gene namet CDAJ.4 
75 Unlgene numbers Eg .26813 

Probe set Accession #= W32 912 
nucleic Acid Aeeeeeion #s NM_016570 

Coding eequence: 1- 1134 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I 1 I 1 

ATQA GGCGAC TGAATCGGAA AAAAACTTTA AGTTTGGTAA AAGAGTTGGA TGCCTTTCCG 60 
AAGGTTCCTG AGAGCTATGT AGAGACTTCA GCCAGTGGAG GTACAGTTTC TCTAATAGCA 120 

925 



WO 03/042661 



TTTACAACTA TGGL" 1 TTATT AACCATAATG GAATTCTCAG TATATCAAGA TACATGOATQ ISO 

AAGTATGAAT ACGAAGTAGA CAAGGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 240 

ACTGTTGOCA TGAAGTGTCA ATATGTTGGA GCGGATGTAT TGGATTTAGC AGAAACAATG 300 

GTTGCATCTG CAGATGGTTT AGTTTATGAA CCAACAGTAT TTGATCTTTC ACCACAGCAG 360 

AAAGAGTGGC AGAGOATGCT GCAGCTGATT CAGAGTAGGC TACAAGAAGA GCATTCACTT 420 

CAAGATGTGA TATTTAAAAQ TGCTTTTAAA AGTACATCAA CAGCXCTTCC ACCAAQAGAA 480 

GATGATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TGTCAATAAA S40 

GTAGCAGGGA ATTTTCACAT AACAGTGGGC AAGGCAATTC CACATCCTCG TGGTCATGCA 600 

CATTTGGCAG CACTTGTCAA CCATGAATCT TACAATTTTT CTCATAGAAT AGATCATTTQ £60 

TCTTTTGGAG AGCTTGTTOC AGCAATTATT AATCCTTTAG ATGGAACTGA AAAAATTGCT 720 

ATAGATCACA ACCAGATGTT OCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 780 

TATAAAATAT CAGCAGACAC CCATCAGTTT TCTGTGACAG AAAGGBAACG TATCATTAAC 840 

CATGCTGCAG GCAGCCATGG AGTCTCTGGG ATATTTATGA AATATGATCT CAGTTCTCTT 900 

ATGGTGACAG TTACTGAGGA GCACATGCCA TTCTGGCAGT TTTTTGTAAG ACTCTGTGGT 960 

ATTGTTGGAG GAATCTTTTC AACAACAGQC ATGTTACATG GAATTGGAAA ATTTATAGTT 1020 

GAAATAATTT GCTGTCGTTT CAGACTTGGA TCCTATAAAC CTOTCAATTC TGTTCCTTTT 1080 
GAGGATGGCC ACACAGACAA CCACTTACCT CTTTTAGAAA ATAATACACA TTGA 



A202 Protein sequence; 
Gene name : 
Unlgene numbers 
Probeset Accession #: 
Protein Accession 
Signal sequence : 



Cellular Localization-. 



CDA14 
Hs. 26813 
N32912 
NP_0S7654 
none found 
none found 
nuclear 



1 11 21 31 41 51 

f i i i i i 

MRRIiNRKKTL SLVKBLDAFP KVPESYVETS ASGGTVSIjIA FTTMALLTIM EFSVY0DTWM 60 

KYKYEVDKDP SSKLRINIDI TVAMKCQYVG ADVLDLAETM VASADGEiVYE PTVFDkSPQQ 120 

KEWQRMLQLI Q8RLQEEKSL QDVIFK9AFK ST9TAIi?PR13 DDSSQBPNAC H1HGHLYVH1C 180 

VAGMFHITVG KAIPHPRGHA HLAALVNHES YKFSHRIDHL SPGELVPAII NPLDGTEKIA 240 

iDHHQMFQfYP ITWPTKLHT YKISADTHQF SVTEHHRI IK HAAGSHGVSG IPHKYDLSSL 300 

MVTVTEEHMP FHQFFVRLCG rVGGIFSTTG MLHGIGKFIV fillCCRFKLG SYKPVNBVPP 360 
BDGHTTJNHLP LIiENNTH 



A203 mA 5EQUBHCB 

Gene names ~ dlpeptidylpeptidase 
Onigene number: Hs. 44926 

Probe set Accession ff: S79Q76 
Nucleic Acid Accession #r HM_001935.l 
Coding sequence: 76-2301 {underlined 



IV (CD26, adenosine deaminase complexing protein 
sequences correspond to start and stop codons) 



* 11 21 31 41 51 

1 I 1 1 i I 

CGCGCGTCTC CGCCGCCCGC GTGACTTCTG CCIGCOCTCC TTCTCTGAAC GCTCACTTCC 60 

GAGGAGACGC CGACG ATQA A GACACCGTGG AAGATTCTTC TGGGACTGCT GGGTGCTGCT 120 

GCGCTTGTCA CCATCATCAG CGTGCCCGTG GTTCTGCTBA ACAAAGGCAC AGAK3ATGCT 180 

ACAGCTGACA GTCGCAAAAC TTACACTCTA ACTGATTACT TAAAAAATAC TTATAGACTG 240 

AAGTTATACT CCTTAAGATG GATTTCAGAT CATGAATATC TCTACAAACA AGAAAATAAT 300 

ATCTTGGTAT TCAATGCTGA ATATOGAAAC AGCTCAGTTT TCTTGGAGAA CAGTACATTT 360 

GATGAGTTTG GACATTCTAT CAATGATTAT TCAATATCTC CTOATGGGCA GTTTATTCTC 420 

TTAGAATACA ACTACGTGAA GCAATOGAGG CATTCCTACA CAGCTTCATA TGACATTTAT 480 

GATTTAAATA AAAGOCAGCT GATTACAGAA GAGAGGATTC CAAACAACAC ACAGTGGGTC S40 

ACATGGTCAC CAGTGGGTCA TAAATTGGCA TATGTTTGGA ACAATGACAT TTAXGTTAAA 600 

ATTQAACCAA ATTTACCAAG TTACAGAATC ACATGGAOGG GGAAAGAAGA TATAATATAT 660 

AATGGAATAA CTGACTGGGT TTATGAAGAG GAAGTCTTCA GTGCCTACTC TGCTCTGTGG 720 

TGGTCTCCAA ACGGCACTTT TTTAGCATAT GCCCAATTTA ACGACACAGA AGTCCCACTT 780 

ATTGAATACT CCTTCTACTC TGATGAGTCA CTGCAQTACC CAAAGACTGT AOGGGTTCCA 840 

TATCCAAAGG CAGGAGCTGT GAATCCAACT GTAAAGTTCT TTGTTGTAAA TACAGACTCT 900 

CTCAGCTCAG TCACCAATGC AACTXCCATA CAAATCACTG CTCCTGCTTC TATGTTGATA 960 

GGGGATCACT ACT TGTGT GA MTGACATGO GCAACACAAG AAAGAATTIC TTXGCAGTGG 1020 

CTCAGGAGGA TTCABAACTA TTCGGTCATG GATATTTGTG ACTATGATGA ATCCAGTGGA 1080 

AGATGGAACT GCTTAGTGGC ACGGCAACAC ATTGAAATGA GTACTACTGG CTGGGTTGGA 1140 

AGATTTAOGC CTTCAGAAOC TCATTTTACC CTTQATGGTA ATAGCTTCTA CAAGATCATC 1200 

AGCAATGAAG AAGGTTACAG ACACATTTGC TATTTCCAAA TAGATAAAAA AGACTGCACA 1260 

TTTATTACAA AAQGCACCTG GGAAGTCATC GGGATAGAAG CTCTAACCAG TGATTATCTA 1320 

TACTACATTA GTAATGAATA TAAAGGAATG CCAGGAGGAA GGAATCTTTA TAAAATCCAA 1380 

CTTATTGACT ATACAAAAGT GACATQCCTC AGTTGTGAGC TGAATCCGGA AAGGTQTCAG 1440 

TACTATTCTG TGTCATTCAG TAAAGAGGCG AAGTATTATC AGCTGABATG TTCCGGTCCT IS 00 

GGTCTGCOCC TCTATACTCT ACACAGCAGC GTGAATGATA AAGGGCTGAG AGTCCXGGAA 1560 

QACAAT TCAG CTTTGGATAA AATGCTGCAG AATGTCCAGA TGCCCTCXZAA AAAACTGGAC 1620 

TTCATTATTT TC3AATGAAAC AAAATTTTGG TAICAGATGA TCTTGCCTCC TCATTTTGAT 1680 

AAATCCAAGA AATATCCTCT ACTATTAGAT GTGTATGCAG GCCCATGTAG TCAAAAAGCA 1740 

GACAC TGTCT TCAGACTGAA CTGGGCCACT taccttgcaa gcacagaaaa cat TAT ACT A 1900 

GCTAGCTTTG ATGGCAGAGG AAGTGGTTAC CAAGGABATA AGATCATGCA TGCAATCAAC 1860 

AGAAGACTGG GAACATTTGA AGTTGAAGAT CAAATTGAAG CAGCCAGACA ATTTTCAAAA 1920 

ATGGGATTTG TGGACAACAA AOGAATTGCA ATTTGGGGCT GGTCATATGG AGGGTAGGTA 1980 

ACCTCAATGG TCCTGGGATC GGGAAGTGGC GTGTTCAAGT GTGGAATAGC CGTGGCGCCT 2040 
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GTATCCCGGT GGGAGTACTA TGACTCAGTG TACACAGAAC GTTACAT GGG TCTCCCAACT 21DO 

CCAGAAGACA ACCTTGACCA TTACAGAAAT TCAACAGTCA TGAGGAGAGC TGAAAATTTT 2160 

AAACAAGTTG AGTACCTCCT TATTCATOGA ACAGCAGATG ATAACGTTCA CTTTCAGCAG 2220 

TCAGCTCAGA TCTCCAAAGC CCTGGTCGAT GTTGGAGTGG ATTTCCAOGC AATGTGGTAT 2200 

ACTGATGAAG ACCATGGA AT AGC TAGCAGC ACAGCACACC AACATATATA TACCCACATG 2340 

AGCCACTTCA TAAAACAATG TTTCTCTTTA CCTTAGCACC TCAAAATACC ATGCCATTTA 2400 

AAGCTTATTA AAACTCATTT TTGTTTTCAT TATCTCAAAA CTGCACTGTC AAGATGATGA 2460 

TGATCTTTAA AATACACACT CAAATCAAGA AACTTAAGGT TACCTTTGTT CCCAAATTTC 2520 

ATACCTATCA TCITAAGTAG GGACTTCTGT CTTCACAACA GATTATTACC TTACAGAAGT 2560 

TTGAATTATC CX5GTCGQQTT TTATTGXTTA AAATOVTTTC TGCATCAGCT GCTGAAACAA 2640 

CAAATAGGAA TTGTTTTTAT GGAGGCTTT3 CATAGATTCC CTGAGCAGGA TTTTAATCTT 2700 

TTTCTAACTG GACTGGTTCA AATGTTGTTC TCTTCTTTAA AGGGATGGCA AGATGTGGGC 2760 

AGTGATGTCA CTAGGGCAGG GACAGGATAA GAGGGATTAG GGAGAGAAGA TAGCAGGGCA 2B20 

TGGCTGGGAA COCAAGTCCA AGCATACCAA CACGAGCAGG CTACTGTCAG CTCCCCTCGG 2080 

AGAAGAGCTG TTCACCACGA GACTGGGACA GTTTTCTGAG AAAGACTATT CAAACAGTCT 2940 

CAGGAAATCA AATATCGAAA GCACTGACTT CTAAGTAAAC CACAGCAGTT GAAAGACTCC 3000 

AAAGAAATGT AAGGGAAACT GCCAGCAACQ CAGCCCCCAG GTGCCAGTTA TGGCTATAGG 3060 

TGCTACAAAA ACACAGCAAG GGTGATGGGA AAGCATTGTA AATGTGCTTT TAAAAAAAAA 3120 

TACTGATQTT CCTAGTGAAA GAGGCAGCTT GAAACTGAGA "TGTGAACACA TCAGCTTBCC 3180 

CTGTTAAAAG ATGAAAATAT TTGTATCACA AATCTTAACT TOAAGGAGTC CTTGCATCAA 3240 

TTTTTCTTAr TTCATTXCTT TGAGTGTCTT AATTAAAAGA ATATTTTAAC TTGCTTGGAC 3300 

TCATTTTAAA AAATGGAACA TAAAATACAA TGTTATGTAT TATTATTOCC ATTCTACATA 3360 
CTATGGAATT TCTCCCAGTC ATTTAATAAA TGTGCCTTCA TTTTTTC 



A2Q4 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession # = 
Protein Accession 4t; 
signal sequence! 
Transmembrane domains; 
DPP I V_M_t e rm domain i 
Peptidaee_S9 domain: 
Cellular Localisation: 



dipeptidylpeptidase XV {CD26, adenosine deaminase coraplexing protein 2) 

HS.4492G 

S79B76 

NP_001?26.1 

none found 

6-28 

43-557 

558-635 

plasma membrane 



1 11 21 31 41 51 

1 I I I 1 I 

MKTPWKrLLG LLGAAALVTT ITVPWUJOK GTDDAXADSR KTYTLTDYLK NTYRIdKLYSL 60 

RWI8DHEYLY KQENUILVFN AEYGHSSVPI. ENSTFDSFGH SIMBYSISPD GQFTLLEYBIY 120 

VHQWHHSYTA SYDIYDLNXR QLITEERJPN NTQHVTHSPV GffittAYVWNN DIYVKJEPHL 1B0 

PSYRITKTGK EDIIYNGITD WVYEEEVFSA YSALWWSPNG TFXAYAQFHD TEVPLIEYSP 240 

YSDBSLQYPK TVTCVPYFKAG AVNPTVKPPV VNTDSEiSSVT NAT SI Q I TAP ASHLIGDHYL, 300 

CDVTWATQER ISLQWLRRIQ HYSVMDICDY DESBGRWNCL VARQHI EMST T3WVGRPRPS 360 

EPHFTJjDGNS FYKIISNEEG YKHICYFQID KKDCTPITKG TWEVIGIEAI, TSDYLYYISN 420 

EYKGMPGGFJN LYKIQIiIPYT KffTCLBCEW FERCQJYYSVB FSKBAKxYQI* RCSGPGLPLY 480 

TliHfiSVNHKG LRVLEDNSAI* EKMLQNVQMP SKKLDPIXLN ETKFWYQMIL PPHFDKSKKY 540 

PliTJJTVYAGP CSQKADTVPR IiNWATYTjASI* BNIIVA8FDG EG9GYQGDKI MHAINRKLGT 600 

FEVEDQIEAA RQFSKMGPVD NKRTATKGWS YGGYVTSMVX, GEGSGVFKCG IAVAPVSKWE 660 

YYDSVYTHRY MGLPTPEDHI, DITYRNSTVMS RAEHFKQVKY J*U I HGTADDN VHFQQSAQIS 720 
KALVDVGVDF QAMWYTDEDH GIASSTAHQH IYTHMSHPIK QCPSLP 

A205 DMA SEQUENCE 

Gene namei predicted exon 

Unigene number: none found 

Probes et Accession #i none found 

Nucleic Acid Accession #s none found 

Coding sequence - 1-261 {underlined sequences correspond to start and stop codons) 



1. 11 21 31 41 51 

I 1 l I l l 

ATGG CTCTGG CGAAGGTGAG GGAGCCAAAC GCAAATGACA ATGCCATCAG AGTTGACAAC 60 

AGAAGTGTGA TTAAAGTGCG TGCXAACCA3 TGT!lCCCTGC ATGAGGCAGA AAGTGAATCC 120 

AGAAACCCTC AGGAGCTCTG GATGGGCCTG CTCCTCTTGA TGGGGGTCCT AGAAGCATGT 180 

GTGGAAATGA GGOCTCTGTC AGTCTOGTCC CTGAQAGATG ACAAGGAGCA GAGCCCCCAC 240 
CAGCCCACAC TGGATGT CTA A 



A206 Protein sequences 



Xlnigene number: 
Probeaet Accession {| i 
Protein Accession ft* 
Signal sequences 
Transmembrane domains z 
Cellular Local izat ion i 



predicted exon 
none found 
none found 
none found 



46-68 

not determined 



1 11 21 31 41 51 

1 I I I 1 I 

MAIiAKVREPN AHDHAXRVDN RSVIKVRANQ CSIJEAESES HKPQELHWGI. U^LMGVLEAC 60 
VEMRPLSVWS LRDDKBQSPH QPTIjDV 
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A2Q7 DNA SEQUENCE 

Gene name? ESTs 

Unigene number: ns .222386 

Probeset Accession #: AI672225 

Nucleic Acid Accession. tt= none tound 

Coding sequencer 1-462 (underlined 



sequences correspond to start and stop codons) 



ATGCCAAATG 
CTCATACTTG 
ATTGATGTAT 
ATCATGTGGA 
TTTATGGCTA 
AACCTGACTA 
AGCAGAACAC 
GTCAAAGATC 



11 
I 

CTGAGTTAGA 
CTGTATGTTG 
CTTCTCAAGA 
CCAGTTTTGT 
TTGAAGAAGA 
ATGGTGCOBC 
CTGAAAGCCA 
AGATAGTTGT 



21 

i 

AGCAAAGAGC 
TGGATCAGCA 
TCTGGACAGA 
GGAAGACAAT 
AATGAAGAAG 
TQCTGGCAAT 
GCAATTTCCT 
AGATATGCQG 



31 
I 

CTTGGAAGCA 
AATATAGTCA 
CGGCCAGAGA 
CTTTCCATGG 
CAOGGAAGTA 
GGTGATGATG 
QACACTGAGA 
CGT I' ATiTCT 



41 
I 

GTAAATGTTT 
GCCCTCTACT 
GTATGCTGTT 
GCTGQGQGAA 
CTCATGTGGG 
GATTAATTCC 
ATGAAGAGTA 
GA 



51 
I 

AAAAACTGCT 60 

TGAGCAAAAT 120 

TCTAGTCATC 18Q 

GCTAGAAGAT 240 

ATTCCCAGAA 3€0 

TCCAAGGAAG 360 

TCACAGGTTT 420 



A2 OB Protein sequence: 
Gene name; 
Unigene number i 
Probeset Accession #; 
Protein Accession #t 
Signal sequence: 
Transme mbrane domains - 
Cellular Localization: 



ESts 

US, 222 8S 6 
AI672225 
none found 
none found 
26-38 

not determined 



1 11 21 31 41 51 

I I I I I I 

HPHABLBAKS ILGSSKCLiKTA L2LAVCCGSA N I VS PKLkEQN rDVeSQDLDB RPEShCbFLVI 60 
IKWTSFVEDN I»SMGWGKLED PMAIBEEMKK HGSTHVGFPE NIiTNGAAACBT ODDGLIPPRK 120 
SRTPESQQFP DTENEEYHRF VKDQIWDHR SYF 

A209 DNA SSQUEHCB 

Gene name: adz (odd Oz/ten- ci, Drosophila) horaolog 1 

Unigene number: Hs,23 796 

Probe set Accession #• NM_014253 

Nucleic Acid Accession #= NM_014253 

Coding sequence: 65-B242 (underlined sequences correspond to start and stop codons} 



1 11 21 31 41 SI 

1 I I } I I 

GACTDCTTGC ATTAAAGGAC TTCCTCATCC TTTTTTTCAT GAAACTGAGC TTGCTTAATC 60 
AGAGATGGAG CAAACTGACT GCAAACCCTA CCAGCCTCTA CCAAAAGTCA AGCATGAAAT 120 
GGATCTAGC1 TACACCAGTT CTTCTGATGA GAGTGAAGAT GGAAGAAAAC CAAGACAGTC 180 
ATACAACTCC AGGGAGACCC TGCAOGAQTA TAACCAGGAG CTGAGGATGA AT T ACAATAQ 240 
CCAGAGTAGA AAGAGGAAAG AAGXAGAAAA ATCTACTCAA GAGATGGAAT TCTGTGAAAC 300 
CTCTCACACr CTGTGCTCTG GCTACCAAAC AGACATGCAC AGCGTTTCTC GGCATGGCTA 360 
CCAGCTAGAG ATGGGATCTG ATGTGGACAC AGAOACAGAA GGTGCTGCCT CACCTGACCA 420 
TGCACTAAGA ATGTGQATAA GGGGAATGAA ATCAGAGCAT AGTTCCTGTT TGTCCAGCCO 480 
GGCCAACTCT GCATTATCCT TGACTGACAC TGACCATQAA AGGAAGTCTG ATGGGGAAAA 540 
TGOTTTCAAA TTCTCTCCTQ TTTGTTGTQA CATGGAGGCT CAAGCTGGGT CTACTCAAQA 600 
TGTGCAGAGC AGCCCACACA ACCAGTXCAC CTTCAQACOC CTCCCACCGC CACCTCCQOC 660 
TCCTCATGCC TGCACCTGTG CCAGGAAGCC ACCGCCTGCA GCGGACTCTC TTCAGAGGAG 720 
ATCAATGACT ACCCGCAGCC AQCCCAGCCC AGCTGCTCCA GCTCCCCCAA CCAGCACGCA 7 BO 
GGATTCAGTC CATCTGCATA ACAGCTGGGT OCTGAACAGC AACATACCAT TGGAGACCAG 840 
QCATTCCCTG TTCAAACATG GATCTGGTTC CTCTGCGATC TTCAGTGCAG CCAGTCAGAA 900 
CTACCCTCTG ACATCCAATA CCGTGTACTC GCCCOCTCOC AGGCCTCTTC CTCGAAGCAC 960 

CTETTCCCGA CCTflCCTTTA CCTTTAACAA ACCTTACAGG TGCTGCAACT GOAAGTGCAC 1020 

AGCATTGAGC GCCACTGCAA TCACAGTGAC TTTGGCCTTG TTACTAGCCT ATGTGATTGC 1080 

AGTGCATTTG TTCGGCCTGA CTTGGCAGTT GCAACCAGTT GAAGGAGAGC TGTATGCAAA 1140 

TGGAGTTAGC AAAGGGAACA GGGGGACCGA GTCCATGGAC ACTACTTACT CTCCAATTGG 120 D 

AGGAAAAGTT TCTGATAAAT CAGAOAAAAA AGTGTTTCAG AAGGGACGGG CGATAGACAC 1260 

TGQAGAAGTT GACATTGGTG CACAGGTCAT GCAGACCATT CCACCTGGTT TATTCTGGCG 1320 

TTTCCAGATT ACTATCCACC ATCCAATATA TCTGAAGTTC AATATTTCTT TAGCCAAGGA 13 BO 

CTCTCTGCTG GGAATTTATG GCAGAAGAAA CATTCCACCT ACACATACTC AGTTTGATTT 1440 

TGTAAAACTA ATGGATGGCA AACAGCTGGT CAAGCAGGAC TCCAAGGGCT CTGATGATAC 1500 

ACAQCACTCC CCTCGGAACC TGATCTTAAC TTCGCTTCAG GAGACAGGrT TCATAGAGTA 1560 

TATGGATCAA GGACCTTGGT ATCTGGCGTT TTACAATGAT GGAAAAAAQA TGGAGCAAGT 1620 

ATTCGTGTTA ACTACAGCAA TTGAAATAAT GGATGACTGT TCAACCAATT GCAATGGAAA 1660 

TGGAGAGTGT ATCTCTGGCC ATTGTCATTG TTTCCCAGGA TTCCTTQGAC CTGACTGTGC 1740 

TAGAGATTCC TGCCCTGTGC TGTGTGGXGG GAATGGAGAA TACGAGAAAG GACACTGTGr 1800 

CTGCCGGCAT GGCTGGAAGG GGCCAGAGTG TOACGTTCCG GAAGAACAAT GCATTGATCC 1860 

AACATGCTTT GGCCACGGCA CCTGCAT CAT GGGAGTCTGC ATCTGTGTGC CAGGATACAA 1920 

AGGAGAAATA TGCGAGGAAG AGGACTGCCT AGACCCAATG TGTTCCAACC ATGGCATCTG 19 B0 

TGTAAAAGGA GAATGTCACT GTTCTACTGG CTGGGGAGGA GTTAAClGTG AAACACCACX 2040 

TCCTGTATGT CAAGAGCAGT GCTCAGGACA CGGAACTTTT CTTCTOGACG CTGGAGTATG 2100 

CAGCTGXGAT OCCAAGTGGA CAGGATCTGA CTGCTCAACA GAGCTGTGTA CCATGGAGTG 2160 

TGGTAGCCAT GGAGTCTGCT CAAGAGGRAT TTGCCAGTGT GAAGAAGGCT GGGTAGGACC 2220 

AACATGTGAG GAACGCTCCT GTCATTCTCA TTGTACTGAG CATGaCCAAT GCAAAGATGG 2200 
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AAAATGTGAG TGTAGCCCTG GATGGGAGGG CGACCACTaC ACAATIGCTC ACT AC T TAG A 234D 
TGCTGTCCGA GATGGCTGCC CAGGGCTCTG CTTTGOAAAT GGACQATQTA CCCTGGATCA 2400 
AAATGGTTGG CACTGTGTGT GTCAGGTGGG TTGGAGTGGG ACAGGCTGCA ATGTTGTCAT 2460 
GGAAATGCTT TGTGGAGATA ACTTGGACAA TGATGGAGAT GGTTTAACCG ACTGTGTGGA 2520 
5 TCCTGACTGT TGTCAACAAA GCAACTGTTA TATAAGTCCT CTCTGCCAGG GCTCACCAGA 25 80 

TCCTCTTGAC CTCATTCAGC AAAGCCAAAC TCTCTTCTCT CAQCACACTT CAAQACTTTT 2640 
TTATGATOGA ATCAAATT CC TCATTGGCAA GGACAGTACT CATGTCATTC CTCCTGAGGT 2700 
GTCATTTGAC AGCAGGCGTG CCTGTGTGAT TCGAGGCCAA GTGGTGGCCA TAGATGGAAC 2760 
TCCTCTAGTG GGAGTGAATG TCAGTTTCTT GCACCACAGT GATTATGGGT TTACCATCAG 2820 
10 CCGGCAAGAT OGAAGCTTTG ACCTCGTGGC CATCGGTGGC ATCTCTGTCA TCTTAATCTT 2BflO 
CGACCGATCC CCTTTCCTGC CTGAGAAGAG AACACTCTGG TTGCCTTGGA AT CAGTTTAT 2540 
TGTGGTAGAD AAAQTCACCA TGCAGAGAGT TGTATCAGAC CCGCCATCCT GCBATATCTC 3000 
CAACTTTATC AGCCCAAACC CTATTGTGCT TCCTTCACCG CTCACATCAT TTGGAGGG7C 3060 
CTGTCCAGAG AGGGGAACTA TTGTTCCTGA GCTGCAGGTT GTACAGGAGG AAATTCCCAT 3120 
15 TCCCTCCAGC TTTGTGAGGC TGAGTTACCT GAGCAGCCGC ACCCCTGGGT ATAAAACCCT 31B0 

GCTACGGATC CTTCTGACAC ATTCAACGAT TCCOGTAGGC ATGATAAAAG TACACCTCAC 3240 
AQTAGCTGTG GAAGGGCGAC TCACACAGAA GTGGTTTCCC GCCGCAATTA ATCTTGTCTA 3300 
CACMTTGCT TGGAACAAOA CGGATATCTA TGGACAGAAG GTTTGGGGCC TGGCAGAGGG 3360 
TTTGGTATCT GTGGGATATG AATATGAAAC GTGCCCTGAG TTTATTCTCT GGGAGCAAAG 342 0 
20 GACAGTCGTT TTACAAQGTT TTGAGATGGA TGCTTCTAAC CTAGGAGACT GGTCTTTGAA 3460 
TAAGCATCAC ATTTTGAATC CTCAAAGTGG AATCATACAT AAAGGGAATG GAGAAAATAT 3540 
GTTCATTTCC CAGCAGCCCC CAGTCATATC AACCATAATG GGTAATGGAC ACCAAAGOAG 3600 
TGTAGCCTGC ACCAACTGCA ATGGCCCAGC CCACAACAAC AAACTCTTTG CTCCTGTCGC 3660 
CTTAGCTTCT GGCCCTGATG GCAGTGTGTA TGTTGGCGAC TTCAATTTTG TAAOGAGAAT 3720 
25 ATTTCCCTCG GGAAACTCCG TTAGTATTTT GGAATIAAGC ACAAGTCCTG CTCACAAATA 3780 
CTATCTGGCT ATGGACCCTG TGTCTQAATC ACTCTATCTA TCAGACACCA ATACTCGCAA 3840 
AGTCTACAAG TTGAAATCTC TTGTGGAGAC QRAAGATCTG TCCAAGAATT TTGAAGTGGT 390 0 
GGCAGGAACT GGTGATCAGT GCCTTCCCTT TGACCAGAQT CATTGTGGAG ATGGTGGGAG 3960 
AGCATCGGAA GCTTCACTGA ATAGCCCTCG AGGCATCACA GTTGATAGGC ATGGATTTAT 4020 
30 TTACTTTGTD GATGGGACTA TGATTCGCAA AATTGATGAG AATGCTGTGA TCACAACTGT 4080 
AATCGGCTCA AATGGTCTQA CTTCCACACA ACCACTGAGC TGTGACTCAG GAATGGACAT 4140 
CACTCAGGTG CGATTAGAGT GGCCAACAGA OCTTGCAGTA AATCCTATGG ACAATTCATT 42 00 
GTATGTCTTG GATAACAACA TTGTGCTGCA AATTTCTGAG AACAGGCGTG TTCOGATdAT 4260 
CGCAGOACGC CCCATTCACT QCCAGGTGCC AGGCATCGAT CATTTCCTGG TCAGCAAGGT 4320 
35 AGCAATTCAC TCCACTCTAG AGTCAGCGAG GGCCATCAGT GTCTCCCACA GCGGGCTGCT 4380 

CTTCATAGCT GAAACAGACG AGAGGAAAGT AAACCGCATT CAGCAAGTAA CCACCAATGG 4440 
GGAGATCTAC ATCATCGCTG GTGCCCCCAC TGACTGTGAC TGCAAAATTG ATOCAAACTG 4500 
TGACTGTTTT TCAGGTGATG GTGGCXATGC CAAAGATQCA AAGATGAAAG CCCCTTCCTC 4560 
CTTAGCAGTG TCGCCTGATG GAACCCTCTA TGTGGCAGAC CTCGGAAATG TTCGAATTCQ 4620 
40 TACCATCAGC AGGAACCAAG CCCACCTGAA TGACATGAAC ATTTATGAGA TTGCTTCACC 4680 
CGCTGATCAG GAACTGTACC AGTTCACTGT AAATGGAAOC CACCTACACA CCCTGAACTT 4740 
GATAACAAGG GACTATGTTT ATAACTTCAC CTACAATTCT GAAGGTGACT TGGGCGCGAT 4800 
TACCAGCAGC AATGGCAATT CAGTGCACAT TCGCCGTGAT GCAGGCGGAA TGCCGCTATG 4860 
GCTTGTGGTG CCTGGCGGAC AAGTATACTG GCTGACTATA AGCAGCAATG GAGTCCTGAA 4920 
45 AAGAGTGTCA GCCCAAGGCT ATAATCCGGC CTTAATGACC TATCCAGGAA ACACAGGGCT 4980 
TCTGGCTACC AAAAGTAACG AAAATGGATG GACAACCCJTT TATGAGTATG ACCCCGAGGG 5040 
ACACCTGACC AATGCAACGT TTCOCACTGG AGAGGTCAGC AGCTTCCAOA GTGACCTGGA 5100 
GAAGCTGACA AAAGTGGAGC TAGATACT1C CAACCGTGAA AATGTCCTCA TGTCAACCAA 5160 
CTTGACGGCA ACTAGTACCA TATATATTTT AAAACAAGAA AATACTGAAA GTACCTATCG 5220 
50 GGTGAATCCA GATGGTTCCC TGCGTGTCAC TTTTGCCAGC GGGATGGAGA TCGGCCTCAG 5280 
CTCAGAGCCC CACATCCTGG CAGGGGCAGT CAACCCTACC CTGGGCAAAT GCAACATCTC 5340 
ATTGDCCGGA GAGCACAATG CAAACCTCAT CGAGTGGCGG CAGAGGAAGG AGCAAAACAA 5400 
AGGCAATGTT TCGGCTTTTG AAAGGAGGCT GAQOGCCCAC AACAGAAACC TACTCTCCAT 5460 
AG AT TTT GAT CATATAACCC GCACAGGAAA GATCTATGAT GACCATCGAA AATTCACCCT 5520 
55 TCGAATTCTT TATGACCAGA CTGGGCGACC CATTCIGTGG TCTCCTGTAA GCAGATATAA 5580 
TGAAGTGAAC ATCACATATT CACCTTOGGG ATTGGTGACG TTTATTCAAA GAGGAACGTG 5640 
GAATGAAAAA ATGGAATATG ACCAGAQTGG GAAAATTATT TCAAGAACTT GGGCTGATGG 5700 
GAAAATTTGG AGCTATACCT ACTTAGAAAA ATCTGTGATG CTTCTCCTAC ACAGCCAGCG 5760 
GOGTTACATC TTTGAGTATG ACCAATCAGA TTGCCTGCTG TCAGTTACCA TOCCTAGCAT 5820 
60 GGTGCGCCAC AGCTTACAAA CCATGCTTTC AGTGGGCTAC TACCGTAATA TCTACACBCC S880 
AOCGGACAGT AGCAGTTCTT TTATCCAAGA CTATAGTCGA GATGGCOGAT TGCTACAGAC 5940 
CCTGCATCTG GGGACAGGGC GCAGAGTCTT ATACAAGTAC AGCAAGGAAG CAAGGCTTTC 6000 
TGAGGTTCTC TATGATACCA CTCAGGTCAC ATTAACATAT GAAGAGTCTT CTGGAGTGAT 6060 
TAAGACAATA CACCTGATGC ATGAOSGATT CATCTGCACA ATCAGATACA GGCAAACRGG 6120 
65 ACCTCTTATT GGACGCCAGA TTTTCAGATT CAGTGAAOAA GGCCTTGTGA ATGCACGGTT €180 
OGACTACAGC TACAACAATT TCOGAGTCAC AAGCATGCAA GCTGTAATCA ATOAAACCCC 6240 
TTTGCCTATA GATCTTTACC GATATGTTGA TGTCTCTGGC AGAACAGAGC AGTTTGGAAA 6300 
ATTCAGTGTA ATTAATTACG ATTTAAATCA GGTCATAACT ACTACAGTGA TGAAACACAC 6360 
CAAAATCTTC AGTGCCAATG GACAAGTGAT TGAAGTCCAA XATGAAATCC TAAAGGCAAT 6420 
70 TGGCTACTGG ATQACCATTC AATATGATAA TGTGGGCCGA CATGGTAATA TGTGCATAAG 6480 
GGTAGGAGTA GATGCCAATA TAACAAGGTA CTTCTATaAA TACGATGCTG ATGGGCAACT 6540 
TCAGACTGTT TCTGTAAATG ACAAAACCCA GTOGGQTTAT AGTTACGATC TGAATGGAGA 6600 
CATCAACCTC TTAAGCCATG GGAAQA0TGC TCGTCTTACT CCTCTCCGAT ATGACCTCCG 6660 
AGACGGCATC ACCAGATTAG GAGAAATTCA GTATAAAATG GATGAAGATG GCTTTCTGAG 6720 
75 GCAGAGGGGA AATGATATTT TTGAATATAA TTCTAATGGC CTGCTGCAGA AAGCCTACAA 6780 
TAAGGCTTCT GGCTGGACTG TGCAGTATTA CTATGATGGG CTTGGGGGAC GTGTCGCGAG 6B40 
TAAGTCCAGC CTAGGGCAGC AOCTTCAGTT CTTTGTOGAC GCGACOGCGA ACCCCATAAG 6900 
AGTTACTCAT TTGTACAACC ACACAAGCTC GGAGATTACA TCTCTGTATT ATGATCTOCA 6960 
. _ AGGTCACCTT ATTGCCATGG AGTTAAGCAG TOGTGAAGAA TATTATGTAG CCTGTGATAA 7020 

80 TACAGGTACC CCACTAGCTG TGTTCAGCAG CCGAGGTCAG GTCATAAAGG AGATACEATA 7080 
CACACCTTAT GGCGATATCT ATCATGACAC TTACCCTGAC TTTCAGGTCA TAATTGGTTT 7140 
TCATGGAGGA CTCTATGATT TCCTTACTAA ATTAGTGCAC CTGGGGCAAA GGGATTATGA 7200 
TGTTGTTGCT GGCAQATGGA CAACGGCCTA TCATCACATA TGGAAACAGT TGAACCTCCT 7260 
TCCTAAACCA TTCAACCTCT ACTCCTTTGA AAATAACTAC CCAGTTGGCA AAATTCAAGA 7320 
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TGTTGCAAAG TATACCACAG ACATCAGAAG TTGGTTGGAG CTATTTGGTT TCCAATTACA 7380 
CAATGTACTA OCTGGATTTC CCAAACCTGA ATTAGAAAAT TTAGAATTAA CTTACGAGCT 74.40 
TCTACGGCTT CAGACAAAAA CTCAAGAGTG GGATCCTGGA AAGACTATCC TGGGCATTCA 7SD0 
GTGTGAACTC CAGAAACAGC TCAGGAATTT CATTTCCTTG GACCAACTAC CTATGACTCC 7560 
CCGATACAAT GATGGACGGT GCCTTOAAGG AGGGAAGCAA CCAAGGTTTG CTGCTGTCCC 7620 
TTCTGTTTTT GGGAAAGGTA TAAAATTTGC CATCAAGGAT GGCATAGTAA CAGCTOATAT 76 80 
TATAGGAQTA GCCAATQAAG ATAGCAGG GG GCTTGCTGCC ATTCTCAATA ATGCCCATTA 7740 
CCTGGAAAAC CTACATTTTA CCATAGAGGG GAGGGACACT CACTACTTCA TTAAGCTTGG 7800 
GtCTCTGOAG GAAGACCTGG TGCTCATCGG TAACACTGGG GGGAGGCGGA TTCTGGAGAA 7B60 
TGGTGTCAAT GTCACTGTGT CCCAGATGAC TrCTCTGTTG AATGGGAGGA CTAGACGGTT 7920 
TGCAGATATT CAGCTCCAGC ATGGAGCCCT GTGCTTCAAC ATCCGGTATG GGACAACTGT 7980 
CGAAGAGGAA AAGAATCACG TGTTGGAOAT TGGCAGACAG OGCGCAGTGG CCCAGGCCTG 8040 
DAGTAAGGAA CAAAGAAGGC TGCAAGAGGG GGAAGAGGGG ATTAGGGCAT GGACAGAAGG 8100 
GGAAAAGCAG CAGCTTTTGA GCACTGGGCG GGTACAAGGT TACGATGGGT ATTTTGTTTT 8160 
GTCTGTTGtAG CAGTATTTAG AACTTTCTGA CAGTGCCAAT AATATTCACT TTATGAGACA 822 D 
GAGCGAAATA GGCAGGAG GT AAC AAAAATA TCTCTGCCTT TGCGTCACCA AAGACTGCCT 8280 
GTTTTTAAAA CATAAAATGG TTTATTGTAT TGGTTTTCTA GATCAGAACT CTGTATATGT 83 40 
AAATATGGAG GAAAAACATA TCCAACTGCC TTTCAATGTG ACGGAAGATG GTATTTTAAT 8400 
ATTGTTTGTT TAAACTCTTT AAGAAATGAC AGAGATTTTT AGTTCTTGTG TGGCAGTATT 8460 
CAAAATAACA CAAGTAGAAC TCAAACAGCT AAA&ACAGTT TTCAGAAAGC ACCACTTTCA 8520 
ATTTOCCGAG CCATGCATAT GTTCGAATAT CCAGAAAGAA CCCAAGGTTC TCTATCTCTA BS80 
TTGTGAGAAG CAQTTTCATC CTTAACTGTT GGCAGAACTT ACGGGCTATT TGAATAGGTG 8C40 
GTGCAATAGT ATCTGAAACT TGCCTTTCGA AAGACXGCCA GCCCTTTGAC GTTTTOCAGA 8700 
TCTGT TAT AG GAAACTTAAA AACAQGTGTA AAATGTCTTC AGCCACCATC TCCTAGAGTG 8760 
AGGACCCAAT TGCCCTTCCI T CTTGATT AT TCCTCCTTGC TTQTTAAAGT AAATGCCATA B820 
TTGTTGTGCT GTGTTTTGQC GTGTGGTGGC TGGGTTCTGT CTACCATOCT TCCCTGTGGG BBBO 
TGTGGTAACC AGACTGTATA GCCGCTATTT GCTCGTGTGT ACATGATACC ARAGCAGCTG 8940 
GCCAGCGTGA CCTCTCTCAC AGGACCTGTT TTGACTCAAT TTTTTACTAA AAGTTGTTCA 9000 
GCTGTATTGG TATGATGTAA ACATAGCTTT TATTAACCTG GGTAGGAATT TCTCATTTAT 9060 
ATATAGGATQ TGTTTTGGTC ATAGTTTCAC ATTAOTGATT CAGTATCTAT ACACTGACCC 9120 
AATGGTTTTG TGCACATGAA CGGTAATTTA CTTAAAAGTA TGATTCTGGT ACAAAAACAA 9180 
ACAAAGGCTT TAGCAGGCAT ACGTGTCTGG GATGCCGATA CATACATTAA CTACTACTGC 9240 
AGAAATTCAT AAGAGOCAAA ACCTTAAAAA AATAGACCTQ GTACTTAAGT GAAAGTACTA 9300 
AAGGGAAGAC GAGACCAAAC ATCACAGCAG TTGCTGGCAC ATTGTTTCAG OCCACTTAGA 93 60 
TTTATCTTTC AAATGTACAA TTCTGTATTG AACATCTCCG AGCCATCTTC AGGAAATCGA 9420 
ATCAAGTAAA TCCTXTCCAA CCGAAAACAT TTCAACTAAC TATAGAGAGG CAGACTCATT 9480 
TTTACTAAAA TAATTTATAC AGTTAGTTAT TTTOGTTCTC CGTACTTACC CATTTA1CTT 9540 
TATTTAATCG TCTCTACTQC CTAGGAAAAT AACTATTTTC CAGGACGGGT TATTTGTTCT 9600 
GCGA1CATTT AAAATTTGGA GAAAGGTCAG GAT£AGTGTT AATATCAGCT GCAGTTTCTC 9660 
AATCTCTAGG AATCCTGCAG TAAAACAAGC CCCTTGGTGA GCTGGAAGAT TTGTGCCCAG 9720 
TGACAAAGAG ATAGTTTOTA AAATGCTGTG TAATTGTAAG TTACCACAAA TGAAAATACA 9780 
TGACAGCACA ATGTGGCCCG TAGAAAATTC CCCTGAOCCA GCTTCTGCAC TTTCATCACC 9840 
GAATCTGAAC ATTTGCTATG TCTGAAGGCA AATTTATGAT GGAATGTOCAG TTTGGATTCT 9900 
TTCCAGATGC TACCTAAATG CAGTGTGGGG TCATTGGCTT GCTTTGOGAT GACAGTTTCT 9960 
TTGAAAATAT GCAAAGTCAT AAGCTCATGT TAAGGTTTTT CAAGAGTCTG CCTCCTACTA 10020 
CACAAAOGAA AGCAAGGGAA AGGAAATGAC CCTGGCAAAC AGTAGGGAAG GGTGTATTCA 100 80 
AACATTTCAT TTTCAAAACC TTGGGGTTAQ AATACCACTT ACACATGTAT TCTGAGAGAC 10140 
AGAATTCATG AGGAACTCAT CTCTCTTTAT AACTGGAAAC ACACCAGCTT GATATATTGC 10200 
TAATCCATAC TAAAATCATA TTATTGGGTT TTTTCTGAAT CAGGCCTGTA TTAATGGTAC 10260 
A&T AT T TATT GAGAATGGAA TTCTAAAATT ACTAACAAAC TTGTTGAAAA TTTGAATACC 10320 
TCCACACCAA CCTAAAAATG GACCTTAAGT TCCTAGAACC TCTGATGTTC TTTTAAATTA 10380 
ATGGAAAAAT AATTTGTGAA CTQTATATAG AGAGTGCATT CATAAATGTG ATTATGTATT 10440 
TTATCACAAA TCCAAAATGT CAATATTAGA GTCTATTTTG CTTATATTTT AAGCAATTAT 10500 
ACGTTTTTGC AATTCATTGA TGATGTATCA TTTTCAAACT GCTTTAAATA TCCATTAGAA 10560 
ACAAATATTT GAAGCTTTTA CTTAATAGTG ATTACCTTGA ACTGTGCATT TCTAGTTTGT 10620 
AATACGTATT TGGTTGGTTC GTGCCTTTAG TTTQTTAAAG TTACATTTGT ATTATATTCA 10680 
GGAAATQCAC TTTTTATTAC TTACAGCTGT GGTTTTAATA CTGCCTTGAA CTATTATTAT 10740 
TCTTTT TACA ACTCCTAAAG CTTGAGGGAG GAAAGAAAAA AAAAACAAAA CTACTAATCA 10600 
GTAOTAAATC GAAGAGAAAC ATTTTGGCAT TTCTTAAGAA GAAGATGGAG ATATIGAGTA 10860 
TATCACTTCC TATTCAGCTG AA7AGAAAGA ATGCCTTCAT TGACTTQCAG TTCTGCAGTT 10920 
TAAATTATTG AAAGAACAAT TCOTTTGCAT TTCCTGATGA AAGTAAAAGC ATTTTTCAGA 10980 
GAAACATATG AATTTCTCAT ACCCAGCAGA CAGATGGCTG ACACTGCACA GCCACACAOC 11040 
ATTCGAGTAA GTTAAAGTGA GAGCATAGTA GTTGGACTCT CCTATGAAGA ACATICTGGG 11100 
CTGGAGGCAG GGAATAGTCC ATGGTTQTTT CTTTTTCCTA CrTAAGCCCA TTTTGTTTGT 11160 
GCTTTTCTGT TTTGTTTTGT TTTCACTCTT GCACTACAGT CTAGAGATCC AAATGAACTG 11220 
AAAAGTTCAA AGTTTAACAG ATTTAAATAT GTTTACTTTT AGTTGTCATT CTAATCGTTA 11280 
TTGATTAfiAA GCATGACTCC TGAAGGAAAG GGAAATAAAT CTCAATTCAT ACTAACTTGC 11340 
AACAAAACAC TTTTACCATA TAAATAAGTA TATGATTTAT TTTTAACCCA AAAAATGTAT 11400 
AAAATAAGTG TGTCCTTTAC TGTCAATTTA TCGAGAAGAT CTATAATATA TAGACTACAT 11460 
ATATATAATA TATACAACAT AGCCAAATGT ATGAAAACTT GACAATGTAT AATTTGGAAT 11520 
TCACATGCTA CCTATGTAGA CAGGTATGAA ATTAAGTTAT AATTTTCATG AGACATTTTC 11580 
ATCACTGTTG ACACAOTTTC AAGGCATTCC ATCATGTTAT TTTGACTCTT TTTCTTTTTT 11640 
TTTTCTTTAA AAATATATTT TTAACTAGAC CAGGCC3CCAC TATAATATCA CTTAAGAGAG 117 00 
TCAGGGCAAA GTTXTTGCAT TTATGAAGAT GTGTTCATGT AAGGGTGATT GTAATGGAGT 11760 
TCATTGGTAA TAGAAGCAAA AGTACAGTAA CGAAGTATTG AAAAGAAAAT TTTGGAGACA 11820 
TTGGAGCATA TTATATATAQ CTTGTGGAAA GACATAAGGC TACAGATGGA ATGGAACATT 11880 
CCTGTTTTCT TGAAGAAATT CACATACACA TAGCTGACCT GACTAGTACT TCAGCTCTTC 11940 
CACAGCCTTC TATAAAGGTT CXlTCTTCTG CAAAGAAAAC AAAACAAAAC AAAACAAAAC 12000 
AAAAAAAAAC AAAAAAAGCG CAAAAAACAA AAAAACAAAA AAAAGCAAAQ TAAAATTTAA 12060 
AAATACAGAA AACAAACAAC AAAAAAGAAT TCAACCATAA ATAfiTGACTA TTATTTTCAG 12120 
TGXGXCCTTC ATGTGAAAGC TATTAAGGAC CAAATATACT ACTGTTXZATA AGAAGAAATT 12180 
ACTTTCTAAA CAGTAACTGA AAATACTTAG AGTTAAACTT GCTGTGGATT TTGTCTTGGC 12240 
AOTTGTCATC TTACATTATT TGTCAAAGGA AATGTGTTTG GCAGTTAAAA ATCTTTCCTT 12300 
AGATTTAGTG GTGGACTTTA ACC TCT T AAA TAAATGTTAG TATATCAGAT TGTGTCCTTG 12360 
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TCACCAGATT TGGTTCTTAA ACTTT 



10 



9eq ID NO: 272 Protein sequ&nce 
Protein Accession #= NP_0064G5.1 

1 11 21 31 41 51 

I I 1 1 I I 

MWKVfiALIiFV LGSASLWVLA BGASTGQPED DTETTGIJSGG VAMPGAKDDV VTPGTSKDRY 

KSGLTTUVAT SVKSVTQIRI EDIjPTSESTV HAQBQSPSAT ASNVATSH9T EKVDGOTQTT 

VEKDGLSTVT LVGI IVGVLI* AIGFIGGIIV WMRKMSGRY SP 



€0 
120 



Seq ID NO; 273 DMA sequence 

Nucleic Acid Accession tt= CAT cluster 



15 

20 

25 
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60 
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80 



i 
I 

GCGGCOGCCA 
CCCGQGGGTG 
G0G66CACTG 

TCCCGGGC7VG 
QAQAAGGGCS 
ACGGAGGCGO 
GTACTTGCGGG 
GGAAACCTAC 
CCCTTGACTG 
GCCGCTGGGA 
CTTCTTTCCA 



11 

I 

GCTTGCAAAG 
CCGAGCCGGC 
GGGGCGCGQG 
AGGTCCCGCT 
GCGGCX3QCC3Q 
CGGAGGOOGT 
CA3TCTTGCT 
GACCCCCGGA 
TGTACCCCGA 
CCCTCCGCGG 
ATTTCTCCOG 
GACTGCGCTT 



21 
I 

COGAAGTCTG 



AGGGGACGTG 
GOGGGCACGC 
GTGTGGCCCG 
GGAGTTCTTT 



GCTGGAGCCC 
GCCCTGGAGC 
CGGGTTGACC 
GCGGGGAGGA 
TGC 



31 
I 

googcgctct 

GTGGCTGCGC 
GCAQGCCCCG 
AATCTGCCTC 
TCGGGGCCGG 
GAGCTGCTGG 
CCTCTCBACG 
GGCCTCTTTG 
GTCCCGGGCT 
TTGAACGAGC 
CGGGCCGGGC 



41 
I 

TCGACTCGCT 
CGGCGGCCGG 
CGGGGGCCAC 
CGTOCTTCTT 
ACGTGAGCTT 
GGCCCGACTA 
TGTTCCCOGC 
AGCCQCCGCC 
GCTOCCOGAC 
CCTTGAGCCC 
CATTTGGCTT 



51 
I 

GCGCCACGTC 
GCTAGGAGGT 
GGCQATCCCA 
CAOGGAGOCG 
GGGCG ACCTQ 
CGGCGCCGGC 
CGGAflCCTCC 
GGCAGTGGTG 
CAAAAAGAGG 
OCTGTACCCC 
CTTTCGCCCC 



Seq ID HQs 274 DMA sequence 
Nucleic Acid Accession s 



jh sequence 



l 
I 

CAAAGAGGCC 
CGTCGAGAGG 
CCAGCRGCTC 
CCGGCCCCGA 
□AGGCAGATT 
CGGGGCCTGC 
GCGCAGCCAC 
AGAGCGCGGC 



11 
I 

GGGCTCCAGC 
CTCGGGGGCA 
AAAGAACTCC 
CGGGCCACAC 
GCGTGOCOGC 
CACGTCCCCT 
CTCACCCCCC 
CAGACTTCGG 



21 
I 

TCCGGGGGTC 
AGCAAGACTG 
AOGGCCTC3CG 
CCGCOGCCGC 
AGCGGGACCT 
CCCGCGCCCC 
GCCGGCTCGG 
CTTTGCAAGC 



31 
I 

COCGCAGTAC 
CCGCCTCCQT 
CGCCCTTCTC 



CAGTGCCCGC 
CACCCCCGGG 
TQGCGGCCGC 



41 51 

I 1 

GGAGGCTCCG GCGGGGAACA 
GCCGGOGCCG TAGTCGGGCC 
CAGGTOGCCC AAGCTCACGT 
OGGCTCCGTG AAGAAOGACG 
TGGGATCGCC GTGG0CCCO3 
ACCTGCTAGC CCGGCCGCCG 
GACGTGGCGC AGCGAGTGGA 



41 



51 

I 



GTGGGAGGCC AGTGGTGCTG 
GCTGCTCACT GCGGGGCCTG 
TGCAAGTCOG CGGCCCAGAG 
CAGTGGTGCT GGCCAAGAAG 
CTCCTGCTGC CTATGGCCCC 
ATGTGCCTGG AGAAGATCCA 
GGCTGTCCTQ GGA.TGTGGGA 
GTCCTGGTCA GCTGCCCTGA 
GAAACCATTG GAQAGTCTGA 
GTGGTGAGCC GGAACTGCAC 
GCCT OT GGGT TTGATGAATA 
GTGAAGGCCC TCT ACACGGT 
GTCATCCTTT GTCGCTTCCG 
TTTGTGTCGT TCATOCTGAG 
QAGCAGGACA GCAACCACTG 
TTCCACI7ACT GTGTTGTGTC 
ACTCTGCTGG TGGAGACCTT 
GGCTGGGGGA CCCCAACTGT 
GACACAGGCT GCTGGGATAT 
GTGGTTGGCT CTATCATGGT 
CAGAAACTTC AGTCTCCAGA 
CGGTCCACCC TGCTGCTCAT 
CCAGABAATG TCAGCAAAAG 
GGCTTTGTGG TGGCTGTTCT 
CGAAAATGGC GAAGCTGGAA 
CCGTCTCTGG CCAGCAGTGG 
AGCTCOCAAA TCCGCATGTC 
CCCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 



Seq ZD NOi 275 DMA sequence 
nucleic Acid Accession ft: NMjOOlllB.l 
Coding sequence: 74- -1651 

1 11 21 31 

11)1 
AGCCCAGAGA CACATTGGGG CTOACCTGOC GCTGCTGTCA 
GCCAAGAAGT GTCATGGCTG GTGTOGTGCA CGTTTCCCTG 
TCCGTGGGGC CGGGGCAGAC TCCGCAAAGG ACGCGCAGCC 
ACACATTGGG GCTGACCTGC CGCTGCTGTC AGTGGGAGGC 
TGTCATGGCT GGTGTCGTGC ACGTTTCCCT GGCTaCTCTC 
TGCCATGCAT TCTGACTGCA TCTTCAAGAA GGAGCAAGCC 
GAGGGCCAAT 0AGCTGATGG GCTTCAATGA TTCCTCTCCA 
CAACATCACG TGTTGGAAGC CCGCCCATGT GGGTGAGATG 
GCTCTTCCGA ATCTTCAACC CAGACCAAGT CTGGGAGACC 
TTTTGGTGAC AGTAACTCCT TAGATCTCTG AGACATGGQA 
GGAOGATOGC TGGTCGGAAC CCTTCCCTCA TTACTTTGAT 
TGAATCTGAG ACTGGGGACC AGGATTATTA CTACCTGTCA 

TGGCTAJCAGC acatccctcg tcaccctcac cacxgccatg 

GAAGCTGCAC TGCACACGCA ACTTCATCCA CATGAACCTG 
GGCGATCTOC GTCTTCATCA AAGACTGGAT TCTGTATOCa 
CTTCATCTCC ACTGTGGAAT GTAAGGCCGT CATGGVTTTC 
CAACTACTTC TGGCTGTTCA TCOAGGGOCT GTACCTCTTC 
CTTCCCTGAA AGGAGATACT TCTACTGGTA CACCATCATT 
GTGXGTGACA GTGTGGGCTA CGCTGAGACT CT ACTTT GAT 
GAATGACAGC ACAGCTCTQT GGTGGGTGAT CAAAGGCCCT 
TAACTTTGTG CTTTTTATTG GCATTATCGT CATCCTTGTG 
CATGGGAGGC AATGAGTCCA GCATCTACTT GCGACTGGCC 
CCCACTATTC GGAATCCACT ACACAGTATT TGCCTTCTCC 
GGAAAGACTC GTGTTTGAGC TGGGGCTGGG CTCCTTCCAG 
CTACH3TTTT CTGAATGGTG AGGTACAAGC GGAGATCAAG 
GGTGAACCGT TACTTCGCTG TGGACTTCAA GCACCGACAC 
GGTGAATGGG GGCACCCAGC TCTCCATCCT GAGCAAGAGC 
TGGCCTCCCT GCTGACAATC TGGCCACCTG AGCCATGCTC 

Seq ID NOx 276 Protein sequence 
Protejja. Accession #z NP_001109.1 

1 11 21 31 41 - 51 

I I 1 I i I 

KAGWHVSLA AHCGACPHGR GRLRKGRAAC KSAAQHHIGA DCLPLLSVGGQ WCWPRSVMAG 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84D 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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15 
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80 



TGCGGGAGAG CCTGGCCAAG TGCTGCAGTT GTTCAAGAAA GAG AGCCTX T GGTGTGCTAA 420 

AGACTCTTGT GCCCATCTTG GAGTGGCTCC CCAAATACCG AGTCAAGGAA TGGCTGCTTA 4BO 

GTGACGTCAT TTCGGGAGTT AGTACTGGGC TAGTGGCCAC GCTGCAAGGG ATGGCATATG 540 

CCCTACTAGC TGCAGTTCCT GTCGGATATG GTCTCTACTC XGCTTTTTTC CCTATCCTGA 600 

CATACTT TAT CTTTGGAACA TCAAGACATA TCTCAGTTGG ACCTTTICCA GTGGTGAC5TT 660 

TAATGGTGGG ATCTGT TGTT CTGAGCATGG CCCCCGACGA ACACTTTCTC GTATCCAGCA 720 

GCAATGGAAC TGTATTAAAT ACTACTATGA TAGACACTGC AGCTAGAGAT ACAGCTAGAG 780 

TCCTGATTGC CAGTGCCCTG ACTCTGCTGG 1 TGGAATT AT ACAOTTQATA TTTGGTGGCT 840 

TGCAGATTGG ATTCATAQTG AGGTACTTGG CAGATCCTTT GGTTGGTGGC TTCACAACAG 900 

CTGCTGCCTT CCAAGTGCTG GTCTCACAGC TAAAGATTGT CCTCAATGTT TCAACCAAAA 960 

ACTACAATGG AGTTCTCTCT ATTATCTATA CGCTGGTTGA GATTTTTCAA AATATTGGTG 1020 

ATACCAATCT TGCTGATTTC ACTGCTGGAT TGCTCACCAT TGTCGTCTGT ATGGCAGTTA 108 0 

AGGAATTAAA TGATCGGTTT AGACACAAAA TCCCAGTCCC TATTGCTATA GAAGTAATTG 114 0 

TGACGATAAT TGCTACTGCC ATTTCATATG GAGCCAACCT GGAAAAAAAT TACAATGCTG 1200 

GCATTGTTAA ATCCATCCCA AGGGGGTTTT TGCCTCCTGA ACTTCCACCT GTGAGCTTGT 1260 

TCTCGGAGAT GCTGGCTGCA TCAT TTTCCA TCGCTGTGGT GGCTTATGCT ATTGCAGTGT 1320 

CAGTAGGAAA AGTATATGCC ACCAAQTATG ATTACACCAT CQATGGGAAC CAGGAATTCA 1380 

TTGCCTTTGG GATCAGCAAC ATCTTCTCAG GATTCTTCTC TTGTTTTGTG GGCACCACTG 1440 

CTCTTTCCOG CACGGCCGTC CAGGAGAGCA CTGGAGGAAA GACACAGGTT GCTGGCATCA 1500 

TCTCTGCTGC GATTGTGATG ATCGCOATTC TTGCCCTGGG GAAGCTTCTG GAACCCTTGC 1S60 

AGAAGTCGGr CTTGGCAGCT GTTGTAATTG CCAACCTGAA AGGGATGTTT ATGCAGCTGT 1620 

GTGACATTCC TCGTCTGTGG AGACAGAATA AGATTGATGC TGTTATCTGG GTGTTTACGT 16B0 

GTATAGTGTC CAT CAT T CTG GGGCTGGATC TCGGTTTACT AGCTGGCCTT ATATTTGGAC 1740 

TGTTGACTGT GGTCCTGAGA GTTCAGTTTC CTTCTTGGAA TGGCCTTGGA AGCATCCCTA 1800 

GCACAGATAT CTACAAAAQT ACCAAGAATT ACAAAAACAT TGAAGAACCT CAAGGAGTGA I860 

AGATTCTTAG ATTTTCCAGT CCTATTTTCT ATGGCAATGT CGATGGTTTT AAAAAATGTA 1920 

TCAAGTCCAC AGTTGGATTT GATGCCATTA GAGTATATAA TAAGAGGCTG AAAGCGCTGA 1980 

GGAAAATACA GAAACTAATA AAAAGTGGAC AATTAAGAGC AACAAAGAAT QG CAT CATAA 2040 

GTGATGCTGT TTCAACAAAT AATGCTTTTG AGCCTGATGA GGATATTGAA GATCTGGAGG 2100 

AACTTGATAT CCCAACCAAG GAAATAGAGA TTCAAGTGGA TTGGAACTCT GAQCTTCCAG 2160 

TCAAAGTGAA CGTTCCCAAA 3TGCCAATCC ATAGCCTTGT GCTTGACTGT GGAGCTATAT 2220 

CTTTCCTGGA CGTTGTTGGA GTGAGATCAC TGCGGGTGAT TGTCAAAQAA TTCCAAAGAA 2280 

TTGATGTGAA TGTGTATTTT GCATCACTTC AAGATTATGT GATAGAAAAG " CTGGAGCAAT 2340 

GOGGGTTCTT TGACGACAAC ATTAGAAAGG ACACATTCTT TTTGACGGTC CATGATGCTA 2400 

TACTCTATCT ACAGAACCAA GTGAAATCTC AAGAGGGTCA AGGTTCCATT TTAGAAACGA 2460 

TCACTCTCAT TCAGQATTGT AAAGATACCC TTGAATTAAT AGAAACAGAG CTGACGGAAG 2520 

AAGAACTTGA TGTCCAGGAT GAGGCTATGC GTACACTTGC ATCCTGAAAG TGGGTTCGGG 2580 

AGGTCTCTAT GAGCAAGGAA TACAAGACAA AACTTCCTCA ATGCATTGAC TATTTCTTCA 2640 

GACTCAAAAC AC TCATTCTT TTTTCIATTA AGCCATTGAA AGAGAAGCAC TAAGACTGCT 2700 

TCTAGGCTTT ATTTATAAAA TAAACACCTT ATCCCTAACA TGGGCAAAAT GGCTAGAATT 2760 

ATTCAGACGA TTTGGCAGCG TCCAGGGTAA GCTGGTGTTA TAATACGCTG CTGATCTACA 2820 

TCACAGATTT GCTAATAATG TTCACGTGGG CCCTGGCATA TCTCTGTTCA GTTAGAGTGA 2880 

GTGCTGACCC AACAGGCTCT GTOGTCAAGC GAGTCACGAA TGATTAATCA TAAAGAAAAA 2940 

TCAGTTTTTG ACTGACCTGG ATATCCATGA GCTGCACTGA TCACCATGTA AGGTCACATT 3000 

TAGTAAATGC TGAAATAAAA TGATTAATGC ATTTATCAAT AAAAGCCTTT GAAAATACTT 3060 

TGGATAATAA ATTGGAGTTT TAAAAATGCA AATTTGCTTA GTATCTAATA ATGAAGTGTT 3120 

ATTACATATA GCCGGAATTG AGGATCXCTT TGATCCTGGA AATGGTTTAC CTAAAAGCTA 3180 

CAQAACCAGG CCAATATATT TTGAAATATT GATGCAGACA AATGAAATAA TAAAGAGATT 3240 

TTCATGGTTT ATAAAAATCT TTTTTGATAT GATAATAATC ATGATCACAA CTGAGATCAA 3300 

AAAAATATAT GACAGATTAT TTTGTTTAAA AATGCAGTTT TAATTATCTT AGTCTATAGA 3360 

AATGATCATT GCATGGAGGC ATGTATAGGT ATGATCTGTG TAAAATCTGA CATAAAAACA 3420 

GTGCTATTCT GAGTGAAAAT TTTTTTGATG TGCTTACATA ACCATGGTGA TTAAAATGAG 3480 

TTTATATTTT TTCTCAAAAA TTTTAGCAGT GTGTAAAGTA AGTAATCTTT AACTGAACTC 3540 

TGAOCACTTA AAAAAAAATC TAAAAATTGA ACTACCTATA GTAGTCTGTG TTTAAAGTGA 3600 

ATTTTTAAAG ACAAAGCATT CIAAATGAAC TCAATATAAA AACATTCATT TGGAATGTAC 3660 

ATACTGAAAA ATACAGGTTT TTTTGACCAA AAGTTTTTAT ATCTTTTCTT TTTATTTATT 3720 

TTTTTCCTAA GTGCCAACAA TTTTCTAGAT ATTATATACA ACACAGGCTT TGATCTTGGG 3780 

GACTTTTCCC ATATATTTCA CACTGGAGTG AATGAAGTTG TACTTCATTT CTAGAGAAAA 3840 

GTTATACCCA GGTCCCCAAT TGAGAATGTC TTGCTTGATT GAAAACGACA TCATCCCTTG 3 900 

GTATACTCCA GGGATTGGTT TCAGGAGCCC TGCATTTACC AAAATTTGTG CACACTCAAG 3 960 

TCCTGCAGTC ACCCCTGCCT AAAGATAGAA TGGCTTCTCT GTTTTTCTTC TGAAATACAA 402 0 

CCAGAAACAA TGTGICTATT TCTGAAAGAA TAGGATTAAT GATCATACAA ATGGGTTAAT 4080 

OCTGAATTCT GGTTGTAAAT CTGGTTACAG CATAACTAGG ATTATAATGC TGCCTCATTT 4140 

TCACAGCACT ACTTGCTTAT ATTGACAACA AATCATCTCG CTAAAGAGTG AATGTAGGCC 4200 

AGGOGCGGTG GCTCATGCCT GTAATCCCAG CACTTTGGGA GGCCGAGGCG GGTGGATCAC 4260 

GAGGTCAGGA GATCGAGACC ATCCTGGCTA ACATGGTAAA ACGOCGTCTC TACTAAAAAT 4320 

AGAAAAAAAG AAATTAGCCT AGCGTGGTGG CTGGCGGGCG CCTGTAOTCC CAGCTATTTG 43 B0 

GGAGGCTAAG GCAGGAGAAT GGCGTGAACC CGGGAGGCGG AGCTTGCAGT GAGCCGAGGT 4440 

CGTGCCACTG CACTCCAGCC TGGGCQACAG AGCAAGACTC CGTCTCAAAA AAAAAAAAAA 4500 

AAAAAAAAAA AGAGTGAATG TAATAGTCTT GCAGAAAATG AATGAATACC TTTGTTCAAT 4560 

AAAGGAAATA TGCACTGCTC ACTTTTTTGA AGGAAATGCC AAAOTTACGT TTEACAACAA 4620 

GGCTAGAGTT TGTAAATTCT GGGTTCATTT GTGATGACAT AAGTCAGCAA ACTGCGGGAA 4680 

TACTGTCTCT TCTATGTATT TTGTGAATAG TAAGCATAAT TTTAGTTTTG TATTATCAAT 4740 

GAAAATTTCA CTTGAAATTA AAGCTQCCTT TTGTTATATT TTTAACCTAT AGGATAAGAT 4B00 

TCCAGTATTG TATATGAGTT TTAACAAATT AAAAAATCAA ATCATGTACA TTTGAAAATA 4B60 

TTTGCACACA TTTAAAAATA AATOTAAAGT TGTCTTTTAA ACTACTCGGA TGTGTCCTTT 492 0 
CXGAACAAAA 



A212 Protein gequeoce: 
Gene names 
Unigene number-; 
Probe ^et Accesaion 
Protein Accession # : 
Signal sequence : 



Solute carrier family 26, member 

Ha .159275 

AF0 30880 

043511 

none found 
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Transmembrane domains t 
Cellular Localization: 



81-103, 105-131, 136-158, 165-207, 221-243, 245-261, 270-291, 295-317, 
347-369, 386-408, 420-442, 44B-470, 4HS-508 
plasma membrane 



1 
I 

MAAPGGRfiEP 
AFGVIiKTXATP 
FFPILTYFIF 
RDTARVLIAS 
NVSTKNYNGV 
P1EVTVTIIA 
YAIAVSVGKV 
QVAQ1ISAAI 
IWVPTCrVSl 
EPQGfVKT liRF 
KNGTISDAVS 
DCGAISFLDV 
TVHD A I LYIiQ 
QDEAMRTLAS 



11 

I 

FQLPEYSCSY 
ILEWLPKYTCV 
GTSRHISVGP 
ALTbLVGIIQ 
I»SIIYTIiVBI 
TAISYGANLB 
YATK5fD YT ID 
VMIAILALGK 
IDGIiDLQIiLA 
SSPIFYGNVD 
THNAFEPDED 
VGVRSLRVIV 
NQVKSQEGQG 



21 
} 

MVSRPVYSEL 
KEWliLSDVIS 
FPWSLMVGS 
UFGGLQXGF 
FQNIODTNiA 
BMYNAGIV3C8 
GNQEFIAFGI 
X.LEPI*QKSVL 
GLIFGI*I»XW 
GFKKC1KSTV 
IKDIjSTTTiDIP 
KEFQRIDVWV 
SlIaETITLIQ 



31 

t 

AFQQQHERRL 
GVSTGLVATL 
WLEMAPDEH 
IVRYIADPLV 
DFTAGLLTIV 
IPRGFLPPEL 
SNIFSGFFSC 
AAWJANLKQ 
LHVQFPSWNG 
OFDAIRVYWK 
TKEIEIQVDVf 
YFASLQDYVJ 
UOCDTIiKLIE 



41 
I 

QERKTLRESLi 
QGMAYALIAA 
FLV9SSNGTTV 
GGFTTAAAFQ 
VCMAVKELHD 
PPVSLFSEML 
FVATTAX.SKT 
MFMQL.CDIPR 
IjGSIPSTDXY 
RXiKAIjRKIQK 
NSSLPVKVHV 
EKLBQCGFFD 
TELTEEELDV 



51 
I 

AKCCSCSRXR 
VPVGYGLYSA 
LNTTHIDTAA 
VLVSQLKIVL 
RFRHXIFVFI 
MSFSIAWA 
AVQESTGGKT 
LWRQNX1DAV 
KSTKNYKNIE 
LIKSGQLftAT 
PKVPIHSLVL 
DHIRKDTFFL 
0DEAMRTLAS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A213 DNA SEQUENCE: 
Gene name; 
Uhigene number : 
Probes et Accession # i 

Nucleic Acid Accession 
Coding sequence ; 



estb, Highly similar to calcium- activated pot as alum channel rSK2 [H .norvegicus} 

HS.9B280 

AA418000 

NM_021614 

4S8-2197 (underlined se<iuencee correspond to start and stop codons) 



1 11 21 31 

1111 
CGGCGGCAGC AGOCCATGCC TCOGGTGCAA CAGCTGCGCC 
GGGGGCGGGA GATAACCTGT CCCTGCTGCT OCGCACCTCC 
GACOCGCACC TCCTCGCCGC TGTCGGGCTC GTCCTGCTGC 
CCGGGGCAGC CAGCTCAATG TGAGCGAGCT GACGGOGTCC 
GCAGCAGTAC GCGCAGCAGT OCGCGCAGCA GTCGGCGTCC 
CCACAGGCTG CAGCCCGGCG CCAGCCCCAC GGGCAGCCTC 
CCCGCTCTCG CACCACCACC ACCACCCGCA CCCGGCGCAC 
GGCBCQCCGC GAGAGCAACC CCTTCACCGA AATAGCCATG 
GGGCGTCATG CGGCCGCTCA GCAACTTGAG CGOGTCCCGC 
CTCABAGGOG CAGCCCCTGC AGCOCCCCGC GTCTGTCGGA 
CCCGTCTGG& GACGCTGCCG OCGOCGCCGC TGTTT0GTCC 
GTCTAAGCCC GAGCACAACA ACTCCAACAA CCTGGCGCTC 
CAGCACTGGA GGAGGCGGGG GCGGTGGAGG GAGCGGGCAC 
GTGCAGCAAA AAGAAAAACC AGAACATCSG CTACAAGCTG 
CGAAAAGOGC AAGCGGCTCA GCGACTACGC GCTCATCTTC 
CATGGTCATC GAGACCGAflC TGTCGTGGGG CGCCTA0GAC 
AGCTCTGAAA TGCCTTATCA GTCTCTCCAC GATCATCCTG 
CCACGCCAGG GAAATACAGT TGTTCATGGT GGACAATGGA 
CATGACTTAT GAGCGTATTT TCTTCATCTG CTTGGAAATA 
CATACCTGGG AATTTATACAT TCACATGGAC GGGCOGGCTr 
CACAAOCACC GCTGATGTGG ATATTATTTT ATCTATACCA 
GATTGCCAGA GTCATGCTTr TAJCATAGCAA ACTTTTCACT 
TGGAGCACTT AATAAGATAA ACTTCAATAC ACGTTTTGTT 
ATGCCCAGGA ACXGTACXCT TGGTTTTTAG TATCTCAITA 
TGTCCGAGCT TGTGAAAGGT ACCATGATCA ACAGGATGTT 
GATGTGGTTG ATATCAATAA CTTTTCTCTC CATTGSTTAT 
ATACTGTGGA AAAGGAOTCT GCTTAjCTTAC TGGAATTATG 
GGTGGTAGCT GTAGTGGCAA GGAAGCTAGA ACTTACCAAA 
TrrCATGATG GATACTCAGC TOACIAAAAG AGTAAAAAAT 
GGAAACATGG CTAATTTACA AAAATACAAA GCTAGTGAAA 
AAGAAAACAT CAACGAAAAT TCCTGCAAGC TATTCATCAA 
GCAGAGGAAA CTGAATGACC AAGCAAACAC TTTGGTGGAC 
CATGTATGAT ATGATTTCTG ACTTAAACGA AAGGAGTGAA 
TACCCTGGAA ACAAAACTAG AQACTTTGAT TGGTAGCATC 
AAGCCAGAOC ATCAGGCAGC AGCAGAGAGA TTTCATTGAG 
CAAGCACGTC ACTTACAATG CTGAGCGGTC CCGGTCCTCG 
TTCCACAGCA CCACCAACTT CA1CAGAGAG TAGCTAGAAG 
TAAGACTTTT TO CCAT CAT A TGGTCAATAT TTTAGCTTTT 
CTAATCAGCG TTATOCGGGT TCTGATGTCA GAATCCTGGG 
GGCCAAAATG AGTGAAAACT CTTTTTTTTT CTTTCAGATG 
TTGCTATATA GATTGTTCCT CCTGTAATTT CACTAACTIT 
AACTTTACTA CTACATTATA TGATATATAA TAAAAAAAGT 



41 51 
1 I 

TCCTCCGGTG CCCCGGCGGC 60 

TCGCOCGGCG GCGCCTTCCG 120 

TGCTGCTGCT GCTCGTCGCG 180 

AGCCATGCCA GTGCGCTCCG 240 

GCCTCCCAGT ACCACCAGTG 300 

GGCAGTCTGG GCTCCGCGCC 360 

CACCAGCACC AOCAGCCCCA 420 

AGCAGCTGCA GGTACAACGG 4.BD 

CGGAACCTCC ACGAGATGGA 540 

GGAGGTGGCG GCGCGTCCTC 600 

TCAGCCCCCG AQATCGTGOT 660 

TATGGAACCG GCGQCGGAGG 720 

GGCAGCAGCA GTGGCACCAA 7B0 

GGCCACCGGC GCGCCCTGTT 840 

GGCATGTTCG GCATCOTGGT 900 

AAGGCGTOGC TGTATTCCTT 960 

CTCGBTCTGA TCATOGTOTA 1020 

GCAGATGACT GGAGAATAGC 1080 

CTGGTGTGTG CTATTCATCC 1140 

GCCTTCTCCT ATGCCCCATC 1200 

ATGTTCTTAA GACTCTATCT 1260 

GATGOCTCCT CTAGAAGCAT 1320 

ATGAAGACTT TAATGACTAT 1380 

TGGATAATTG CCGCATGGAC 1440 

ACTAGCAACT TCCTTGGAGC 1500 

GGTGACATGG TACCTAACAC 1560 

GGTGCTGGTT GCACAGCOCT 1620 

GCAGAAAAAC ACGTGCACAA 1680 

GCAGCTGCCA ATGTACTCAG 1740 

AAGATAGATC ATGCAAAAGT 1800 

TTAAGAAGTG TAAAAATGGA 1860 

TTGGCAAAGA CCCAGAACAT 1920 

GACTTCGAGA AGAGGATTGT 1980 

CACGCCCTCC CTGGGCTCAT 2040 

GCTCAGATGG AGAGCTACGA 2100 

TCCAGGAGGC GGCGGTCCTC 2160 

AGAATAAGTT AACCACAAAA 2220 

ATTGTAAAGC CCCTATGGTT 2280 

AACCTGAACA CTAAGTTTTA 2340 

CACAGGGAAT GCACCTATTA 2400 

TTATTCATGC ACTTCAAACA 2460 

taatttcgga 



A214 Protein aeqaence; 
Gene name t 
Unigona number: 
Probeset Accession fli 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 



ESTs, Highly similar to calcium- activated potassium channel rSK2 [R . norvegicus 

HS. 98280 

AA418000 

KP_0€7627 

none found 

135-1S7, 168-190, 208-230, 254-276, 306-328, 342-364, 373-394 



933 



WO 03/042661 



PCT/US02/36810 



Calmodulin binding domain; 
Cellular Localization; 



412-48S 

plasma membrane 



3. 
I 

WSSCRYHGGV 
SSAPETWSK 
LGHKRALFEK 
LLGI^XIVYHA 
IAPSYAPSTT 
VMKTIiMTICP 
YGDMVPNTYC 
UAAANVLRET 
DLAKTQNIMY 
EAQMBSYDKH 



11 
1 

HRPLSHLSAS 
PEHUNSNNLA 
RKRL5DYAI*! 
REIQLFMVDH 
TADVDIILSI 
GTVLIiVFGIS 
QKGVCLLTGI 
WLIYKKTKLV 



21 
I 

RRNLHEHDSE 
LYGTGGGGST 
PGMFGIWWV 
GADDWRIAMT 
PMFLRLYI/IA 
LWI IAAWTVR 
MGAGCTALW 
KKIDHAKVRX 
EDFEXRIVTL 



31 
I 

AQPLQPPASV 
GGGGGGGGSG 
IETELSWGAY 
YERIFPICLE 
RVMLLHSKI,F 
ACERYHDQQD 
AWARlOiELT 
HQRKFIiQAIH 
ETKLETLZGS 



VTYWAERSRS SSRRRRSSST APFTSSE9S 



41 

! 

GGGGGASSPS 
HGSfiSGTKSS 
DKAJSLY8UVL 
ILVCAMPIP 
TDASSR8IGA 
VTSNFLGAMW 
KABKKVHNFM 
QliRSVKMEQR 
IHALPGLISQ 



51 
I 

ADAAAAAAVS 
KiOCNONrGYK 
KCLISLSTII 
GNYTFTWTAR 
LNKINPHTRP 
LISITPLSIG 
MDTQLTKRVK 
KLNDQANTLV 
TIRQQQRDFI 



GO 
120 
lflO 
240 
300 
360 
420 
480 
540 



A215 PKft SEOUEMCEi 
Gene name : 
Unigene nuinber ; 
Probes et Access ion #s 
Nucleic Acid Accession #z 
Coding 



CGI-B6 protein 

Hs. 109201 
AW161450 
NM_01602 9 

226-1097 {underlined sequences 



espond to start and stop codons) 



1 
\ 

CTGCT3ATCQC 
GGGCGTGCGC 
TQCTCCTGCT 
TATGGGCCGA 
TQACTGGAGC 
TTTCTCTTGT 
TAQAGAATGG 
CTGGTTCCCA 
TGGTCAACAA 
ACAGAAAGCT 
CTCACATGAT 
TATCTGTACC 
ATQGCCTTCG 
GACCTGTGCA 
GCAATAATGG 
TCAGCA1GGC 
CATATTTGTG 
AAAGGATTGA 
AGACAAAACA 
AAACATGAAA 
ACTTTTTAAT 
AGATTGCCAT 



11 

I 

GCAGGGCAGC 
GGCCGCAATG 
CTTGGTGCAG 
GXGGCAGGQA 
CTCGAGTOGA 
GCTGTCAGCC 
CAATTTAAAA 
TGAAGCGGCT 
TGGTGGAATG 
AATAGAGCTT 
CGAGAGGAAG 
TCTTTCCATT 
AACAGAACTT 
ATCAAATATT 
AGACCAGTCC 
CAATGATTTC3 
GCAATACATG 
GAACTTTAAG 
TGACTGAAAA 
ACAGCAATCr 
AGATATGACT 



21 
I 

GACGCGACTC 
AACTGGGAGC 
CTGCTGOGCT 

ATTGGTGAGG 
AGAAGAGTGC 
GAAAAAGATA 
ACCAAAGCIG 
TCCCAGCGTT 
AACTACXTAG 
CAAGGAAAGA 
GGATACTGTG 
GCCACATACC 



CACAAGATGA 
AAAGAAGTTT 
CCAACCTGGG 
AGTGGTGTGG 



TCTTATGCTT 
TTGCTTCCAA 



31 
I 

TGGTGCGGGC 
TGCTGCTGTG 
TCCTGAGGGC 
AATGGGAGCT 
AGCTGGCTTA 
ATGAGCTGGA 
TACTTGTTTT 
TTCTCCAGSA 
CTCTGTGCAT 
GGACGGTGTC 
TTGTTACTGT 
CTAGCAAGCA 
CAGGTATAAT 
CCCTAC5CTGG 
CAACCAGTCG 
GGATCTCAGA 
CCTGGTGGAT 
ATGCAGACTC 
ACTTTTCAAG 
CTGAATAATC 
CATGGAATGA 



41 
I 

C3CTGCTQGTG 
TGACGGCGAC 
GACTGATATG 
CCAGTTGTCT 
AAGGGTGAAA 
GCCCCTTGAC 
GTTTGGTAQA 
GGATACCAGC 
CTTGACAAAA 
GAATAGCATC 
TGCTCTCCGG 
AGTTTCTAAC 
AGAAGTCACA 



ACAACCTTTC 
AACCAACAAG 
TTCTTATTTT 
CCACTGGAGG 
AAAGACTAAT 
AATAAAAAAT 



51 
I 

CCCCCGAGCT 
CTGTGCGCGC 
CK5ACGCTAC 
GTGGTGTGGG 
AAACTAQGAG 
AGAAGATGCC 
CTGACCGACA 
ATOGACATTC 
TTGGATGTCT 
TGTGTTCTGC 
CTGGGTATCA 
GGTTTTTTTA 
ATTTGCCCAG 
AAGACTATAG 
CTGATGTTAA 
TTGTTAGTAA 
ATBGGGAAGA 
AAAATCTTTA 
GAGAAATGGA 
TTGTGATTTT 
AAATAATAAA 



SO 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
•780 
B40 
900 
960 
1020 

loao 

1140 
1200 
1260 



A216 Protein sequence: 
Gene name; 
Unigene number: 
, Probeset Accession #* 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
Cellular- localization: 



CGI -0G protein 
Hs. 1092 01 
AH161450 
NP_057113 
1-26 

1S3-206, 221-243 
plasma membrane 



1 11 21 31 41 51 

\ I I I 1 | 

MNWBXJjLWLL VLCALLTiTiT>V QLLRFliRADG DLTDDWAEWQ GRHPEWEIjTD HWWVTGASS 60 
GIGKKT.AYQli SKLGVSIiVT>S ARRVHELRRV KRRCLENGNL KEKDILVLPli DI/TDTGSHEA 120 
ATBXVLQEPG RIDIbVNNGG MSQRSLCMDT SliDVYRKLIE LNYIX3TVSI/T KCVfcPHMIER 180 
KQGKIVTVMS ILGII3VPL9 JGYCA3KHAL RGFFNGLRTE LAIYPGIIVS NICPGPVQ3N 240 
rVEHSLAGEV TKTIGKKGDQ SHKMTTCRCV B1MLISMAND LKEVWISEQP PT.LVTYLWQY 300 
MPTWAWWITN KMQKKRIENF KSGVDAD8SY PKIFKTKHD 



A217 DMA SEQUENCE: 
Gene names 
unigene number: 
Probeset Accession #s 
Mticleic Acid Accession #: 
Coding sequences 



Homo aapie 
Hs, 27373 
P13036 
ACO 12 47 8 
1-1896 



mRHAj cUNA PKPZp56401763 (from clone X»KFZp56401763) 



{underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 \ I I I I 

ATGCGCGCCG TGCCGCTGCC CGCCCCGCTC CTQCCGCTGC TGCTGCTCGC GCTCCTGGCC 60 

GCTCCCGCCG CCCGCGCCAG CAGAGCCGAG TCCGTCTCCG CGCCGTGQCC CGAACCCGAG 120 

CGCGAGTCac GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 1B0 

GCGGCGGGCG GCAGCOGCAG CTCCAGCTCC AACAGCAGTG GCGACGCCTT GGTGACCCGC 240 

934 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



ATTTCCATCC 
TTTACCACCC 
AACAAGACAC 
CTAAATGAAG 
TCCTTGCCGG 
GTGCCCCCAC 
GGTGGAATCA 
TCAGCTGCAA 
GAAACCAAAA 
TCAGACTGTC 
TTTGOQCATC 
CTGAATCTCA 
GCCAGGTGCT 
CCTTGGTGGC 
AGTACCTTGG 
AGAGCAGTGA 
GCCCTAGAGA 
ACOCATCCTG 
AGGGTQTTTC 
TGCCTTTTGG 
ATCTGICTCC 
GTCCTTGCGT 
AAACGAGGGC 
GCOAACCT0C 
CACCCTGGAG 
GATCJQCAQAT 
TGTGAGAGGC 
TCCCCCCGA.C 



TCCTCCGCGA 
TCCTCATCGC 
GCAAGTATGA 
AGGATOATGA 
CTGCACTGAG 
CCTTCATCCT 
GACCTGGTAA 

CGAACGTTAT 
ACTGGCAAGC 
CCTTTAAAGT 
TGGAAAAGCT 
TGCCACTGGT 
ACTTCAGCGC 
GCTTGGATGT 
CTAAGGTTCT 
GTGGGTT TCC 
TCAGGTTGGC 
GGCGTCCQCG 
TTTTGAAGAT 
CCTGCTGTGC 
CATTTGAGCA 
CCAGTGGGCA 
AGACAATTCC 
GAGACTTGGA 
GCCAGAAOAT 
TCACAGGTTC 
AGCCCCTOTT 



CCTACCCACC 
CTGOCTGCTG 
TATCATCACC 
AGATGAGGAC 
ACGTCAGCTG 
CGACATTGAC 
AACCTGTTTC 
GAAGGACTGG 
GTATAAAACC 
TCGTTTCCAC 
GCCCCCTACT 
GGATTCCTCT 
CCTGGCAGftA 
CACAGGCTCT 

CCAGGGTAGC 
CCATCATCTC 
TCGTTCAGAT 
GCAGTCTCTG 
TCTGTTGAGG 
CGTGGAACAC 
GAGCCCACAA 
ATTGACCAGA 
AGATACCCAA 
TGGGGTGGCA 
GOTCCTGATG 
CCATCACTTC 
TCTGTCCAGG 



CTCAAGGCAG 
CTGCGCGTCT 
ACTCCAGCAG 
TCCACAGTAT 
CCAGGGTGCC 
CTTCCAQCAA 
CCAGCCTGGT 
ACCTGGAAGC 
CCAGCTCCAT 
GTCACCACAA 
TCTACTCCCC 
GCCTTACGCA 
ATGGCGGCTG 
CCAATAAAAA 
GCCGGCCAGC 
TCTTTCTCCA 
AGGCTTCTCA 
GCCOGGGGAC 
CATGGCGGAG 
CGCCATCCTC 
CTACGGGAAG 
AAGGCAQCTG 
CACACATGCC 
GGCCAGGAAG 
AATTTCTATT 
TCTGAGGAAG 
TCCAGCCATT 
CCCTGA 



CCGTGATCGT 
TCAGGTCGGG 
AGCGAGTGGA 
TOGACATCAA 
AGACGCTACT 
GATGCAGTGG 
GGCATCCTGT 
CCTCTTGCGT 
OGTQCGTGTC 
TGGAGT TGCT 
ATGGTTTTCG 
GAAACACCCG 
CTGAAAGTGA 
CCCTTTACAC 
GGGGCACCTT 
AACAGCTGCG 
GAGAGTGTCC 
AAGCCAGCCT 
GGTCAGCGGG 
ACCTTGACCT 
CCAAGAGAAG 
CTGCCCACGG 

GCCCAGGTGA 
TGOAOGAAOA 
GGCCACCTAG 
CCAAGTCTTG 



GGCGTTCGCC 
AAAGAGOTTA 
AATGGCGCCA 
ATACAGAGTC3 
GACAGTTCCT 
AAGGCCTGAT 
GGAAAGTTGG 
CGGAGGTGTT 
AGGCATCTGC 
TCTGCCACCC 
ACAACTGCAG 
GGCTCCATCT 
CCTTCCAAAT 
ACAAACCAXG 
TTGTQAAGAC 
CTGGAAGCCA 
TCCGCTGAGC 
GACGGGGAGQ 
TACCGCAACT 
CTTCTACAAA 
CTCAGTGACT 
GGAGCCTGTC 
GATCACACAT 
GGATGTCACT 
GGGTITCCAG 



GTCCTTCCTT 



300 
3 60 
420 
480 
S4D 
600 
£60 
720 
7B0 
840 
900 
960 
102 0 
1080 
3140 
1200 
1260 
1320 
13 B0 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



A218 Protein sequence; 
Gene name: 
Unigene numbers 
Pxobeeet Accession #s 
Protein Accession #: 
Signal sequence: 
Trains membrane domains: 
Cellular Localization: 



Homo sapiens mRNA; cDNA DKFZp56401763 (from clone DXFZpS6401763) 

VB. 27373 

F13036 

FGENESH predicted 
1-27 

94-115. 448-469 
not determined 



1 
I 

MRAVPLPAPL 



TOCTHKYDIIT 

vpppf riiDin 

LNLMEKLDSS 
STIiGLOVPCG 
THPVRLARfiD 
ICCPCCAVEH 
ANLQTrPDTQ 
CERTjTGSHHP 



11 
1 

LPLUJjftLlA 
NSSGDALVTR 
TPAERVEMAP 



21 
1 

APAARASRAE 
ISIliLKDLPT 



PAPSCVSGIC 
ALRRNTHAPa 
AGQRGTFCBD 
AKGQASI/TGK 
LRKAKRSSVT 



SDCHWQAREH 
ARCLPLVLAE 
RAVTKV&QGS 
RVFRRPRQSL 
VLASFEQ3PQ 



31 
I 

SVSAFWPEPB 
IiKAAV I VAFA 
STVFDIKYHV 
PAWWHPVESW 
VTTMELLJ.PP 
KAAAESDLPN 
SFSKQIiRWKP 
HGGGSAGTAT 
KAAAAHGEPV 



41 

1 



FTTLLIACLL 
SltPAALERGL 
SAATWGVKDW 
PGHPFKVPPT 
PWWHFBATGS 
ALESGFPHHL 
CLLVLKILLR 



51 
I 

PGNTTBFGS3 
LRVFRSGKRL 
PGCQTLUTVP 
TWKPSCVGGV 



PIKTLYTQTM 
RUjRECPELS 
RHPBLDLFYK 



GQEGPRHDVT HPGGIXLDGVA NP5fLEEEGPQ DGRCQKMVLM SEEGPPSLTG 
3SESKSWSFL SPRQPLFL9R P 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



(endothelial adhesion molecule 1) 



A219 DMA SEQUENCE 

Gene names select in E 

Unigene number: He. 89546 

Probeset Accession #; M24736 
Nucleic Acid Access Ian NM_0004SO 

Ceding 1 sequence i 1-1833 [underlined sequences correspond to start and atop codans) 



1 11 21 31 41 

111)1 
ATGA TTGCTT CACAGTTTCT CTCAGCTCTC ACTTTOGTGC TTCTCATTAA 
GCCrGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG ATGAGGCCAG 
CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG AAGAGATTGA 
TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA TCAGAAAAGT 
TGGGTCTGGG TAGGAAGCCA GAAACCTCTG ACAQAAQAAG CCAAGAACTG 
GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA 
GATGTGGGCA TGTGGAATGA TGAQAGGTGC AGCAAGAAGA AGCTTGCCCI 
GCTGCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT GTGTAGAGAC 
TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAQT GTGAGCAAAT 
ACAGCCCTGG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT 
AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGGTT ACCTGCCAAG 
ACCATaCAOT GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG 
GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTCGTGG AATGTTTCCA 
AGCTTCCCAT GGRACACAAC CTOTACATTT GACTGTGAAG AAGGATTTGA 
GCCCAGAGCC TTCAGTGTAC CTCAICTGGG AATTGGK3ACA ACGAGAAGCC 
GCTGTGACAT GCAGGGCCGT CCGCCAGCCT CAGAATGGCT CTGTGAGGTG 
CCTGCTGGAG AOTTCACCTT CAAATCATCC TGCAACTTCA CCTGT3AGGA 
TTGCAGGGAC CAGCCCAGGT IGAATGCACC ACTCAAGGGC AGTGGACACA 
GTrTGTGAAG CTTTCCAGTG CACAGCCTTG TCCAACCCCG AGCGAGGCTA 
CTTCCTAGTG CTTCTGGCAG TTTCQGTTAT GGGTCCAGCT GTGAGTTCTC 
GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 



SI 
! 

AGAGAGTGGA 
TGCTTATTGT 
GTACCTAAAC 
CAACAATGTG 
GGCTCCAGGT 
GAGAGAAAAA 
ATGCTACACA 
CATCAATAAT 
TGIGAACTGT 
GGGAAACTTC 
CAGCATGGAG 
CAATGTGGTT 
AAACCCTGGA 
AjCTAATGGGA 
AACGTBTAAA 
CAGCCATTCC 
AGGCTTCATG 
GCAAATCCCA 
CATGAATTGT 



60 
120 
190 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



935 



WO 03/042661 



PCT/US02/36810 



GAGAAGOCCA CATGTGAAGC TGTGAGATGC GATOCTGTCC ACCAGCCCCC GAAGGGTTTG 1320 

GTGAQGTGTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 13 BO 

TGTGAGOAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC TCAGGGACAA 1440 

TGGACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATQTT CAAGCCTGGC AGTTCCQGGA 1500 

AAGATCAACA TQAGCTGCAG TGGGGAGCCC GTGTTTGGCA CTGTGTGCAA GTTCGCCTGT 1560 

CCTGAAGGAT GGAOGCTCAA TGGCTCTGCA GCTCGGACAT GTOOAGCCAC AGGACACTGG 1620 

TCTGGCCTGC TACCTACCTG TGAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 

CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 

TGCTTACGGA AAGCAAAGAA ATTTGTTCCT GCCAGCAGCT GCCAAAGCCT TGAATCAGAC IB 00 
GGAAGCTACC AAAAGCCTTC TTACATCCTT TAA 



A220 Protein ggguencei 
Gene names 
Unigene numbers 
Probeset Accession $ - 
Protein Accession #; 
Signal sequence: 
Transmembrane domains: 
C- lectin domain: 
Cellular Localization: 



Selectin E [endothelial 

Hs.89546 

M24736 

NP_000441 

1-22 

555-573 

23-139 

plasma membrane 



adhesion molecule 1) 



1 11 21 3x 41 51 

I I I I I 1 

, MIASQFIjSAIi TI*VLLIKE£G AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYIiM 60 

SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK 120 

DVGMWKDERC SKKKLALCYT AACTNTSCSG HGECVETJNK YTCKCDPGFS GUCCEQIVNC 180 

TALESPEHGS LVCSHPMNF SYNSSCSISC DHGYI/PSSME TMQCMSSGBW SAPIPACWV 240 

ECUAVTNPAH- GFVBCFQKPG SFPWNTTCTF DCEBGFELMG AQSLQCTSSG KWENBKPTCK 300 

AVTCRAVROP QNGSVRCSHS PAGBFTFKSS CNFTCBEGFM LQGPAQVECT TQQQHTQQIP 360 

VCZAFQCTAL SNPERGYMUC IiPSASGSFRY GSSCEFSCEQ GFVUDSSKRL QCGPTGEWDN 420 

E KPXC EAVRC DAVHQPPKGL VRCAHSPIGB FTYKSSCAFff CEHGFK3UYGS TQLECTSQGQ 4 B0 

WTKEVPSCQV VKCSSLAVPG KINMSCSGEP VPGTVCKPAC PEGWTLSJGSA ARTCGATGHM 540 

SGLLPTCEAP TESNIPLVAG LSAAGLSLLT 1APPLLWLRK. CLRXAKKFVP ASSCQSIiESD 600 
GSyQKPSYII* 



TaxoX Prostate 
A221 DMA SEQUENCE 

Gene namei ESTs ; Liprin A2 

Unigene number: Hs. 3064 80 

Probe set Accession # •. N51002 
Nucleic Acid! Accession #i M51002 

Coding sequence: 1-3793 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 SI 

I 1 I I I | 

ATGATGTGTG AAGTGATQCC CACGATTAAT GAGGACACCC CAATGAGCCA AAGGGGGTCC 60 

CAAAGCAGTG GCTCGGACTC AGACTCCCAT TTTGAGCAGC TGATGGTGAA TATGCTAGAT 120 

GAAAGGGATC GTCTTCTAGA CACCCTTCGG GAGACCCAGG AAAGCCTCTC ACT1GCCCAG 1B0 

CAAAGACTTC AGGATGTCAT CTATGACCGA GACTCACTCC AGAGACAGCT CAATTCAGCC 240 

CTGCCACAGG ATATCGAATC CCTAACAGGA GGGCTGGCTG GTTCTAAGGG GGCTGATCCA 300 

CGGGAATTTG CTGCACTGAG AAAAGAATTA AATGCCTGCA GGGAAGAACT TCXAGAAAAG 360 

GAAGAAGAAA TCTCTGAACT TAAAGCTGAA AQAAACAACA CAAGACTATT ACTGGAGCAT 420 

TTGGAGTGCC TTGTGTCACG ACATGAAAGA TCACTAAGAA TGACGGTGGT AAAAOGGCAA 480 

GCOCAGTCTC CCTCAGGAGT ATCCAGTGAA GTTGAAGTTC TCAAGGCACT GAAATCTTTG 548 

TTTGAGCAGC ACAAGGCCTT GGATGAAAAG GTAAGGGAGC GACTGAGGGT TTCTTTAGAA 600 

AGAGTCTCTG CACTGGAAGA AGAACTAGCT GCTGCTAATC AGGAGATTGT TGCCTTGCGT 660 

GAACAAAATG TTCATATACA AAGAAAAATG GCATCAAGOG AGGGATCCAC AGAGTCAGAA 720 

CATCTTGAAG GGATGGAACC TGGACAGAAA GTCCAIGAGA AGCGTTTGTC CAATQGTTCT 780 

ATAGACTCAA CCGATGAAAC TAGTCAAATA GTTGAACTAC AAGAATTGCT TGAAAAGCAA 840 

AACTATGAAA TGGCCCAGAT GAAAGAACGT TTAGCAGCCC TTTCTTCCCG AGTGGGAGAG 900 

GTGGAACAGG AAGCAGAGAC AGCAAGAAAG GATCTCATTA AAACAGAAGA AATGAACACC 960 

AAGTATCAAA GGGACATTAG GGAGGCCATG GCACAAAAGG AAGATATGGA ASAAAGAATT 1020 

ACAACCCTTG AAAAGCGTTA CCTCAGTGCT CAGAGAGAAT CTACCTCCAT ACAlGACATG 1060 

AATGATAAAC TAGAAAATGA GTTAflCAAAT AAAGAAGCTA TCCTACGGCA GATGGAAGAa 1140 

AAAAACAGAC AGTTACAAGA ACGTCTTGAG CTAGCTGAAC AAAAGTTGCA GCAGACCATG 1200 

AGAAAGGCTG AAACCTTGCC TGAAGTAGAG GCTGAACTGG CTCAGAGAAT TGCAGCCCTA 1260 

ACCAAGGCTG AAGAGAGACA TGGAAATATT GAAGAACGTA TGAGACATTT AGAGGGTCAA 1320 

CTTGAAGAGA AGAATCAAGA ACTTCAAAGA GCTAGGCAAA GAGAGAAAAT GAATGAGGAG 1380 

CATAACAAGA GATTATCQGA TACGGTTGAT AGACTTCTGA CTGAATCCAA TGAAOGCCTA 1440 

CAACTACACT TAAAGGAAAG AATGGCTGCT CTAGAAGAAA AGAATGTTTT AATTCAAGAA 1500 

TCAGAAACTT TCAGAAAGAA TCTTGAAGAA TCTTTACATG ATAAGGAAAG ATTAGCAGAA 1560 

GAAATTGAAA AGCTGAGATC TGAACTTGAC CAATTGAAAA TGAGAACTGG CTCTTTAATT 162 0 

GAACCCACAA TACCAAGAAC TCATCTAGAC ACCTCAGCTG AGTTGOGGTA CTCAGTGGGA 1680 

TCCCTAGTGG ACAGCCAGTC TGATTACAGA ACAACTAAAG TAATAAGAAQ ACCAAGGAGA 1740 

GGCCGCATGG GTGTGDGAAG AGATGAGCCA AAGGTGAAAT CTCTTGGGGA TCACGAQTGG 1800 

AATAGAACTC AACAGATTG3 AGTACTAAGC AGCCACCCTT TTGAAAGTGA CACTGAAATG I860 

TCTGATATTG ATGATGATGA CAGAQAAACA ATTTTTAGCT CAATGGATCT TCTCTCTCCA 1920 

AGTGGTCATT CCGATGCCCA GACGCTAGCC ATGATGCTTC AGGAACAATT GGATGCCATC 1990 

AACAAAGAAA TCAGGCTAAT TCAGGAAGAA AAAGAATCTA CAGAGTTGCG TGCTGAAGAA 20 40 

ATTGAAAATA GAGTGGCTAG TGTGAGCCTC GAAGGCCTGA ATTTGGCAAG GGTCCACCCA 21Q0 



936 



WO 03/042661 



PCT/US02/36810 



10 
15 
20 
25 
30 



45 
50 
55 
60 
65 
70 
75 
80 



GGTACCTCOV 
GGACACTCAA 
GTCATGACAC 
GATGGTCGAG 
GCCCTCAGAA 
TCTGTCTCTC 
CTTCACAAAG 
AAAGAAAAAQ 
GAGTCCCTGG 
AASCATGAAC 
CCAACTGTGG 
TGCCGAQCCA 
AGAGAAATTG 
ATGGTTTCCC 
TGGGTGACTC 
GAAGGAAGCT 
CATGAGTGGA 
TTTATGGAAT 
GTCCATTTAA 
TTAAAGAGGT 
GAAATAAAAG 
GGACTTCGAQ 
CTGGATGAAA 
ACCCAGGCAA 
AGGCGACTGG 

TTTCCrccrc 

GCTGGATTTA 
TCATCAAGAC 
GCGGCCGCTT 



TTACTQCCTC 
CTCCAAAGCT 
TGCCAAGTGA 
AGGACAAAGC 
TGACTCACAC 
TTGAGCCAGA 
CCCCCAAGAA 
CTCGACTTQG 
GGTTAGGCAA 
TTCTTGAAGA 
TCGCATGGCr 
ACGTGAAQAG 
GAATCAGCAA 
TAACAAGTCC 
ATGAAGAAAT 
GGGCCCAGTG 
TTGGAAATGA 
GCTTGGTAGA 
AAATGGTGGA 
TGAATTATQA 
ACGTGTTGGT 
AATATOCAAA 
ACT TTGACTA 
GGCAGATTCT 
ATGAAAGTGA 
GTGAAGTAjCA 
GGTTAACCAC 
TGCAGAGGTT 
TAA 



TGTTACAGCT 
CACCCCTOSA 
TCTGAGGAAA 
AACAATTAAA 
TCTCCCTTCT 
AAGCCTCGGG 
GAAAGGAATC 
GCAGCTCCGA 
ACTCGGAACT 
AGCTCGOAGA 
AGAGCTTTGG 
TGGTGCCATC 
TCCACTGCAT 
TTCAGCTCCT 
GGAAAATCTT 
TOCOQTTTE T 
ATGGCTTCCC 
TGCAAGAATD 
TAGTTTC CAT 
CAQAAAAGAA 
GTGGAGCAAT 
TAATATACTT 
CAGCAGCTTA 
TGAAAGAGAA 
TGACAAGAAC 
SGGAATCAGC 
AACCTCTQGG 
AGACAACTCC 



TCATCGCTGG 
AGCCCTGCCA 
CATOGGAGAA 
TGTGAAACTT 
TCCTACCACA 
CTTGGTAQTG 
AAGTCTTCAA 
GGCTTTATGG 
CAAGCTGAGA 
AAGGGATTAC 
TTGGGAATGC 
ATGTCTGCTT 
CGCTTAAAAC 
CCAAGATCTC 
GCAGCTCCAG 
CTACAGACCG 
AGCTTGGGGT 
TTAGATCACC 
CGAACAAGTT 
CTAGAAAGAA 
GACOGAATTA 
GAGAGCGGTG 
ACTTTATTAT 
TACAATAACC 
TTCAGACGTG 
ATGATGCCTG 
CAATCAAGAA 
ACTQTTCGCA 



ccagtxcatc 
gggaaatgqa 

AGATTGCAGT 
CTCCTCCTCC 
ATGATGCTCG 



TAGGACGTTT 
AGACTGAAGC 
AGGATCGAAG 
CTTTTGOCCA 
CTQCOTGQTA 
TATCTGACAC 
TTCGATTAGC 
GAACTCCTTC 
CAAAAAOGAA 
TGGCTTATGG 
TACCTCAGTA 
TAACAAAAAA 
TACAATATGG 
GAGGGGAAGC 
TTCGCTGGAT 
TGCATGGCTC 
TACAQATTCC 
TCTTGGCCCT 
GATCAACCTG 
GGTCCTCAGA 
AAATGACAAC 
CATACTCATG 



TCCCCCCAOT 
TCGGATGGGA 
TGTQGAAGAA 
TACCCCTAGA 
AAGTAGTTTA 
CGAAGACTCT 
GTTTGGTAAA 
TGCAGCTCAG 
ACTAAAGAAA 
GTGGGATGGG 
CGTGGCAGCC 
TGAGATCCAG 
AATCCAGGAG 
AGGCAACGTT 
AQAATCTGAG 
AGATATGAAT 
CAGAAGTTAC 
AGATCTCCGT 
AATTATGTGC 
AAGCCAACAT 
ACAAGCAATT 
ACITATAGCC 
AACACAGAAC 
GGGAACXGAA 
GAGAAGOCAG 
AACATTACCA 
AGATGTTGCT 
TCTCGAGTAA 



A222 Protein sequence; 

Gene names 
_ c Untgene number : 
33 Probes et Accession # i 

Protein Accession #: 

Signal sequence i 

Transmembrane domains -* 

AAA dom&in i 
40 SAM domains: 

Cellular Localization: 



estb; biprin A2 
Hs. 3 06480 
N51QD2 

none found 
none found 
none found 
286-533 

895-964, 1017-1084, 
not determined 



1105-1177 



QRLQBV rXDR 



FEHHKALDEK 
HLEGMEPGQK 
VEQEAETARK 
HDKLiENELAN 



11 
I 

EDTPMSQRGS 
DSLQRQLNSA 
RKWTSiLIiEH 
VRERLRVSLE 



21 
I 



LPQMESLTG 



QLHIiKSHMAA 
EFTIPRTHLD 
NHTQQIGVIiS 
NKEIRLIQEE 
GHSTPKLTPR 
ALRMTHTLPS 
KEKARL§QLR 
PTWAWIiELW 
MVS&TSPSAP 
HSWJGNEWU 
ItKRLNYDRKE 
IiDENFDYflSL 
FPPREVHGIS 



DLIKTEEMNT 
KHAZLUQMEE 
EERMRHLEGQ 
LEEKNVXjIQE 
TSAELRYSVG 



KESTELRAEB 
BPAREKDRMG 
SYHMDARSS& 
GFMETEAAAQ 
IiGMPAWYVAA 
PTSRTPSGMV 
SLGLPQYRSY 



RVSALEEELA 
IU3TDETSQI 
KYQRDIREAM 
XNRQLQKKLE 
TiKFKMQKLQR 
SETFRKOTjER 
SIiVDSQSDYR 
gni nnnrro h t 
IENRVASVSL 
VWTLPSDLRK 

ESLGLGKLGT 
CRANVKSGAI 



31 
1 

FEQ1HVNMLD 
GLAGSKGADP 
SliRHTWKRQ 
AAJJQBIVAIiR 
VHLQELLEKQ 
AQKEDMEERI 
IiAEQKLQQTM 



TLIiLQlPTQJff 
MMPGSSETLP 



FMEQiVDARM 
EIKDVLVWSN 
TQARQUjERB 
AGFRLTTTSG 



SIiHEKERLAE 
TTKVIRRPRR 
rFSSMDLIiSP 
EGUW1ARVHP 
HRRK1AVVEE 
IiGSANSSCfDfl 
QAEKDRKLKK 
MSAlaSDTElQ 
AAPAKTKBSE 
IiDHIiTKKDLR 
DRIIRWIQAI 
YHHLLALGTE 
QSRKMTTJ5VA 



41 
1 

EHDRLLDTlrR 
PEFAALTKEL 
AQSPSGVEfiE 
BQKVHXQRKM 
NYKMAQMKKR 
TTXjEKRYXiSA 
RKAETLPEVE 
HNKRLSDTVD 
ElS£dxR3ELD 
GRMGVRRDEP 
SGHfitJAQTLA 
GTSITASVTA 
DGREDKATIK 



EEIG1SNP1M 
EGSHAQCPVP 
VHLKMVBSFH 
GLRRYAJOTII. 
RRLDKSDDKU 



2160 
2220 
2280 
2340 
2400 
24£0 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3X20 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
372 0 
3780 



51 
I 

ETQESIiSIiAQ 
HACRE QLIiEK 
VEVUCALKSL 
ASSEGSTESE 
IiAALSSRVGE 
QRE3TSIHDM 
AELAQRIAAIj 
KLLTE8NERL 
QLKMRTGSLI 
KVKSLGDHEK 
MMLQEQLDAl 
SSliAflSSPPS 
CSTBPPPTPR 
KSSIGRLFGK 
KGLPFAQWDG 
RltKLRIiAIQE 
IaQTLAYGDMN 
RTSLQYGIHC 
HHGVHGSLIA 
PRRGSTHRRQ 
TVRTygCliE 



GO 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
IDflO 
1140 
1200 



A3 23 DMA SEQUENCE 
Gene name; CDA24 
Unigene number: H» .26813 

Probeeet Accession #x M32912 
HUcleic Acid Accession #: HM_016570 

Coding sequence i 1- 1134 (underlined sequences correspond to start and stop codons) 



1 
I 

ATGA GGCGAC 
AAGGTTCCTG 
TTTACAACTA 
AAGTATGAAT 



GTTGCATCTG 
AAAGAGTGGC 



11 
I 

TGAATCGGAA 
AGAGCTATGT 
TGGCTTTATT 
ACGAAQTAGA 
TGAAGTGTCA 
CAGATGGTTT 
AGAGOATGCT 
TATTTAAAAG 



21 
I 

AAAAACTTTA 
AGAQACTTCA 
AACCATAATG 
CAAGGATTTT 
ATATGTTGGA 
AGTTTATGAA 
GCAGCTGATT 
TGCTTTTAAA 



31 
\ 

AGTTTGGTAA 
GCCAGTGGAG 
GAATTCTCAG 
TCTAGCAAAT 
GCOQATGTAT 
CCAACAGTAT 
CAGAGTAGGC 
AGTACATCAA 



41 
I 

AAGAGTTGGA 
GTACAGTTTC 
TATATCAAGA 
TAAGAATTAA 



TXGATCTTTC 
TACAAGAAGA 
CAGCTCTTCC 



51 
I 

TGCCTTTCCG 60 

TCTAATAGCA 120 

TACATGGATG 1B0 

TATAGATATT 240 

AGAAACAATG 300 

ACCACAGCAG 360 

GCATTCACTT 420 
480 



937 



WO 03/042661 



PCT/US02/36810 



10 



GATGATTCAT 
GTAGCAGGGA 
CATTTGGCAG 
TCTTTTGGAG 
ATAGATCACA 
TATAAAATAT 
CATGCTGCAG 
ATGGTGACAG 
ATTGTTGGAG 
GAAATAATTT 
GAGGATGGCC 



CACAGTCTCC 
ATTTTCACAT 
CACTTGTCAA 
AGCTTGTTCC 
ACCAGATGTT 
CAG CAGACAC 
GCAGCCATGG 
TTACTGAGGA 
GAATCTTTTC 
GCTGTCGTTT 
ACACAGACAA 



AAATGCATGC 
AACAGTGGGC 
CCATGAATCT 
AGCAATTATT 
CCAATATTTT 
CCATCAGTTT 
AGTCTCTGGG 
GCACATGCCA 
AACAACAGGC 
CAGACTTGGA 
OCACTTACCT 



AGAATTCATG 
AAGGCAATTC 
TACAATTTTT 
AATCCTTTAG 
ATTACAGTTG 
TCTGTGACAG 
ATATTTATGA 
TTCTGGCAGT 
ATGTTACATG 
TCCTATAAAC 
CTTTTAGAAA 



GCCATCTATA 
CACATCCTCG 
CTCATAGAAT 
ATGGAACTGA 
TGCCAACAAA 
AAAGGGAACG 
AATATGATCT 
TTTTTGTAAG 
GAATTGGAAA 
CTGTCAATTC 
ATAATAC3VCA 



TGTCAATAAA 
TGGTCATGCA 
AGATCATTTG 
AAAAATTGCX 
ACTACAtACA 
TATCATTAAC 
CAGTTCTCTT 
ACTCTGTGGT 
ATTTATAGTT 
TGTTCCTTTT 
TTGA 



540 
600 
660 
720 
780 
£40 
900 
960 
1020 
1080 



15 
20 
25 
30 



A224 Protein sequence; 

Gene name: * CDA14 

unigene number: Hs.26813 

Probesefc Accession # = N32 912 

Protein Accession ft: NP_0S76S4 

signal sequence: none found 

Transmembrane domains: none found 

Cellular Localization: nuclear 



1 
i 

MRRIiNRKKTL 
RYEYEMDKDF 
KEWQRMLQLiI 
VAGNFHTTVG 
IDHNQMFQYF 
KVTVTEEHMP 



11 21 31 41 51 

11111 

SLVXELDAFP KVPESYVETS ASGGTVSLIA FTTHALLTIM EFSVYQDTWH 60 

SSKLHIKIDI TVAMKOQYVG ADVLDliAETM VASADGLVYE PTVFDLSPQQ 120 

QSRIiQBEHSIi QDV1PKSAFK 3 T STAIjP PRE DDSSQSFNAC RIHGHLYVNK 100 

KA1PHFRGEA HLAALVKHES YNFSHRIDHL SFGELVPAII 1TPLDGTSKIA 240 

ITWPTKLHT YKISADTHQF SVTERERIIN HAAGSHGVSG TFMKYDLSSL 30 0 

FWGFFVRLCG IVGGIFSTTG MLRGIGKFlV EIICCRFRLG SYKFVNSVPF 360 
KLENNTH 



35 



40 



Uterine 

A22B DMA, SEQUENCE; 
Gene names 
Unigene number : 
Probe set Accession ft; 
Nucleic Acid Accession #t 
Coding sequence : 



ESTs, Weakly similar to JB0350 Anterior gradient: -2 [h. sapiens] 

Hs. 100686 

AA487468 

AA487466 

55-555 (underlined sequences correspond to start and stop codousl 



45 
50 
55 
60 
65 
70 
75 
80 



i 
1 

CGGCACCAAG 
CTACRCTCAG 
GCAATAAAAA 
ACTTGGGXAC 
ATGGTTATTC 
CAAAATGAAG 
GAAACCACTG 
GACCCTTCTT 
TATGAGCCTC 
CAGTCAGAGC 
GAAGAAAACC 
TTACTATTTA 
TCTGAAAAAA 



11 
I 

AOCACTGGCC 
CTTTGGGTCT 
AGGAAAAOAG 
AAACTTATGA 
ATCACCTGGA 
AAATACAAGA 
ATAAGAA.TTT 
TAACAGTTAG 
GGGATTTACC 
TATAAGAGAT 
TCTGGCACAT 
GTTTTTTTAA 
AAAAAAAAAA 



21 
1 

AAGTCAGCTT 
CTGCCTCTTA 
GCCTCCTCAG 
AGAAGGTCTC 
GGATTGTCAA 
AATGGCTCAG 
ATCACCTGAT 
AGCTGACATA 
CCTAT TGAT A 
GATAGAAAAA 
TGACAAATAC 
TGTGTTTGCA 
AAAAAAAAA 



31 
1 

CTTCTGAGAG 
CTCGTCACAG 
ACACTCTCAA 
TTTTATGCTC 
TACTCTCAAG 
AATAAGTTCA 
GGGCAATATG 

gctggaagat 
gaaaacatga 

AGCCTTCACT 
TAAATGTGGA 
ATAQTCTTAT 



41 

I 

AGT CTCTAOA 
TTTCTTOCAA 
QA0GATGGGG 
AAAAAAGTAA 



TCATGCTAAA 
TGCCTAGAAT 
ACTCTAACAG 
AGAAAGCATT 
TCAAAGAAGT 
AGTATATAQA 
TAAAATAAAT 



51 
I 

AGA CATGA TG 
CCTTGCCATT 
AGATGACATC 
GAAGCCATTA 
AGTATTTGCC 
CCTTATGCAT 
CATGTTTGTA 
AITGTACACA 
AAGACTTATT 
CAAATTTCAT 
TTTTGTAATA 
GTTTTTTAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



A226 Protein sequence ; 
Gene name : 
unigene numbers 
Probes et Accession #x 
Protein Accession #s 
signal sequence: 
Transmembrane domains : 
Cellular Localization: 



ESTs, weakly similar to JE03 50 Anterior gradient-2 tH. 

Hs. 100686 

AA487468 

none found 

1-23 

none found 
secreted 



apiens] 



1 11 21 31 41 51 

I I 1 I t 1 * 

MHLHSALGLC I^LVTVBBNL AIAIKKEKRP PQTLSRGWGD DITWVQTYBE QLFYAQKSKK 60 

PIjMVIHHLED cqysqalkkv faqneeiqem aqkkfimlnl MHBTTDKNLS PDGQYVPRIM 120 

FVDP81.TVRA DIAGRYSNSL YTYEFRDLFL LIENKKKALR 1.IQSKL, 
A227 DMA SEQUENCE 

Gene name: a protein-coupled receptor 49 (GPR49) (HG38) (LGR5> 
Unigene number: Hs. 285529 

Probeset Accession #2 AA460S30 
Acid Accession #: NH_003667 

201-2924 {underlined sequences correspond to start and stop codone) 



Coding sequence: 



938 



WO 03/042661 



PCTAJS02/3681© 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



1 
I 

GTGGCGGCAA 
TCAGGAACGC 
OGCCOGGCGC 
GGCCCCCTAC 
TGCTQCTGCA. 
GCCCCACACA 
ACCTGGGGCT 
GTATGAACAA 
AGTTACGT CT 
ACAGTCTTAA 
TGCAGAATTT 
CCCCAAGCTG 
TAACA.OAAAT 
CCCTGAACAA 
TTCTACATCT 
ACAGCCTAGA 
GGACACTCTC 
AGAAAGCATT 
AATTTGTTGG 
GTGCCTCACA 
CTTTAACTGG 
TCCAAGTGCT 
AAAAGCTTCA 
TCCAGCAGTT 
ACCCCAATGC 



ATGCCTTACA 
CTTATGCTTA 
AATGGAATAA 
TTCAGGCTCA 
AAGCCCTTCA 
TQCITOATOG 
GTAATGCTTT 
TGTTAATTGG 
CTGGTGTGGA 
QGGTTGOTTG 
TGCrTAClCT 
CGAAAGCTCC 
CCATGGOCGC 



CCCTTTGCTT 
BAGACCTGGA 
CCAACTGCAT 

catttatcag 
GTCTCAATCC 
TGAGAAAGCA 
CTGATGATGT 
CCAGCATCAC 
AGAGCTGCCA 
AA.TGTTTTCA 
ATAAGAAGAG 



11 
1 

CCGGCAOCTC 
GGCGTCTGGC 
GCCACGGCCC 
TTCGGGCACC 
GCXGGCGACC 
CTGTCATTGC 
CTCGGAGCTG 
CATCAGTCAG 
TGCGGGAAAC 
AGTTCTTATG 
GCGAAGCCTT 
TTXCAGTGGC 
CDCXX3TCCAG 
AATACACCAC 
CCATAACAAT 
GACTTIAGAT 
CAACCTTAAA 
TGTAGGCAAC 
GAQATCTGCT 

aataactgaa 
agcacagatc 
agatctgtct 

GAAAATTGAJC 
GCTTAGCCTC 
ATCTTOCACI 
TCCTATAACT 
GAGCTTGATA 
CCAGTGCTGT 
AGGTGACAAC 
AGATGAAGGT 
TTCAGTGCAG 
CTGGCTGATC 
GGTGACTTCA 
GQTCATCGCA 
TGCGTTCACT 
CCATGTCATT 
GGCAGCCCTG 
ATTT TCT AGC 
AGTTCCCCTG 
TGGGGAGCCC 
CCTCATGATG 
GAATATTTGG 
CCTAAACTGC 
TCCTGAAGTA 
CCTTCTCTAC 
AACCTACGTC 
CGAAAAACAG 
TTATGACCTG 
TCTTTCCTCT 
AAGGTTGAGA 
CIGAGGTGAA 



21 
I 

AGTCCCCGCC 
GCTC3CAGACG 
GTAGCAGTCC 
ATQOACACCT 
GGGGGCAGCT 
GAGOCCGACG 
CCTTCCAACC 
CTGCTCCDGA 
GCTCTGACAT 
CTGCAGAATA 
CAATCCCTGC 
CTGCATTCCC 

GdnriAGAA 
ATACCAGACT 
AGAATCCACT 
TTAAATTACA 
GAACTAGGAT 
CCTTCTCTTA 
TTTCAACATT 
TTTCCTGATT 

TCATCrcrrc 

TACAACCTAT 
CTAAGACATA 
CGATCOCTQA 
TTGCCATCCC 
QGQTTACATG 
TCATCTGAAA 
GCATTTGGAG 
AGCAGTATGG 
GAGCTTGAAG 
TGTTCACCTT 
AGAATTGGAG 
ACAGTTTTCA 
GCAGTGAACA 
TTTGGCAGCT 
GGTTTTTTGT 
GAGCGTGGGT 
CTGAAAGTAA 
CTGGGTGGCA 
AGCACCATGG 
ACCATTGCCT 
GACTTOCTCTA 
CCTGTGGCTT 
ATTAAGTTTA 
ATCTTGTTCA 
TGGACAAGAT 
TCCPGTGACT 
CCTCCCAGTT 
GTGGCATTTG 
ACCTGAAAAT 
ACTCGGTTTA 



31 
I 

GCGCTTCTCC 
CCCGCTGAGT 
GGTGCTGCTC 
CCOGGCTCGG 
CTCCCAGGTC 
GCAGQATGTT 
TCAGCGTCTT 
ATCCCCTGOC 
ACATTCCCAA 
ATCAGCTAAG 
GTCTGGATGC 
TGAGGCACCT 
GTTTATCGGC 
ATOCCTTTGG 
CCCTGGGAAA 
ATAACCTTGA 
TTCATAGCAA 
TTACAATACA 
TACCTGAACT 
TAACTGGAAC 
CTCAAACCGT 
TAGAAGAXTT 
ATGAAATCTA 
ATTTGGCTTG 
TAATAAAGCT 
GTTTAACTCA 
ACTTTCCAGA 
TGTGTGAGAA 
ACGACCTTCA 
ATTTCCTGCT 

CCCcaogccc 

TGTGGACCAT 
GATCCCCTCT 
TGCTCACGGG 
TTGCAC5GACA 
CCATTTTTGC 
TCTCTGTGAA 
TCATTTTGCT 
GCAAGTATGG 
GCTACATGGT 
ACACCAAGCT 
TGGTAAAACA 
TCTTGTCCTT 
•rCCXTClGGX 
ATOCTCACTT 
CAAAACACGC 
CAACTCAAGC 
CCGTGCCATC 
TCCCATGTCT 
GTGAGATTGA 
AA 



41 
1 

TCGCCGCCCA 
TGCAGAAGCC 
TCCGCCCGCG 
TGTGCTCCTG 

GCTCAGGGTG 
CACCTCCTAC 
CAGTCTCOGC 
□GGAGCATTC 
ACACGTACOC 
TAACCACATC 
GTGGCTGGAT 
ATTGCAAGCC 
AAACCTCTCC 
GAAATGCTTT 
TOAATTCCCC 
GAATATCAGG 
TTTCTATGAC 
AAGAACACTG 
TGCAAAjQCTG 
CTGCAATCAG 
ACCCAGTTTT 
CGAAATTAAA 
GAACAAAATT 
GGACCTATCG 
CTTAAAATTA 
ACTCAAGGTT 
TGOCTATAAG 
TAAGAAAGAT 
TGACTTTGAG 
CTTCAAAGCC 
AGCAGTTCTG 
GTACATTTCC 
AGTCTCCAGT 
TGGTGCCTGG 
TTCAGAATCA 
ATATTCT0CA. 
CTGTGCOCTG 
CGCCTCCCCT 
CGCTCTCATC 
CTACTGCAAT 
CATTGCCCTG 
CTCCTCTTTA 
GGTAGTCCCA 
TAAGGAGQAT 
AAGCTTGATG 
CTTGGTAACC 
AOCAGCTTAT 
C TAAT TAATA 
GTATATCAGA 



SI 
1 

CGCCGTGGGG 
CACGGAGCGG 
TCCGGCTCGT 
TCCTTGOCTG 
CTGAGGGGCT 
GACTGCTCGG 
CTAGACCTCA 
TTCCTGGAGG 
ACTGGCCTTT 
ACAGAAGCTC 
AGCTATGTGC 
GACAATGCGT 
ATGACCTTGG 
AGCTTGGTAG 
GATGGGCTCC 
ACTGCAATTA 
TCGATACCTG 
AATCCCATCC 
ACTCTGAATG 
GAGAGTCTGA 
TT ACCTAAT C 
TCAG1CTGCC 
GT TQACACTT 
GCTATTATTC 
TCCAACCTOC 
ACAGGAAATC 
ATAOAAATGC 
ATTTCTAATC 
GCTGGAATGT 
GAAGACCTGA 
TGTGAACACC 
GCACTTACTT 
CCCATTAAAC 
GCCGTGCTGG 
TGGGAGAATG 
TCTGTTTTCC 
AAATTTGAAA 
CTGGCCTTGA 
CTCTGCCTGC 
TTGCTCAATT 
TTGGACAAGG 
TTGCTCTTCA 
ATAAAOCTTA 
CTTCCTGCAT 
CTGGTGAGCC 
TCAATTAACT 
TXTACCAGCT 
CCAGTGACTG 
TGTGAAGGAA 
GCAGTAATTA 



€0 
12 0 
1BO 
240 
3 00 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10BO 
1140 
1200 
1260 
1320 
13 BO 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
25B0 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



A228 Protein, sequence 
Gene names 
Uaigene number: 
Protein Accession #i 
Signal sequence: 
Transmembrane domains > 
Cellular Localization $ 



Q protein- coupled receptor 49 (GEK49) <HG38> (fcGRS) 

Be. 285529 

NP_00365a.l 

1-22 

S57-579, 596-618, 641-663, 687-709, 724-746, 770-792, 
plasma membrane 



803-825 



1 
1 

PSNIiSVFTSY 
LQNNQLRHVP 
AFHSLSALQA 
LNYNNLDEPE 
FQHLPELRTIj 

KPSLIKUJLS 
AFGVCEMAYK 
CSPSFGPFKP 
AVNMLTGVSS 



STKGYHVAI.I 
PVAFBSFESL 
WTRSKKPSLM 
VAFVFCL 



11 
1 

SLEYJaLQTAT 
LDtrfiMNWIfiO 
TEALQHLRSIj 
MTLALNKIHH 
TAIRT^SNLK 
TLUGASQITE 
SVCQBXQXID 
SNIibSSFPlT 
ISNQWNKGDN 
CEHBIiDGWIjT 
AVLAG VDAFT 

kfetkapfss 

ItLMSLCPLMM 
IKLXFI SP EV 
SIH3DDVEKQ 



21 
I 

GGSSPR3GVI. 

QSLRLDAKHI 
IPDYAFGNL3 
ELGPHSNNIH 
FPD I/TGT ANL 
LRHNEITEIK 
^zj .wrayri7T.yT» 
SGMDDLHKED 
RIGVWTIAVL 
FGSFARHGAW 
UCVIIM.CAL 
TIAYTKLYOf 
IKPHiVWP 
SCDSTQAIjVT 



31 
I 

LHGCPTHCHC 
FLEELALAGN 
fiYVPPSCFSG 
SLWLHLHNN 
SIPEKAPVGN 
ESLTLTGAQI 
VDTFQQLiLaii 
TGNHALQSIil 
AGHPQAQDER 
AI/TCMALVT6 
WEHGVGCHVI 
LAL.TMAAV Pli 
LDKGDLBHIH 
LPAC1NPLCY 
FTSS8ITYDL 



41 
1 

EPDGRHliLllV 
ALTYIPKGAP 

HIHGLGKKCF 
PSIilTIHFYD 
SSIjEQTVCNQ 
RSU9LAHNK1 
SSKNFPBLKV 
DlfEDFlkliDFE 
TVFRSPLYIS 
GRLSIPASES 
LGGSKXGA&P 
DCSKVKHIAL 
ILFNPHFKKD 
PPSSVPSPAY 



51 
1 




dcsdlglskl. 


60 


TGLY8LKVLK 


120 


DHAliTElPVQ 


180 


DGLHSIfETLD 


240 


KP1QFVGRSA 


300 


LPNLQVU>LS 


360 


AIIHPMAFST 


420 


IEMPYAYQCC 


480 


EDLXALHSVQ 


540 


PIKLLIGVIA 


600 


SVPIiTUVAL 


660 


LCLPLPFGEP 


720 


LI.FTHCIUE}C 


7 SO 


LVSLRKQTYV 


840 


PVTESCHLSS 


900 



Table 75- SeeTabbl 
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Table 76A deplete Seq ID No; UnlgenelO; UnlgeneJKfe; PKey; Predicted Cellular Localization; and Exemplar Accession for each of the sequences in Table 78, The Information In 
Table 76A is linked by SeqlD No io Table 78. 



Seq 10 No: 

Pkey: 

ExAccn: 

UnJgetielD: 

UnlganeTlllK 

Prod Subcell Loc: 



Seq ID No 

SeqlD1&2 
SeqlD3&4 
SeqlD5&6 
5eqlD7&B 
Seq[D9*10 
Seq ID 11 &12 
Seq ID 13 & 14 
SeqlD15&16 
SeqlDl7&l8 
Seq]D19&20 
SeqlD2U22 
Seq ID 23 & 24 
Seq ID 254 26 
Seq ID 274 26 
Seq ID29&30 
SeqlD31&32 
SeqlD33&34 
SeqlD35&35 
Seq ID 37 & 38 
SeqlD39&40 
SeqtD41&42 
Seq!D43&44 
SeqJD45&46 
Seq ID 47448 
SeqtD49&50 
SeqlD51&52 
Seq ID 534 54 
SeqlD 55*56 
Seq ID 57 4 53 
Seq ID 59460 
Seq ID 614 62 
Seq ID 634 64 
Seq ID 65466 
Seq ID 674 68 
Seq ID 69470 
Seq ID 71472 
Seq ID 73474 
Seq ID 76476 
Seq ID 77478 
SeqlD79&80 
Seq ID 81 482 
SeqlDB3&84 
SeqlDB5&88 
Seq ID 87486 
Seq(D89490 
Seq ID 914 ©2 
SeqlD 93 4 S4 
Seq!D95496 
Seq ID 97498 
Seq ID 39 4 100 
Seq ID 101 4 102 
Saq ID 1034104 
8Bq ID 1058,106 
Seq ID 1074 108 
Seq ID 1094 110 
Seq ID 111 4 112 
Seq ID 1134 114 
Seq ID 1154 116 
Seq ID 1174 11B 
Seq ID 119 4 120 
Seq 10 1214122 
Seq ID 1234124 
SeqID12S4126 
Seq ID127 4128 
Seq ID 129 4130 
SeqlD 131 4132 
Seq ID 133 4 134 
SeqlD 1354136 
Seq ID 1374138 



Sequence ID No for sequences In table 
Unique Eos probesef Identifier number 
Exemplar Accession number, Genbank 
Unlgene number 
Unlgenegene tide 
Predicted sub^Mar localization 



accession number 



Ptey ExAccn UnfcenelD UnlgeneTIBe" 



425023 
424503 
429423 
4Q0289 
419172 
4180D7 
418007 
407B36 
414577 
418738 



415136 
429276 
418994 
407975 
429113 
41B50S 
423961 
414812 
417433 
417433 
424399 
422967 
428227 
4443B1 



411558 
400363 
4117B9 
428696 



421552 
45Z747 
415539 
416636 
416638 
4Q9079 
442082 
400297 
451398 
429220 
421524 
423242 
423242 
423242 
452190 
452190 
325372 
450375 
426215 
425247 
429353 
429353 
432201 
427585 
446163 
442117 
428179 
431211 
447033 
447033 
447033 
115522 



446051 
422048 
410418 
446342 
422260 



AW956889 

NMJXJ2205 

AJ016712 

XD7820 

AW338625 

M13509 

M13509 

T79340 

AI05654B 

AW3B8833 

BE440042 

C18356 

AF056065 

AA296520 

X89426 

D28235 

AA084248 

D13666 

X72755 

BE270266 



L32137 

AA321649 

BE387335 

AW602166 

M102670 

AA243756 

AF245505 

AA852773 

W27249 

AF0266S2 

BE153855 

AT733881 

N3253S 

N32536 

WQ7707 

R41823 

AI127076 

A1793124 

AW207206 

AA312082 

AL0394O2 

ALQ39402 

AL0394O2 

K26735 

H26735 

AA009847 

AW963419 

NMJ30594O 

AL1 17406 

AL1 17406 

A153B613 

D31152 

AA026880 

AW664964 

AM27772 

M86649 

A1357412 

A1357412 

AI357412 

BE614387 

Z42387 

BE048061 

NM.012445 

D31382 



AA315993 



Hs.1 54210 

Hs.149609 

Ha267797 

H&2258 

rls2212fJ 

Hs.83169 

Hs.83169 

Hs.200272 

HsJ2116 

Hs>66B2 

Hb.63326 

Hs.295944 

Hs.198612 

Ks.69546 

Hs.41716 

Hs.196384 

Hs.65339 

Hs,136348 

Hs.77367 

Hs.82128 

Hs.82126 

HS.15B4 

H&22AB 

Hs2B3713 

Hs_222399 

rte.70725 

Hs.79136 

Hs72167 

Hs33483B 

Hs.ei09 

Hs.105700 

Kb.61460 

Hs.72472 

KG42645 

Ks.42645 

Ha.82065 

Hs.7413 

Hs.306201 

RM44479 

Hs.tQ5445 

Hs.125763 

Hs.125763 

Hs.125763 

H&3166B 

H&9166B 



Hs.155223 

Hs.155324 

HS335W1 

HS.33SS91 

rfs.288241 

Ks.179729 

Hs. 25252 

K&.128899 

Hs,Z7S696 

Hs. 3 23733 

Hs.157601 

HsJ57601 

HM57601 

HSJ33893 

HsJ3683 

Ha37054 

H&28B126 

VK63325 

Hs.14846 

Hs.105484 



endothelial differentiation, sphmgolipi 
Integrin, alpha 5 (fibronectfn receptor, 
Integrin, beta 1 (fibronecbn receptor, 
matrix rristolloprofetoase 10 (strornelysln 
ESTs; similar to TRANSMEMBRANE 4 SUPERF 
matrix metalloprdsinase 1 (Interstitial 
matrix melalo proteinase 1 (interstitial 
B-catt CLUymphoma &, member B, zinc fi 
hypothetical protein FU 20992 similar to 
solute carrier family 7, (cafcnlc amino 
malrix mBtaloprotelnase 3 (stromafysin 
Bssue factor pathway inhibitor 2 
G protein-coupled receptor 51 
selecfin E (endothelial adhesion moiecul 
endothelial ceflnSperific molecule 1 
prodaglaidln-endoperoxide synthase 2 (p 
G protein-coupled receptor 39 
perfostfn (0SF-2os) 
monokine Induced by gamma Interferon 
5T4 oncofetal tropfioblast glycoprotein 
5T4 oncofetal tropfioblast glycoprotein 
AI9066B7:IL-BfT095-1 901 99*019 BT095 Homn 
canHage oHgomertc matrix protein (pse 
small inducible cytok&ie subfamly B (Cy 
ESTs, Weakly aWlar to S64054 hypotheO 
CEGP1 protein 

garnrna-arnlnobulyric acid (GABA) Arecepto 
LtV-1 protein, estrogen regulated 
Adlcan 

WAA1B66 protein 
hypothetical protein FU210B0 
secreted feteckeJated protein 4 
!g superfamily receptor INIR 
BMP-R18 

solute carrier family 16 (monocarboaylic 
solute carrier family 1 6 (monocarboxylic 
mterleukfn 6 signal transducer (gp130t 
ESTs 

hypolhatfcal protein DKFZp56401 27B 

ESTs 

ESTs 

GDNF fsmSy receptor alpha 1 
DEME-6 protein 
DEME^proteti 
DEME-6 protein 

Homo sapiens clone PP149B unknown mRNA 
Komo sapiens clone PP149B unknown mRNA 
Phase243Exona 

a dtelntogrin and melafloprcteinase coma 



matrix. metHHoproteinase 1 1 {stromelyain 
AlTMrindlng cassette trangporfar MRP8 
ATP-bindlng cassette transporter MRP8 
TiansmBrrfcrene protease, serine 3 
collagen, type X. alpha 1 (SchrnW meiaph 
prolactin receptor 

ESTs; hypothetical protein for IMAGE:447 

serutrVoIiJcocorecoid regulated Wnase-I 

gap junction protein, beta Z 26kD (com 

ESTs 

ESTs 

ESTs 

c-MyctargelJP01 

transmembrane, prostate androgen induced 
ephroWtt 

spondln 2, exfrac el liar matrix protein 
transmembrane protease, serine 4 
Homo sapiens mRNA; cONA DKFZp5S4D016 (fr 
regenerating gene type IV 

940 



PredSubcatJjDc 

plasma membrane 
plasmB membrane 
plasma membrane 
secreted 

plasma membrane 
secreted 



plasma membrane 



secreted 

plasma membrane 
plasma membrane 



bilracell 

plasma membrane 

secreted 

secreted 

plasma membrane 
plasma membrane 



secreted 

plasma membrane 
plasma membrane 



plasma membrane 

ihtraceti 

secreted 

plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
fotracell 

plasma membrane 
plasma membrane 
plasma membrane 



Intracdl 



Inbaceti 
cytoplasmic 
plasma membrane 
secreted 



plasma membrane 
plasma membrane 
plasma membrane 
secreted 

plasma membrane 
plasma membrane 
kitracell 

plasma membrane 



secreted 
secreted 
kitracell 

plasma membrane 
plasma membrane 
secreted 

plasma membrane 
plffirna membrane 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



SeqID 139*140 
SeqID 1414142 
SeqlB143&144 
Seq ID 145 & 146 
Seq ID 147 & 148 
SeqID 149 & 150 
Seq ID 151 & 152 
Seq ID 153 & 154 
Seq 101554156 
Seq!D157fil5a 
Ssq ID 159 & ISO 
Seq tD 16? & 162 
SeqID 163 & 164 
Seq ID 165 
SeqID 166 & 157 
SeqID 1GB & 169 
SeqID 170 & 171 
Ssq ID 172 4 173 
Geq ID 174 & 175 
Seq ID 178 & 177 
Seq ID 173 & 179 
SeqlD1B0&iai 
SeqID 182 & 183 
Seq ID 184 & 185 
Seq ID186&1B7 
SeqID 168&1B9 
SeqID 190 & 191 
Seq ID 192 & 193 
Seq ID 194 A 195 
Seq ID 196 & 197 
Seq ID 198 & 199 
Seq1D2OO&201 
SeqID 202&203 
SeqlD2D4&2Q5 
SeqlD206&207 
Seq ID 206 
SeqlD209 &210 
Seq<D211&2l2 
Seq ID 213 & 214 
Seq ID 215 & 21 6 
SeqlD217&216 
SeqlD219&220 
Seq tD 221 & 222 
Seq ID 223 & 224 
Seq)D225&226 
SeqlD227 &226 
SeqlD229 &230 
GeqlD23lA232 
Seq ID 233 & 234 
Seq ID235 &236 
Seq ID 237 & 236 
SeqID 239 &240 
SeqlD241&242 
Seq ID 243 a 244 
Seq ID 245 *246 
Seq ID 247 & 248 
Seq ID 249 & 250 
SeqID 251 
SeqID 252 
Seq ID 253 & 254 
Seq ID 255 & 258 
Seq ID 257*. 258 
Seq ID 259 & 260 
Seq ID 261 & 262 
SeqlD263 &264 
SeqID 2B5& 266 
Seq ID 207 4268 
SeqlD269&270 
Seq ID 271 & 272 
Seq ID 273 
Seq ID 274 
SeqlD275&276 
Seq ID 2774279 
Seq ID 279 & 280 
Seq ID 281 & 262 
SeqJD283&2B4 
SeqlD2S5&286 
Seq ID 287 & 288 
Seq ID 2694290 
Seql029U292 
Seq ID 293 * 294 
SeqlD29S&296 
Seq ID 297 & 296 



409041 
409041 
111929 
111929 
111929 
104688 
420159 
422330 
452461 
413324 
412420 
416658 
407811 
402230 
402230 
432829 
425721 
42&37Q 



AB033025 

AB033025 

AF027208 

AF027208 

AF027208 

AW939591 

AI572490 

D307B3 

N76223 

V00571 

AL035666 

U03272 

AW190902 



Hs.50081 

Hs.50081 

Hs.1 12360 

Hs.1 12360 

Ks.112360 

H6.594Q 

Hs.99785 

Hs.115263 

Hs,106106 

Hs.75294 

Hs.73853 

Hs.79432 

Hs.40096 



W60377 Hs.57772 

AC002115 Hs.159309 

Y13545 Ha.97234 

BE001S3S HS.256B97 



402075 
421110 
451668 
451668 
451668 
408243 
422282 
425B52 
439738 
404875 
425883 
404977 
420876 
427747 
4202B1 
446673 
437553 
437553 



AJ250717 

243948 

Z4394B 

Z43946 

Y00787 

AF019225 

AKD01504 

BE246502 



Hs.326444 

Hs.326444 

Hs,32G444 

Hg.624 

Hs.1 14309 

Hs.159651 

Ks.9598 



AL13770B Hs/161031 



AA916425 
AW411425 



426900 
426900 
426900 



NMJMB361 

AI829935 

A1B29935 

A) 8299 35 

AW163554 

AW163564 

AW183564 



Hs.177744 

Hs.180655 

Ks.323494 

Hs.15871 

H3.130497 

Hb.130497 

Hs.130497 

Hs.1 42375 

Hs.142375 

Hs.142375 



405932 
424006 
424003 
424008 
444342 
421379 
417079 
412986 
412986 
412986 
447072 
419723 
419723 
430890 
456759 
429466 
429466 
419721 
407834 
413472 
438380 
428271 
419704 
444471 



413063 
433800 



458435 
424343 



412709 
435615 
404049 
418932 
404029 
436480 
452401 
452401 



R02740 

R02740 

R02740 

NM_014398 

Y16221 

U65590 

X81120 

XS1120 

XB1120 

D81594 

AL120193 

AL120193 

X54232 

BE269150 

W 35835 

W85335 

NM.O0165O 

UB4540 

BE242870 

T05430 

AF026547 

AA429104 

ABQ206B4 

U46745 

AL035737 

AJ034351 

AI418718 

AI418718 

AW956360 

U5B515 

ALQ22327 

Y15065 

L34059 



Hs.137555 

Hs.137555 

Hs.137555 

Hs/tQ887 

Ha.103982 

Hs.81134 

Hs.75110 

Ha.75110 

Hs.75110 

Hs.17279 

Ha.339810 

Hs.339810 

Hs.2699 

Hs.127792 

Hs.12827 

Hs.12827 

Hs.286650 

Hs,75379 

HsX194 

H&.169047 

Hs.45057 

Ha.11217 

Hs.336678 

Hs.75184 

H5.135150 

Hs.144121 

Hs.144121 

Hs,4748 

Hs.154138 

Hs.74518 

Hs^975 

Ks.89484 



421471 



AJ 271 643 Hs.87469 

NMJW7115 Hs.29352 

HIO07115 Hs.29352 

AF037335 Hs£338 

U90545 Ks-327179 



Hypothetical protein, XPjMIBfiO (KIAA1 19 

Hypothetical protein, XP_05186O (WW 19 

prornWn (mousej-l&e 1 

promlntn jmousB)-1itas 1 

promfrln jfnouse)-like 1 

mucin 1% eprthefel frarernembrane 

Homo sapiens cDNA; FU21245 fis, done C 

eplreguln 

transcription factor 

cortfco&op'm releasing hormone 

bone morphogeneflc protein 2 

fibrlBIn 2 (congenital contracture! ara 

cysteine knot superlarruly 1. BMP an tag on 

FgenRsh predicted: CYTOCHROME P450 4F5 { 

Foenesh predicted: CYTOCHROME P450 4F5 ( 

ESTa 

uroplekin 1A 

uroplaHn2 

ESTs ( WeaWy«lrrtlartodJ365012.1 ]Rsa 
ENSP0C00D2S1O56':Plasma membrane calcium 
cathepsin E 

card age acidic protein 1 
cartage acidic protein 1 
csrtisge acidic protean 1 
Intedaukln B 
epdipoprotam L 

death receptor 6, TNF supertamJIy member 
eema domain, ImmunoglobuBn domain (lg) r 
MM-022B1 9*:Homo sapiens phosphatase A2 
Homo sapiens mRNA; cDN A DKFZp434K0322 {i 
Insullrt-IIke growth fector 2 (somatomedl 
ESTs 

serine/threonine kinase 12 

Predicted cation efflux pump 

LPAP tor iysophosphafidlc acid phosphata 

ESTa, Week!/ scnilflr to MATCLKUMAN CHL0R 

ESTs. WfeaWy similar to MA7B_HUMAM CHLOR 

ESTs, WaaWy similar to MATB JHUMAN CHLOR 

ESTs 

ESTs 

ESTs 

C15000305^3B06122feb|AAC69198.1l (APD 
C1500030S^|3B06122|gblAAC6919B.1| (AFO 
putah'vechemoktne receptor: GTP-Wndfng 
putaQve chernoWne receptor; GTP-Wntfing 
putative chemotdne recaptor; GTP-b?n(Tmg 
arrila-to tyaoaomfi-asaoelHted membrane 
small Inducible cytoWncsubfanVl/ B (Cy 
InterleuUn 1 receptor antagonist 
cannablrioW receptor 1 (brain) 
cannablnokt receptor 1 (brain) 
cannablnold receptor 1 (brain) 
tyfosytproteln sulfotransferase 1 
bngavSy essuranca (IAG1, 8. cerevisiaa 
kxigevrty assurance (IAG1, S. cerevlslae 
tfypbanl 

deltapjosophte)-IRce3 

ESTs 

ESTs 

aqifaportn4 

gb:Human oVa trubrevf n tsofbmi DTM-3 tDTN 
sduta carrier family 1 (gBal high affi 
chondroltln suHate proteoglycan BEHAB/b 
chondroJHn sulfate proteoglycan 3 (neur 
ESTs 

KlAAD877protdn 

dyetrobrevln, alpha 

chttfnase 3-Bke 1 (cartilage glycoprote 

lung type-t est memtrana-associated gly 

ESTs, Weakty similar to T4691 6 hypothec 

ESTs, WeaMy similar toT46916hypothetl 

adBnylate cyclase activa&ig polypeptide 

chib'nase3-Cke2 

WAA0027 protein 

potassium wjltage-gated channel, KQTflc 
NMJJl6937*:Homo sapisnB pratocadherin be 
cadheifn 4> type 1. fcadherin (reb'nal) 
Nf<L016935*:Honio sapiens protocadberin be 
putofive acid-sensing ion channel 
tumor necrosis factor, a^hannduced pro 
tumor nficrosra fector, alpha-Induced pro 
carbonic anhydrasa XII 
Bokite carrier family 17 (sodium phospha 
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secreted 

plasma msmbrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
vilracdl 



secreted 
secreted 
iniracel 
Intracell 
trrtracell 

pi asma membrane 
plasma membrane 
plasma membrane 
secreted 
aecreted 

plasma membrane 



secreted 
secreted 

plasma membrane 
plasma membrane 
iniracel 

plasma membrane 
secreted 

plasma membrane 



plasma membrane 
In trace] 

plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
Intracell 
Iniracel 

plasma membrane 
plasma membrane 
plasma membrane 



plasma membrane 
plasma membrane 
plasma membrane 
ki trace!! 



plasma membrane 
plasma membrane 



plasma membrane 
secreted 

plasma membrane 
secreted & plasma 



membrane 



plasma rrtairtbrane 

secreted 

aecreted 

plasma membrane 



plasma membrane 
secreted 

plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 



plasma membrane 
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Seq ID 299 & 300 
SeqlD30t&302 
Seq ID 303 & 304 
SeqlD305£3Q6 
SBq ID 307 & 303 
SeqlD309 &310 
Seq ID 311 4 312 
Seq ID 313 & 314 
Seq ID 315 6 316 
SeqlD317&318 
Sea. ID 319 & 320 
SeqlD 321 & 322 
Seq ID 323S.324 
SeqlD 325*326 
SeqlD 327 & 328 
SeqlD 329 & 330 
SeqlD331&332 
SeqlD333&334 
Seq ID335&336 
SeqlD 337&33S 
SeqlD 339 &340 
SeqlD 341 & 342 
SeqlD 343 £344 
SeqlD 345*346 
SeqlD 347 4348 
SeqlD 3494350 
SeqlD 351 4352 
SeqlD 353 S354 
Seq!D355&356 
SeqlD357 &358 
Seq ID 359 & 360 
SeqlD 361 & 362 
Seq ID 3634364 
SsqU>365 &366 
SeqJD367&36B 
SeqlD3S9&370 
Seq1D371&372 
Seq ID 373 & 374 
SeqD375&376 
5eql0377437B 
SeqlD379&3B0 
Seq ID 381 & 382 
SeqlD3B3&384 
SeqlD3B5&386 
SeqlD387&3BB 
Seq tD 339 & 390 
SeqlD391&392 
SeqlD393&394 
Seq ID 395 4396 
SeqlD 9974398 
SeqlD399 &400 
Seq ID 401 4402 
Seq ID 403 4404 
Seq ID 405 4 406 
Seq ID 4074408 
Seq ID 409 4 410 
Seq ID 411 4412 
Seq ID 413 4 414 
Seq ID 415 4416 
SeqlD4l7&41& 
Seq ID 419 4 420 
Seq ID 421 4 422 
Seq ID 4234424 
Seq ID 4254426 
Seq ID 427 4420 
Seq ID 429 4430 
Seq ID 431 4432 
Seq ID 4334434 
Seq ID 435 4436 
Seq ID 4374438 
Seq ID 4394440 
Seq ID 441 4442 
Seq ID 4434444 
Seq ID 4454446 
Seq ID 4474448 
Seq ID 449 4 450 
Seq ID 451 4452 
Seq )D 453 4 454 
Seq ID 455 4 456 
Seq ID 457 4 458 
SeqlD 4594460 
Seq ID 461 4 462 
Seq ID 463 4464 



428296 

423508 

450001 

410407 

453496 

420737 

309931 

412719 

417034 

430486 

413753 

425650 

423673 

41B663 

418663 

429610 

406690 

431646 

422158 

431956 

437044 

4284B4 

429211 

417389 

431009 

417542 

449230 

410555 

410555 

424687 

418462 

410274 

439606 

404877 

444781 

109424 

416B17 

415817 

415817 

415817 

415817 

415B17 

421817 

418678 

41B678 

409420 

332180 

408790 

408790 

439223 

409757 



NMJW3058 

AW6G4297 

NM_001044 



AA442103 

L08096 

AW341683 

AW016610 

NM_006183 

BE062109 

U17760 

NM_001944 

BE003054 

AK001100 

AK001100 

ABG24937 

M29540 

BE019924 

L10343 

X63629 

AL035864 

AF104O32 

AF052693 

BE260964 

BE149762 

J04129 

BE613348 

U92649 

U92649 

JD5070 

BE001596 

AA3S1607 

W79123 

NM.014400 

NM_O05329 

UB8967 

U88967 

U88937 

UB8967 

U8Q967 

U88967 

AF146074 

NM_DfJl327 

NMJM1327 

Z15D08 

AF134160 

AWSS0227 

AW580227 



428969 



450701 
450701 
414774 
407944 
407944 
428486 



432674 
445891 
445891 
404682 
429547 
425921 
407242 
407242 
407242 
432598 
444006 
423685 
428392 
429597 
446030 
422109 
419235 
449048 
427333 
417931 
419216 



WMJM1898 

A*120274 

API 20274 

AFI20274 

AF120274 

H39960 

H3S960 

XD2419 

R34O0B 

R3400B 

AW583497 

AI693815 

W94322 

AW3&1342 

AW391342 



Hs.1 83572 

Hs.129711 

Hs,406 

Hs.63287 

K5330B4 

H3u99899 

Hs.816 

H3.B09G2 

Hs.241551 

Hs.75517 

Hs.1925 

H8.1695 

Hs.41690 

Hs.41690 

Hs.211092 

rls.220529 

HS.27158Q 

Hs.112341 

Ks3877 

Hs.69517 

Hs.184601 

H5.19B249 

Hs.B2D45 

Hs.48956 

K5.822S9 

H&511579 

HS.64311 

H5.64311 

HS.1S173B 

Hs.85266 

Hs.61762 

Rs.68561 

Hs.11950 

Hs,B5962 

HS78B67 

Hs.78867 

H&.78867 

Hs.78867 

HS76B67 

HB.78B67 

Hs.108660 

Hs.87225 

Hs.87225 

H&54451 

H3.7327 

Hs.47860 

Hs.47860 

Hs,250618 

Ks.123114 

Hs.1 94669 

Hs.194689 

HsJ946B9 

Hs.194689 

H8.28B467 

Hs^88467 

Hs.77274 

He.239727 

Hs.239727 

Hs.184604 

H8.127179 

H&^79651 

Hs.199460 

rte.199460 



salute carrier fam3/ 22 (organic cation 

hepatitis A vims cellular receptor 1 

solute carrier femOy 6 (neuntfransmittfl 

carbonic anhydrasa IX 

solute earner family 2 (faciKlated gkJ 

CD70 ; tumor necrosis factor (ligand) 6 

gb±d13dGU1 SoaresJ^R^T^GBC^SI Homo s 

ESTs 



AW0O91G6 


Hs.99376 


KfcL007231 


Hs.162211 


M18728 




M1B723 




M18728 




AJ224741 


rte.278461 


BE3950B5 


Hs.100B6 


BE350494 


Hs.49753 


H10233 


Hs.2265 


NWJ003816 Hs.2442 


N3Q714 


Hs.325960 


S73265 


Hs.1473 


AW470411 


Hs.288433 


Z4S051 


Hs,22920 


AF067797 


Hb.176658 


W95642 


Rs.82961 


AU076718 


Hs/l 64021 



chloride channel, calcium activated, fern 
tamlnln, beta 3 (nlcetn (125<D), Kalinin 
desmogteln 3 (pemphigus vulgaris antigen 
matrix rratelfa proteinase 12 {macrophage 
desmocolSin 3 
desmocotRn 3 

LUNX protein; PLUNC (palate lung and nas 
cairinoernbryonic antigen-related cell ad 
uropIaWn 1B 

protease Inhibitor 3, sWrnlertved {SKAL 
cadhertn 3, type 1, Podherin {placenta 
differentially expressed in FanconVs en 
solute carrier farrfly 7 (catfonfc amino 
gap {unction protein, beta 5 (conn ©dn 3 
mfdkine (neurits o/owm-promoting factor 
gap junction protein, beta 6 (connexln 3 
pro gestagen- associated endomotriai prate 
melanoma cell adhesion molecule 
a dEstategrtn and metslbpr otaViase doma 
adiskitegrin and metfii bp rote has a dome 
matrix metalloprotelnase 9 (gelatlnase B 
fntegrtn, beta 4 
hypoxia-inducible protein 2 
G prcteln-fioupled receptor 67 
NM_005365:Homo sapiens melanoma antigen, 
GPI-ancbored m etas tasfe-as3 ocfeted prate 
hyaturonen synthase 3 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
ATP-bindng cassette, sub-tamuy C (CFTR 
eancertesBs antigen (NY-ESO-1) 
cancerftesBsanEgen (NY-ESO-1) 
lamtoln, gamma 2 (riceto (10QKD?, kaftri 
dmJdinl 

neurotrophic tyrosine kbiase, receptor, 

neurotrophic tyrosine ttiase, receptor, 

ULl6Wrxffng protein 2 

cystaSn SN 

arteirltn 

artemln 

artemSn 

artemln 

hypothefical protein XPJ198151 Oeucina- 

hypothetica) protein XP_09B151 {leudne- 

plasmlnogen acSvalor, irrddnase 

desmocotn 2 

desmocolBn2 

pancreafc polypeptide 

cxypGcgene 

melanoma Inhibitory activity 
DPCR1 protein 
DPOT1 protesi 

C9Q01188*^|1273a842tref|NPJF73725.1l p 
ESTs 

solute canter family 6 (neiirotransinftfe 
g bitumen nonspecific crossraacting anfig 
gb:Human nonspecific crossreacflng en tig 
glxRuman nonspeeffic crossreadkig artSg 
matriJin3 

type ) transmembrane proleki Fn14 
uveal autoanflgen with coled coil domal 
secretory granule, neuroendocrine protel 
a dtsiitagrin and rnataSoprotemase doma 
membtane^panri'mg domains, eubfomly A 
gas bin-re leasing peptide 



plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
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secreted 

plasma membrane 
SBcretsd 

plasma membrane 



secreted 

plasma membrane 
plasma membrane 
secreted 

pi asma membrane 
plasma membrane 
plasma membrane 
plasma membrane 



plasma membrane 



plasma membrane 
p!3sma membrane 
plasms membrane 



plasma membrane 
secreted 

plasma membrane 
Inlracell 



plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
IrriraceD 



pfasma membrane 
plasma membrane 
plasma membrane 



secreted 
secreted 



secreted 
secreted 



plasma membrane 
secreted 
secreted 
secreted 

plasma membrane 
plasma memhrBna 



plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 



IntraceU 
secreted 

plasma membrane 



similar to S66401 (catfle) glucose Indue 
aquapcrin 8 

trefoil factor 3 (intestinal) 

small Inducible cytokine subfamily B (Cy 
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plasma membrane 

secreted 

secreted 

secreted 

secreted 

JntraceU 

secreted 

secreted 

IntraceJI 



secrete cJ 
secreted 

plasma membrane 
ptesma membrane 
plasms membrane 



Seq ID 465 4 466 431629 AU07702S Hs.265S27 Interferon, alpha-kiduc&le protein (cb secreted 

Seq ID 467 4 468 413554 AA319146 Hs.?5426 secretogrann II (chratnogranln C) secreted 

SeqlO4B94470 452194 AI694413 Hs.332649 Ubiquitin-lfce protein FAT1077? plasrra membrane 

Seq ID471 4472 452194 AJ694413 Ks^32649 Ubiqulhft-lfte protein FAT1 0777 

ScqlD473& 474 426322 J05068 KsJ2012 transcobalaiTrai Hvilamin B12 bidding pr 

Seq ID 475 4 476 429010 Y18198 Ha.194725 we cut toman, fenfly member 2 ftUraceS 

Seq ID 477 4 478 431462 AW5S3672 Hs.256311 graairvfike neuroendocrine peptide precu secreted 

Seq ID 479 4 480 448243 AW3S9771 Hs.52B20 Intern, beta 8 plasma membrane 

Seq ID 481 4482 426427 M86699 Hs,169B40 TTK proteinase Intracell 

Seq ID 483 4 484 423187 A1697303 Hs-285529 G praten-coupted receptor 49 plasma membrane 

Seq ID 485 4 486 426579 KM_005756 Hs.184942 G protein-coupled receptor 64 plaama membrane 

Seq ID 487 & 463 42B579 MMJD05756 Ks.184942 G protein-coupled receptor 64 plasma membrane 

Seq ID 489 4 490 428579 NM_005755 Hs.184942 G protelrwoBpIed receptor 64 plasma membrane 

Seq ID 491 & 492 42B579 NMJJ05756 Hs.184942 G protaJrvomplsd receplnr 64 plasma membrane 

Seq ID 4934 494 445537 AJ245S71 Hs.12844 EGF4ike<tomaln, muluiple6 

SeqlD4954496 422278 AHJ72873 Hs.114218 frizzled (DrosGphna) homolog 6 

Seq ID 497 & 496 424620 AA1 01 043 H 5. 151254 teJI&reln 7 (chymotrypflc. stratum corn 

Seq ID 499 & 590 406400 kaJlfcrein 8 (nBiropshfevasin) Ct<LKd) 

Seq ID 5014502 431130 NM_006103 rfe3719 HE4; erMdyrrts-specmc, whey-acidic pr 

Seq ID 503 & 504 420440 NM_0024D7 Ks.97644 mammagtobln 2 

Seq ID 505 4 506 426450 Nty.014791 rte.1B4339 KIAA0175 gene product 

Seq ID 507 & 608 446619 AUD76643 Hs.313 secreted phosphoprotetrt 1 (osieopontin, 

Seq ID 509 4 510 436982 AB018305 Hs.5378 spondln 1,(^spondlnJ extracellular mat 

SeqlD5114512 453392 U23752 Hs.32954 SRY(sexdmenrfnlng region Y)-box 11 

Seq ID 513 431989 AW972670 H&29108S ESTs 

SeqlD514 439820 AL36O204 H&283B53 Homo sapiens mRNA Ml length insert cDN 

SeqID 5154616 409179 B&39394& Hs.50915 kalliknsnS 

Sea ID 5174 518 426514 BE616633 Hg.170195 bono nwphogeneflc protein 7 (osteogenic 

SeqlD5t94520 421476 A16B3243 He.97258 ESTs, Moderated sMar to S29539 r&os 

SeqlD521 4622 421476 AIBB3243 Hs.97258 ESTs, Moderately similar to S29539 riboa 

SeqlD5234524 425776 U2512B Hs.159499 paTBthyfoW hormone receptor 2 

Seq ID 525 & 526 425776 U25128 Hs, 159499 parathyroid hormone receptor 2 

Seq ID 527 & 528 452097 AB0023S4 Ksl27916 a cSsintogrin-like and matatlo protease { secreted 

Seq ID 529 4 530 416530 U62801 Hs.79361 kallikrdn 6 (neurosti, zyme} secreted 

SeqlD5314532 431515 NM.012152 Hs.258583 endothelial dlfrerenMon, lysophospha plasma membrana 

Seq ID 533 4 534 419452 U33635 Hs. 90572 PTK7 protefn ^rosfne kinase 7 ofasma membrane 

Seq ID 535 & 536 412070 XB9599 Ms.73149 paired box gene 8 intracetl 

Seq ID 537 & 538 412078 X69699 H&73149 paired box gene 8 intracell 

Seq ID 539 & 540 411773 M4_006?99 Ks.72028 protease, serine, 21 (tesBsIn) secreted 

Seq ID 641 4 542 407792 AI077715 Hs,39384 pufeSve secreted Ggand homologous to f 

Seq ID 543 & 544 431616 AA508S52 Hs.195839 ESTs, Weakly sfmflar to I38022 hypothec 

Seq ID 545 & 546 452792 AB037765 Hs.30852 KTAA1 344 protein plasma membrane 

Seq ID 547 & 546 400294 N95796 Hs.278695 Homo sapiens prostefci mRNA, complete cds plasma membrane 

Seq ID 549 & 550 432653 N62Q96 HsJ93lB5 ESTs, Weakly similar to JC73Z8 amino act plasma membrane 

SeqlD55l&552 432653 NG2096 Ks^931B5 ESTs, Weakly similar to JC732B amrto aci plasma membrane 

SeqlD5534554 432653 N 6 2096 Hs.293185 ESTs, Weakly similar to JG7328 amino act plasma membrane 

SeqlD65S&556 432653 K62096 Hs^93135 ESTs, Weakly similar to JC732S amino set plasma mernbrarie 

Seq ID557 4558 400290 H18B36 Hs.31606 hypothefcal protein FU20041 plasma membrane 

SeqlD559&560 410001 AB04103S Hs.57771 kallkreki 11 secreted 

Seq ID 561 & 562 418398 AI765605 H&26691 ESTs plasma membrane 

Seq ID 5634664 451027 AW519204 rfcUOQQB ESTs ptesma membranB 

Seq ID 565 A 566 446057 AM20227 Hs.149358 ESTs, Weakly similar to A46010 Winked plasma membrane 

Seq ID 5674568 433466 AA508353 Hs.105314 retaxm 1 (H1) secreted 

Seq ID 569 4570 453370 AI470523 rfe.139335 ATPJjImSng cassette, sub-femUy C (CFTR plasma membrane 

Seq ID 571 4572 453370 AI470523 Hs.139336 ATrMrfndlng cassette, sub-ferrfly C (CFTR plasma membrane 

Seq [D 573 & 574 414569 AF109296 Hs/116256 prostatB cancer associated protein 1 plasma membrane 

Seq ID 575 4 576 413435 X51405 Hs.75350 carboxypepb'dase E secreted 

Seq ID 577 4578 426501 AW043762 Hs^93816 ESTs ptasma membrane 

Seq ID 5794560 448099 AF1 79274 Hs22791 transmembrane protetn with EGRIke and ptasma membrane 

SeqlD58l45B2 408369 R3B438 Hs.182575 so!ute earner tarrr3y 15 fj^77 transport plasma membrane 

Seq ID 583 & 584 412826 AI972402 Hs.308051 hypothetical proteVi MGC2648 secreted 

Ssq ID 5B5 4586 403047 NMJ)05S56*^Qmo sapiens transmembrane pr plasma membrane 

Seq ID 5874 588 403047 NM_005656*:Homo sapiens transmembrane pr plasma membrane 

SsqlD5B94990 40B43O S79B76 Hs,44926 dlpepbdyf peptidase IV (CD26, adenosVie ptasma membrane 

SeqlD691&692 445413 M151342 Hs.12677 CGH47 protefn secreted 

Seq ID 593 4 594 451982 F13036 Hs.27373 Homo aflffens mRHA; cDNA DKFZp5 5401763 (f Intracell 

Seq ID 595 4 598 427958 AA418Q00 Hs.98280 potBSsJum mtemiiidiatc'smfiJI conduDt^CQ plasma membrane 

Seq ID 597 4 598 421887 AW161450 Hs.109201 CO-66 protein plasma membrane 

Seq ID 599 4 600 425071 NMJ0139B9 I Is, 154424 detodmase. fodothyronlne, type II secreted 

Seq ID 601 4 602 432101 AK16950 Us. 122642 EpoA3 plasma membrana 

Seq ID 603 4 604 407786 AA687538 Hs.3B972 tetraspanl plasma membrane 

Seq ID 605 4 606 416836 D54745 Hs.80247 chdecystoklnln secreted 

Seq ID 607 & €08 416539 Y07909 rls.79368 epUhatial membrana protein 1 plasma membrane 

Seq ID 609 & 610 1310B3 YD9763 Hs .22765 ganima-aiTinobutyric ackJ {GABA) A recepto plasma membrane 

Seq ID 611 &612 1310B3 Y09763 Hs^2765 gamma-anrrtnolxityric acid (GABA) A lecepto plasma membrane 

Seq ID 613 4 614 131083 Y0S763 HS-227B5 gamma-amlnobutydc add (GABA) A recepto plasma rnemhrane 

Seq ID 615 4 616 131083 Y09763 Hs.22785 gamma-amlnabutyric sold (GABA) A recepto ptasma membrane 

Seq ID 617 4 618 422424 AJ186431 Hs.296638 prostate alfferenfiaiton factor secrelad 

Seq ID 619 4 620 428970 8E276891 Hs.194691 rati noic add Induced 3 plasma membrane 

Seq ID 621 4 622 428330 L22524 Hs.2256 mainx metal lop rotBBiasfl 7 (matriyain, secreted 

Seq ID 623 4 624 439018 AW300B87 Hs^6638 membrane-Bpannlnrj 4-domaIns, siMarnlly A plasma membrane 

Seq ID625 4626 420610 AJ683163 HsJ9348 distaHess homeo box 5 intraceii 

Seq ID 627 & 628 425723 NM_0 14420 Hs.1 59311 dk^dtcpf (Xenopus laevia) hcrralog 4 secreted 
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SeqtD629 &630 
SeqlD631 ft 632 
Seq ID 633 & 634 

Table 76B: 



456662 NMJB2448 Ha.1494 msh (Droscphlla} homeo box honwtog 1 (To 
41B231 UO9550 Hs.1154 ovfductaJglycopfoleJn 1, 120kD(mucm 9 
429903 AL134197 Hs.93597 cyclin-dapendent kinase 5, regulatory BU 



imracell 
secreted 



Ptey. Unique Eos probeset identifier number 

CAT number Gene cluster numter 
Accession: Gen bank accession numbers 



Pkey 
424399 

429220 
450375 



Table 76G 



CAT Number Accession 



236961J 

3013B4.1 
83327.1 



A19056B7 AT905B24 AJ905837 A1905623 AA340059 R76793 W72637 BE074512 AI905633 W72B38 BE09242 1 AI127172 BE 186013 AW070916 

A113&456AW176044AW291950 

AW20720B AW341 473 AA44B1 95 A1951341 

AA0O9647 AA131254 AA374293 AW954405 H04410 AW606284 AA151166 BE157467 BE1576Q1 H04384 W4B291 AW663674 H04021 H01532 
M190993H03231 H59605 H01 642 AA852B76AA 113758 AA62691SAA746952AI161014AA099554 R69C67 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this col jmn are Genbaik Identifier (Gl) numbers. "Dunham I. el d refers to 8ie publication 

sequence of human chromosome 22." Dunham L el al, Nature (1999) 40S489495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposfSon: Indicates nucleotide positions of predicted exons. 



The DNA 



PkBy 


Ref 


Strand 


NLposWofl 


402075 


8117407 


Plus 


121907-122O35l122804-122921J24O19424161,1 


402230 


9966312 


Mmis 


29782-29932 


403047 


3540153 


Minus 


59793-59966 


404029 


7671252 


Plus 


108716-111112 


404049 


3668074 


Minus 


75765-78155 


404662 


9797231 


Minus 


40977-41150 


404675 


9801324 


Plus 


96568-96732,97722-97831 


404677 


1619284 


Phis 


1096-2107 


404977 


3738341 


Minus 


43061-43229 


406932 


7767612 


Minus 


123525-123713 


406400 


9256298 


Plus 


1553-1712,1878-2140,4252-4385,5922^077 
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Table 77 provides Pkey; Seq ID Nor Disease Indications; and Preferred Willy for sequences in Table 78. Seq ID No links me infermaSon in Table 77 to Table 7a 

Seq ID No: Sequence ID No for sequences &i table 

Pkey"- Unique Eos prabesetklenflffer number 

Disease Indications: Diseases designated for coverage as described in Table 1 

Preferred Utility: Preference of utility, based partly upon predicted localization (Ab is antibody; sm is smatt rnotecute target CTL is vaccine target) 



Seq ID No Pkey Disease Indications 



425023 
424503 
428423 



419172 



SeqlD1&2 
SeqlD3&4 
SeqlD5ft6 
SeqlD74B 
SeqlD9&lO 
Seq ID 11 & 12 418007 
Seq ID 13 & 14 418007 
Seq ID 15 & 16 407936 
Seq ID 17 tV 16 414577 
Seq tD 19 & 20 419738 
Seq ID 21 & 22 428368 
Seq ID n & 24 415138 
Seq ID 25 ft 26 429276 
Seq ID 27 & 23 416994 
Seq ID 29 & 30 407975 
Seq ID 31 & 32 429113 
SeqtD33&34 418506 
Seq ID 35 & 36 423961 
SeqlD37&38 414812 
Seq ID 39 & 40 417433 
Seq ID 41 & 42 417433 
Seq ID 43444 424399 
Seq ID 45 & 46 422867 
SeqlD47& 48 428227 
Seq ID 49650 444381 
SeqlD51&52 439569 
Seq ID 53 & 54 411558 
Seq ID 55 & 56 400303 
Seq ID 57 & 58 4T1789 
SeqtD59 * 60 428696 
Seq ID 61 8.62 450098 
SeqtD63&64 421652 
Seq ID 65866 452747 
Seq ID 67 & 65 415539 
Seq ID 69 & 70 416636 
SeqlD71ft72 416636 
SeqtD73&74 409079 
SeqtD75& 76 442082 
5eq1D77&76 400297 
SeqtD79&80 461390 
SeqlDB1&82 429220 
Seq ID 83 & 84 421524 
SeqlD85&65 423242 
Seq ID 87 & 88 423242 
SeqlD89&80 423242 
Seq(091&92 452190 
SeqID9G&94 452190 
SeqlD95&96 325372 
SeqlD97&98 450375 
Seq ID 99 & 100 426215 
Seq ID 101 & 102 425247 
Seq ID 1034 104 429353 
Seq ID 1054 10B 429353 
Seq ID 107 ft 108 432201 
Seq ID 109 & 110 4275B5 
Seq ID 111 & 112 446163 
Seq ID 113 & 114 442117 
Seq ID 115 & 116 428179 
SeqlD117ft118 431211 
Seq ID 1 19 4 120 447033 
Seq ID 121 & 122 447033 
Seq ID 123 & 124 447033 
Seq ID 125 & 126 115522 
Seq ID 127 & 128 452679 
Seq ID 129 & 130 446051 
Seq ID 13U 132 422048 
Seq ID 133 & 134 410416 
Seq ID 135 & 136 446342 
Seq ID 137 & 138 422260 
Seq ID 139 & 140 409041 
Seq ID 141 & 142 409041 
Seq ID 143 & 144 111829 
Seq ID 145*146 111929 
Seq!D147&148 111929 



sngibgBnaSts 

pancreas, prostate, engiogenesis, bladder, lung 
angbgenesls 

an g la genesis, bladder. lung, cervical, ovaian, head & neck 
engiogenesis, renal 

engiogenesis, Sung, bladder, fibrosis, head ft neck, pancreas, stomach, colon, ovarian 
autogenesis, king, Madder, fibrosis, head & neck, pancreas, stomach, colon, ovarian 
angtogenesfe 



angtogenesls, lung, ovarian, bladder & stomach, pancreas, uterine 

angtogenesls, head ft neck, stomach 

sngraganssis, pancreas, stomach, lung, uterine 

angiogenesJs, bladder, glioblastoma 

prostate, engiogenesis 

angiogsnesls, renal 

angfogeneste, Madder and stomach 

angtogenesB, ovarian, glioblastoma, uterine, rung, bladder, pancreas 

breast coton, bladder, lung, fibrosis, pancreas, head and neck, ovarian 

breast, bladder, limg. Fibrosis, pancreas, colon, head and neck, cervical, stomach, renal, ovarian 

pancreas, breast, bladder, lung, head & neck, cervical. Uterine, ovarian, stomach, renal 

pancreas, breast, bladder, lung, head & neck, cervical, uterine, ovarian, stomach, renal 

breast, uterine, head & neck 

breast ovarian, prostate, pancreas, lung, colon, uterine 

breast lung, bladder, ovarian, head & neck, ftnosia, colon, stomach, cervical 

breast colon, Madder, lung, pancreas, head ft neck, ovarian, stomach, uterine, ranal. angngenesls 

breast, androgen withdrawal prostate, prostate, bladder 

pancreas, prostate, stomach, breast uterine, cervical, ovarian 

breast ovarian, pros, stomach, uterine, bladder king head & neck 

pancreas, lung, breast stomach, head & neck, ovarian, uterine 

breast colon, lung, pancreas, stomach, head & neck, ovarian 

breast lung, stomach, uterine 

breast ovarian, pancreas, cervical, uterine, prostate, lung, stomach, heed & neck 

breast bladder, lung, head & neck, ovarian, stomach, uterine, pancreas 

breast prostata, ovarian, cervical, uterine 

breast pancreas, uterine 

breast pancreas, uterine 

breast prostate 

breast prostate, ovarian 

breast, bladder, colon, prostate 

breast, ovarian 

breast prostate, benign prostatic hyperplasia 



breast renal ovarian, prostate, colon 
breast renal, ovarian, prostate, colon 
breast renal, ovarian, prostate, colon 
breast stomach, pancreas 
breast stomach, pancreas 



breast ovarian, head & neck, pancreas, lung, coton 
breast king, renal, colon, ovarian, uterine 

breast ovarian, lung, coton, pancreas, head & neck, stomach, uterine, cervical, bladder 
breast prostate; 
breast, prostata 

breast coton, lung chemo, ovarian, stomach, pancreas, uterine, cervical 
breast lung, head & neck, pancreas, stomach, colon, ovarian, cervical 
breast cervical, uterine 

breast lung, bladder, pancreas, tread ft neck, stomach, ovarian, prostate 
breast 

colon, bladder, king, pancreas, head & neck 
colon, prostata, fibrosis, breast lung pancreas 
colon, prostata, fibrosa, breast lung pancreas 
coton, prostate, fibrosis, breast lung pancreas 
colon, king, bladder, pancreas 
prostata, colon, pancreas, Taxnl prostate 
colon, breast 
colon, pancreas, prostate 

colon, bladder, Jung, ovarian, pancreas, head & neck 



coton, ovarian mucinous 

uterine, ovarian, lung, colon, stomach, head ft neck, breast pancreas 
uterine, ovarian, lung, colon, stomach, head & neck, breast pancreas 
colon, breast fibrosis 
colon. Breast fibrosis 
colon, breast fibrosis 
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PreferredUflirv - 

Ab, sm. CTL, Imaging 
Ab.sm. imaging 
Ab,sm 

Ab, sm, CTL, diagnostic 
Ab, sm, CTL, Imaging 
Ab, sm. diagnostic 
Ah, sitv diagnostic 
CTL 

Ab, CTU diagnostic 
Ab, sm, CTL, Imaging 

Ab, CTL, degnoslic 
Ab,sm, imaging 
Ab, CTL Imaging 
Ab, CTL, diagnostic 
3m, CTL 
Ab, sm, imaging 
Ab, sm, diagnostic 
Ab, CTL, diagnostic 
Ab.CTL, imaging 
Ab.CTU Imaging 
Ab, CTU diagnostic 
Ab, CTU diagnostic 
At^CH^ diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab. sm. imaging 
Ab.sm, CTL, imaging 
Ab, CTL, diagnostic 
Ab, sm, Imaging 
CTL 

Ab, sm, CTL, diagnostic 
Ab. Imaging 
Ab.sm, CTL, imaging 
Ab.sm, CTL. Imaging 
Ab.sm, CTL, Imaging 
Ab,em, CTL, imaging 
Ab, imaging 
Ab^m, CTL, imaging 
CTL 

Ab, CTL, imaging 
Ab.sm, CTL, Imaging 
Ab, CTL, Imaging 
CTL 

era 

CTL 
CTL 
CTL 

Ab^m, CTL, Imaging 
Ab, CTL, diagnostic 
Ab, sm, CTL, diagnostic 
Ab, em, CTL, imaging 
Ab,3n\Cm,snaging 
Ab,sm, CflT^imagfng 
CTL 

Ab,sm, Imaging 
Ab, CTL, imaging 
sra.CTL 
Ab, sm, (maglng 
Ab, CTL, dfegnosrJc 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
CTL 

Ab.CTL, imaging 
Ab, sm, CTL, Imaging 
Diagnostic 

Ab, sm, CTL, Imaging 
Ab, sm, CTU imaging 
Ab, sm, CTU diagnostic 
Ab, CTL, dfegnosbc 
Ab, CTU diagnostic 
Ab,sm, Imaging 
Ab, sm. Imaging 
Ab, sm, imaging 
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Seq 10 149 & 150 104888 
Seq 10 151 & 152 420159 
Seq ID 153 & 154 422330 
Seq ID 155 & 156 452451 
t> Seq ID157& 158 413324 
SeqtDl59&160 412420 
Seq ID 161 & 152 416655 
Seq ID 163 £164 407811 

„ Seq ID 165 40223D 

lU 3eq ID 166 & 167 402230 
SeqlD16B&169 43252B 
Seq ID 170 & 171 425721 
Seq ID 172 & 173 420370 

„ Seq ID 174 &>75 437652 

15 Seq ID 176 & 177 402075 
Seq ID 178 & 179 421110 
Seq ID 180 & 181 451668 
SeqlD1B2&183 451668 

„ Seq ID 184 &1B5 451668 

20 Seq ID 166 & 187 408243 
Seq ID 168 & 189 422282 
Seq ID 190 4 191 425852 
SeqlD192&193 43973B 
Seq ID 194 & 195 404875 

25 Seq ID 198 & 197 425893 
Seq ID 198 & 199 404977 
Seq ID 200 & 201 420876 
SeqlD202 &203 427747 

„ Saq ID 204 & 205 420281 

30 Seq ID 206 & 207 446673 
Geq ID 208 437553 
Seq ID 209 & 210 437553 
Seq ID 211 & 212 437553 
Seq ID 213 a 214 426900 

-35 Seq ID 215 & 216 426900 
Seq ID 217 & 216 426900 
Seq ID 219 & 220 405932 
Seq ID 221 & 222 405932 
. _ Seq ID 223 & 224 424008 

40 Seq ID 2254226 424008 
Seq ID 227 & 228 424008 
Seq ID 229 & 230 444342 
Seq ID 231 & 232 421379 
._. Seq ID 233 & 234 417079 

45 Seq K> 235 & 236 412966 
Seq!D237&238 4129B6 
Seq ID 239 & 240 412966 
SeqlD241&242 447072 

crs Seq ID 243 & 244 419723 

50 Seq ID 245 & 248 419723 
Seq ID 247*248 430890 
Seq D 249 & 250 456759 
Seq D 251 429466 
Seq ID 252 429466 

55 Seq ID 253 & 254 419721 
Seq ID 255 & 258 407034 
Seq ID 257 & 258 413472 
Seq ID 259 & 260 438350 
Seq ID 261 & 262 426271 

60 Seq ID 263 & 264 419704 
Seq ID 265 & 266 444471 
Seq ID 267 & 268 409395 
Seq ID 269 & 27Q 413063 
„ SeqlD271& 272 433800 

65 Seq ID 273 458435 
Seq ID 274 458435 
Seq ID 275 &27G 424343 
Seq ID 277 & 278 424998 
Saq ID 279 & 280 412709 

70 Seq ID 281 & 282 435615 
Seq ID 2B3& 284 404049 
Seq ID 285* 286 418932 
Seq ID 287 & 288 404029 
Seq ID 289* 290 436480 

75 Seq ID 291 & 292 452401 
Seq ID 293 & 294 452401 
Seq ID 295 & 296 438895 
Seq ID 297 & 298 421471 
0 . Seq JD 299 & 300 428296 

OU Seq ID301& 302 423508 
Seq ID 303 & 304 450001 
Seq ID 306 & 306 410407 
Seq ID 307 & 308 453496 



colon, stomach, uterine 
bladder, stomach 
pancreas, colon, Madder 

bladder, lung, head £. neck, ovarian, glioblastoma, stomach, colon, cervical 
bladder 

bladder, glioblastoma, king, stomach 

lung, ovarian, uterine, bladder 

Madder, pancreas, stomach, uterine, lung 

Madder 

Madder 

Madder 

bladder 

bladder 

bladder, lung 

bladder, lung, head & neck, cervical 

bladder, pancreas, stomach, ovarian, lung 

bladder, ovarian, lung 

bladder, ovarian, lung 

bladder, ovarian, lung 

bladder, stomach, head & neck, cervical 

bladder, lung, head ft neck 

Madder, lung, head & neck 

W adder Jung, cervical 

Madder 

Madder, pancreas 

Madder, ovarian 

pancreas, bladder 

Madder, king, ovarian, stomach 

lung, bladder, ovarian, pancreas 

bladder 

bladder 

bladder 

bladder 

Madder, prostate 
bladder, prostate 
Madder, prostate 

Madder, lung, head & neck, cervical 
bladder, lung, head ftnecls, cervical 
Madder, head & neck, stomach, cervical 
bladder, head & neck, stomach, cervical 
bladder, head & neck, stomach, cervical 
hepatttts C, king, Ibrosls, bladder 

breast pancreas, head & neck, lung, stomach, bladder, cervfcaJ, colon 
bladder, lung, tieai & neck, cervical 



glioblastoma 
glioblastoma 
glioblastoma, 
glioblastoma 



gfibblastoma. lung, cervical, bladder 
gSoNastoma 
pjSobtestoma, uterine 
na. | 



glioblastoma 
glioblastoma 
glioblastoma 



glioblastoma, lung, colon 
glioblastoma 

glioblastoma, ovarian, Madder, king 
gtobtastoma, lung 
glioblastoma 
glioblastoma 

glioblastoma, ova [fan, uterine 

glioblastoma 

glioblastoma 

glioblastoma 

glioblastoma 

glioblastoma 

glioblastoma 

glioblastoma 

pfedder, breast pancreas, head & neck, stomach, lung, arthritis, rend 

Madder, breast, pancreas, head & neck, stomach, tung, arthritis, renal 

breast, renal, ovarian, glioblastoma 

renal 

renal 

renal, colon 
renal, lung 

renal, lung, colon, stomach, ovarian, uterine 
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Ab, Imaging 

Ab, sro, CTl, Imaging 

Ab.sm. CTL, imaging, diagnostic 

CTL 

Ab, sm, CTL, diagnostic 

Ab, diagnostic 

Ab, CTL, diagnostic 

Ab, sm, Diagnostic 

sm, CTL 

sm, CTl 

CTL 

Ab, Imaging 
Ab, CTMmagfog 
Ab, sm, CTL, Imaging 



Ab, sm diagnostic 
Ab, am, CTL, diagnostic 
Ab, sm, CTl. diagnostic 
Ab, sn\ CTL, diagnostic 
Ab, diagnostic 
CTL diagnostic 
Ab,sm, CTL imaging 
Ab, sin, CTL Imaging 
sm, CTL 
Ab.CTL imaging 
Ab, sm. CTL, diagnostic 
Ab.sm, CTL, imaging 
sraCTL 
Ab, sm. imaging 
em, CTL 
Ab.CTL Imaging 
Ab.CTL, Imaging 
Ab.CTL Imaging 
Ab,CTL, imaging 
Ab.CTL imaging 
Ab.CTL, Imaging 
sm 
sm 

Ab^m, CTL, imaging 
Ab^m, CTL, Imaging 
Ab^m, CTL, imaging 
Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Ab. diagnostic 
Ab.sm, CTL imaging 
Ab.sm, CTL, Imaging 
Ab.sm. CTLknaglng 
sm, CTL 

Ab, CTL dlagnosEc 
Ab.sm, CTL Imaging 
Ab, CTL snaglng, diagnostic 
Ab,sm,CTL imagng 



Ab,sm, CTL, Imaging 
Ab, CTL, diagnostic 
Ab.sm.CTl, Imaging 
Ab, CTL diagnostic Imaging 
Ah, CTL diagnostic 
sm, CTL 

Ab,sm,CTUkn»gfnB 
Ab, CTL, diagnostic 
Ab, CTL, diajpicsflc 
Ab, CTL, Imaging 



Ab^ CTL imaging 
Ab, CTL diagnostic 
Ab^m, CTL Imaging 
Ab^m. CTL imaging 
Ab.sm, CTL imaging 
Ab.sm, CTL imaging 
Ab.8m, CTL imaging 
sm, CTL 

Ab, CTL, diagnostic 
Ab, CTL, diagnostic 
Abism, imaging 
Ab,sm, CTL imaging 
Ab,sm, CTL Imaging 
AbvCTUknaglng 
Abvsm, CTU imaging 
AUsm, CTL, Imagng 
AUsm. CTU Imaging 
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Sbq ID 309 4 310 420737 
Seq ID 311 4312 309931 
Seq ID 313 4 314 412719 
Seq 103154316 417034 
Seq ID 317 & 316 4304B6 
Seq ID 319 4 320 413753 
Seq ID 321 & 322 425650 
Seq ID 3238324 423673 
Saq ID 325 4 326 418663 
Seq ID 327 & 328 418663 
Seq ID 329 & 330 423610 
Seq ID 331 4 332 406690 
Seq ID 333 & 334 431846 
Seq ID 335 & 336 422168 
Seq ID 337 & 338 431959 
Seq ID 339 & 340 437044 
SeqED341 & 342 428484 
Seq 10343*344 429211 
Seq ID 345 4 346 417389 
Seq ID 347 & 348 431009 
Saq ID 349 & 350 417542 
Seq ID 351 & 352 449230 
Seq ID 353 & 354 410555 
Seq ID 355 & 356 410555 
Seq ID357435B 424887 
Seq ID 359 4 360 418462 
Saq ID 361 A 362 410274 
Seq ID 363 4 384 439606 
Seq ID 365 & 365 404877 
Seq ID 367 4 36B 444781 
Seq ID 3694 370 109424 
Seq ID371 4372 415817 
Seq ID 373 4374 415317 
Seq ID 375 & 376 415917 
Seq ID 377 4 378 415817 
Seq ID 379 4 360 415817 
SaqlD381&3B2 415817 
Seq ID 333 &3B4 421817 
Seq ID 385 &3B6 418678 
SBqlD387 &36B 418678 
Seq ID 389 & 390 409420 
SeqlD391&392 3321B0 
Seq ID393&394 406790 
Seq ID 3854396 408790 
Seq ID 397 & 398 439223 
Seq ID 399 & 400 409757 
Saq ID401& 402 428969 
Seq ID 403 & 404 428969 
Seq ID 405 4 406 428969 
Seq ID 407 & 408 428969 
Seq ID 409 4410 450701 
Seq 1D411&412 450701 
Seq ID 413 & 414 414774 
Seq 1D415&416 407944 
Seq 1D417M18 407944 
Seq 1D419 4420 428486 
SeqlD421 4422 457489 
Seq ID 423 & 424 432874 
Seq ID425 4426 445691 
Seq ID427442fl 445891 
Seq ID 429 4430 404682 
Seq ID 431 4432 429547 
Seq ID433 4434 425921 
Seq ID 435 4436 407242 
Saq ID437 4439 407242 
Seq ID439 4 440 407242 
Seq ID 441 4 442 432596 
Seq ID 4434444 444006 
Seq ID 4454446 423685 
Saq (D447 4440 428392 
SeqlD4494450 429597 
Seq ID451 4452 448030 
Saq ID 453 4 454 422109 
Seq ID 4554456 419235 
SeqlD457445B 449046 
SeqlD4594460 427333 
Saq ID461 4462 417931 
Seq B3 4634 464 419216 
Seq D 465 4 466 431629 
Seq ID 467 4 46B 413554 
Seq ID 469 4 470 452194 
Seq ID 471 4 472 452194 
Saq ID 473 4 474 425322 



rana! 
lung 

lung, head 4 neck, Madder, gfoUastorna, cervical 

lung, head & neck, a couple cervical 

lung, bladder, head 4 neck, cervical 

lung, bladder, head 4 neck, pancreas, cervical, colon 

lung, heed 4 neck, cervical, bladder 

bladder. Jung, head 4 neck, ovarian, pancreas, colon, stomach, uterine, cervical 
lung, bladder, head & neck, cervical 
lung, bladder, head & neck, cervical 
lung 

lung, head 4 neck, pancreas, stomach, bladder, colon, cervical 
lung, bladder, head 4 neck, uterine, carvtea!, stomach, ovarian 
head 4 neck, Madder, lung, cervical, stomach 

lung, bladder, cervical, head 4 neck, ovarian, colon, prostate, pancreas, breast 

head 4 neck, cervical, lung, bladder, h roast, prostate, ovarian, stomach 

fcing, glloNastomfl, bladder, head 4 neck, colon, cervical 

ting, bladder, head 4 neck, cervical stomach 

ovarian, &Jng, bladder, uterine, cervical pancreas, stomach 

lung, bladder, head 4 neck 

lung 

lung, cervical, head 4 neck, bladder, ovarian, colon 

lung 

fung 

head 4 neck, pancreas, fung, uterine, cervtcaf, colon, stomach 
lung, bladder 
lung, renal 

lung, bladder, heed & neck, cervical . 
lung. Madder 

lung, bladder, head 4 neck, cervical 
bladder, ling 

lung, globtestoma, head 4 neck, cervical, fibrosis 

lung, globtestoma, head 4 neck, cervical, fibrosis 

lung, globlaskmia, head 4 neck, cervical, fibrosis 

lung, globtaatana. head 4 neck, cervical, fibrosis 

lung, gflobbstoma, head 4 neck, cervical, fi cross 

lung, globtestoma, head 4 peck, cervical, fibrosis 

lung, cervicaf, head 4 neck 4 bladder 

lung, bladder, stomach, ovarian, pancreas 4 cervical 

lung, bladder, stomach, ovarian, pancreas 4 cervical 

lung, head 4 neck, pancreas, stomach, cervical, bladder 

lung 

lung 

lung 

lung, head 4 neck, cervical, bladder & colon 

pancreas, stomach, lung, Madder, stomach 

tong, cervical 

lung.carvfcaj 

Ions, cervical 

tag, cervical 

lung, head 4 neck, pancreas, breast, stomach, ovarian, colon 
lung, head 4 neck, pancreas, breast, stomach, ovarian, colon 
tag, bladder, head 4 neck, pancreas, stomach, ovarian 
tag, head 4 neck 
tag, head 4 neck 



pancreas, prostata, lung 
pancreas, stomach 
stomach, pancreas, ovarian 
stomach, pancreas, ovarian 



pancreas, head 4 neck, rung, ovarian 
stomach, pancreas 

pancreas, colon, bladder, head 4 neck, stomach, tung, ovarian, cervical 
pancreas, colon, bladder, head 4 neck, stomach, lung, ovarian, cervical 
pancreas, colon, bladder, head 4 neck, stomach, lung, ovarian, cervical 



pancreas, colon, tung, ovarian 4 cervical 
pancreas, uterine, coton 



pancreas, colon, stomach, lung 

pancreas, renal and stomach, 

pancreas, lung, colon 

pancreas, fibrosis, head 4 neck 4 lung 

pancreas, ovarian, uterine, glioblastoma, head 4 neck 4 lung 

pancreas, colon 

ovarian, pancreas, stomach, colon, uterine, prostate 
pancreas, lung, stomach, <jervicat prostate, head 4 neck 
pancreas, uterine, cervical, stomach 
pancreas, glioblastoma 
stomach, pancreas, renal, colon 
stomach, pancreas, renal colon 



Atvsm. CTL Imagmg 
ALsm, CTL, Imaging 
CTL 

Ab, CTL diagnostic 
Ah, soi, imaging 
CTL, diagnostic 
Ab r rnagmg 
Ab, CTL, diagnostic 
Ab. imaging 
Ab, rrnaging 
CTL. diagnostic 
Ab, imaging 
Ab, imaging 
diagnostic 
Ab, CTL, Imaging 
Ab, imaging 
Ab, sm, imaging 
Ab, imaging 
Ab, diagnostic 
Ab,sm, imaging 
CTL, dfegnosiic 
Ab, imaging 
Ab, sm, imaging 
Ab, sm, Imaging 
Ab,sm, diagnostic 
Ab, Imaging 



Ab.sna imaging 
CTL 

Ab, imaging 

Ab, am, CTL imaging 

Ab.sm, CTL imaging 

Ab,sn\ CTL imaging 

Ab.sm, CTL imaging 

Ab^m, CTL Imaging 

Ab^m, CTL imaging 

Ab^m, CTL imaging 

Ab^m,CTLfmaging 

CTL 

CTL 

CTL, diagnostic 
Ab, sm. imaging 
Ab^sm, CTL Imaging 
Attain, CTL imaging 
Ab. CTL, Imaging 
Ab, CTL diagnostic 
A^CTLtferjiostJc 
Ab^CTLotagnosHc 
Ab r CTL cSagnostic 
Ab, CTL ctagnoste 
AbfCTLtfagnostte 
Ab, CTL diagnostic 
Ab.sm, diagnostic 
Ab, sm, tmegng 
Ab, sm, imaging 
Ab, CTL diagnostic 
Ab, CTL diagnosOc 
Ab, CTL diagnostic 
Ab, CTL imaging 
Ab, CTL Imaging 
Ab, CTL diagnostic 
Ab, CTL, diagnostic 
Ab^m, CTL imaging 
Ab.sm, CTL imagfog 
Aiwm, CTL Imag&ig 
Ab^m, CTL Imaging 
CTL 

Ab, CTL Imaging 
CTL 

Ab, CTL diagnostic 
Ab^m, CTL Imaging 
Abvsm, imaging 
Ab, CTL diagnostic 
Ab, CTL Imagrig 
Ab, CTL Imaging 
Ab, sm, imaging 
Ab, diagnostic 
Ph. CTL diagnostic 
Ab, CTL diagnostic 
Ab, CTL diagnostic 
Ab^m. CTL imaging 
AbjSm, CTL imaging 
Ab, diagnostic 
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Saq 10 475 & 476 429010 
Seq ID 477 A 47B 431452 
Seq ID 479 & 480 44B243 
Seq 1D U 461 & 482 426427 
Seq ID 4B3&484 428187 
Seq ID 485 4486 428579 
Seq ID 487 4 488 428579 
Seq ID 489 & 490 428579 
Seq ID 491 & 492 424579 
Seq ID 493 & 494 445537 
Seq ID 495 6 496 422278 
Seq ID 4D7& 498 424620 
Seq ID 499 & 500 405400 
Seq ID 501 & 502 431130 
Seq ID 503 & 504 420440 
Saq ID £05 4 506 428450 
Seq ID 507 & SOB 446619 
SeqlD509 &5l0 4369&2 
Seq ID 511 & 51 2 453392 
Seq ID 513 431969 
Seq ID 514 439A20 
Seq ID 515 & 51 6 409178 
SeqlD517&516 426514 
3eqlD5lS&520 42l47B 
Seq 105214 522 421478 
Seq ID 523 & 524 425776 
Seq ID 525 & 526 425776 
Seq ID 527 & 528 452097 
Seq ID 529 & 530 416530 
SeqlD531&532 431515 
Seq ID 533 4 534 419452 
Seq ID 635* 536 412078 
Seq ID 537 & 538 412078 
Seq ID 539 & 540 411773 
Seq ID 541 & 54a 407792 
Seq ID 543 & 544 431616 
Seq ID 545 & 546 452792 
Seq ID 547 & 548 400294 
Seq?D549 &550 432653 
Seq ID 551 & 552 432653 
Seq ID 553 & 554 432653 
Seq ID 555 8.556 432653 
Seq ID 557 & 55B 400290 
Seq tD 559 & 560 410001 
Seq ID 56U 562 418396 
Seq ID 663 & 564 451027 
Seq ID 565 & 566 446057 
Seq ID 567 & 568 433456 
Seq ID 569 & 570 453370 
Seq ID 571 & 572 453370 
Seq ID 573 & 574 414569 
Seq ID 575 4576 413435 
Seq ID B77& 576 426501 
Seq ID 579 & 580 44B999 
Saq ID 5B1 & 582 408369 
Seq ID 5B3& 554 412628 
Seq ID 585 & 566 403047 
SeqlD587 &58B 403047 
SeqlD589 &690 408430 
Seq ID 591 & 592 445413 
Seq ID 693 & 594 451982 
Seq ID 595 & 596 427958 
Seq ID 597 & 598 421867 
Seq ID 599 & 600 425071 
Seq ID 601 & 602 432101 
Seq ID 603 & 604 407786 
Seq ID 605 & 606 416836 
Seq ID 607 & 008 416539 
S8q ID609& 610 1310B3 
Seq ID 61 U 612 131063 
SeqlD613&6l4 131083 
5eqlD615&6l6 131083 
Seq ID 617 & 618 422424 
Seq, ID 619 & 620 426970 
Seq ID 621 & 622 426330 
Seq ID 623 & 624 439018 
Seq ID 625 4 626 420610 
Seq ID 627 & 626 425723 
Seq ID 629 & 630 456662 
Seq ID 631 & 632 418281 
Seq ID 633 & 634 429903 



pen ere as 

pancreas, lung, glioblastoma 

ovarian, uterine, lung, 6tomacti. head & neck, glioblastoma, pancreas 

ovarian, king, head & neck, cervical, colon, uterine, stomach 

ovarian, uterine, cobn. stomach 

ovarian, Ewing, uterine, pancreas 

ovarian, Ewing. uterine, pancreas 

ovarian, Ewing, uterine, pancreas 

ovarian, Ewing. uterine, pancreas 

ovarian, bladder, uterine, breast, lung, nead & neck, renal, fibrosis, pancreas, 

ovarian, head & neck, bf adder, cervical, Jung 

ovarian 

ovarian, uterine 



ovarian, 
ovarian, 
ovarian, 
ovarian, 
ovarian 



ovarian, 
ovarian, 



uterine, cervical 

cervteaJ, pancreas, lung 

fibrosis, pancreas, head & neck, lung, cdan 

fibrosis 

lung, glioblastoma 

uterine, cervical, breast, prostate 
breast * 

colon, bladder, lung, cervical 
bifid [la r 



ovarian, uterine, lung 
ovarian, uterine, rung 
ovarian 

ovarian, uterine 

ovarian, prostate, tung 

ovarian, prostata, lung, breast uterine 

ovarian 

ovarian 

ovarian 

ovarian, uterine, cervfcat, pancreas 
prostate, pancreas, colon 
prostate, uterine, breast 
prostate, taxol prostate 
prostata, lung 
prostate, lung 
prostate, lung 
prostate, tung 
prostata, cofun 

ovarian, prostate, uterine, cervical, tung 
prostate 

prostate, uterine, glioblastoma 



prostate 
prostate 



prostate 

prostate, glioblastoma, pancreas 
prostate, breast, glioblastoma, tung 
prostata, glioblastoma 

prostate, lung, fibrosis, uterine, glioblastoma, cervical ovarian 
prostate 

prostate, bladder, colon 
prostate, bladder, colon 
prostate 

prostate, colon, uterine, ovarian, tung, pancreas 
prostate, bladder 
prostate, glioblastoma 
prostate 

prostate, colon, stomach, uterine, cervical, head & neck, pancreas 
prostate, pancreas 

prostate, colon, uterine, stomach, Inflammatory bowel disease, ovarian 

Prostate, Ewing, glioblastoma 

ZD1839 resists ni cancers, head & neck 

androgen rifrcYstfd prostata 

androgen wtthdrawat prostate 

androgen withdrawal prostate 

androgen withdrawal prostate 

bladder, pancreas, prostate, angfogsnBsfe, colon, stomach, lung 
stomach, pancreas, colon 

uterine, ovarian, fibrosis, prostate, pancreas, long, bladder, bead & neck 

uterine, stomach, prostate 

ufe me, ovarian erKforaebtoid. lung 

ovarian endometrioid, uterine, colon 



uterine, ovarian 
lung 



sm.CTL 
Ab, diagnostic 
Ab,sm, imaging 
sn%CTL 

Ab.srn, CTU Imaging 
Ab.sm, CTU imaging 
Ab.sm, CTL, tmogfng 
Ab,sm, CTL, imaging 
Ab^m, CTL, Imaging 
Ab, CTL, diagnostic 
Ab.srn, imaging 
Ab, CTL, diagnostic 
Ab, CrCdlagnostic 
Ab. CTL, diagnostic 
Ab, CTU diagnostic 
sm 

Ab, diagnostic 
Ab, CTL, diagnostic 
CTL 



Ab. CTU diagnostic 
Ab, CTL, diagnostic 
Ab, CTU Imaging 
Ah, CTU imaging 
AUsm.CTLjrnagtfig 
Abv&m, CTU Imaging 
Ab, sm, diagnostic 
Ab, CTU diagnostic 
Ab^m, CTL, Imaging 
Atom, CTU Imaging 
CTL 
CTL 

Ab, em, CTU diagnostic 
Ah, CTU. diagnostic 
Ab^sm, CTU imaging 
Ab, CTU Imaging 
Ab^m, CTU Imaging 
Ab,Hm, CTUnnegmg 
AbjEm, CTU imaging 
Ab,sm, CTU Imaging 
Ab^soi, CTU imaging 
Ab^m, CTU imaging 
Ab, CTL, diagnostic 
AbpSro, CTU imaging 
Ab.sm, CTL, Imaging 
Ab,sm, CTU Imaging 
Ab, CTU dtegnosfic 
Abvsm.CTUtmagiig 
Afxsm, CTU Imaging 
Ab.sm, CTU Imaging 
Abpsm, diagnostic 
Ab, CTU rm aging 
Ab,sm, CTL, Imaging 
Ab.sm, CTU bh aging 
Ab,CTUdfagnos& 
Ab^sm, CTU Imaging 
Ab^m, CTU imaging 
Ab^m, CTU Imaging 
diagnostic 
CTL 

Ab,sm, CTU Imaging 
Ab, CTU Imaging 
Ab, diagnostic 
Ab,sm, Imaging 
Ab.fim, Imaging 
Ab, CTU diagnostic 
Ab, sm CTU imaging 
Ab, am. CTU Imaging 
Ab, sm, CTU Imaging 
Ab,sm,CrU imaging 
Ab, sm, CTL, imaging 
Ab, CTL, diagnostic 
Ab,sm Imaging 
Ab, am, CTU diagnostic 
Abysm. CTU imaging 
CTL 

Ab, CTL, diagnostic 
CTL 

Ab, CTU diagnostic 
sm 
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8eq ID SSOt 1 DNA sequence 

Nucleic Acid Accession #; NM_001400 

Coding sequence : 2 51 - - 13 99 

1 11 21 31 41 si 

1)1111 

TCTAAAGGTC GGGGGCAGCA gcaagatgcg aagcgagccg tacagatccc GGGCTCTCCG 60 

AACGCAACTT CGCCCTGCTT GAGCGAGGCT GCGGTTTCCG AGGCCCTCTC CAGCCAAGGA 120 

AAAGCTACAC AAAAAGCCTG GATCACTCAT CGAACCACCC CTGAAGCCAG TGAAGGCTCT 180 

CTCGCCTCGC CCTCTAGCGT TCQTCTGGAG TAGCGCCACC CCGGCTTCCT GGGGACACAG 240 

GGTTGGCACC ATGGGGCCCA CCAGCGTCCC QCTGGTCAAG GCCCACCGCA GCTCGGTCTC 300 

TGACTACGTC AACTATGATA TCATCGTCCG GCATTACAAC TACACOGGAA AGCTGAATAT 360 

CAGOGCQGAC AAGGAGAACA GCATTAAACT QACCTCGGTG GTGTTCATTC TCATCTGCTG 420 

CTTTATCATC CTGGAGAACA TCTTTGTCTT GCTGACCATT TGGAAAACCA AGAAATTCCA 4 BO 

CCGACCCATG TACTATTTTA TTGGCAATCT GGCCCTCTCA GACCTGTTGG CAGGAGTAGC 540 

CTACACAGCT AACCTGCTCT TGTCTGGGGC CACCACCTAC AAGCTCACTC CCGCCCAGTG 600 

GTTTCTGCGG GAAGGGAGTA TGTTTGTGGC CCTGTCAGCC TCCGTGTTCA GTCTCCTCGC 660 

CATCGCCATT GAGCGCTATA TCACAATQCT GAAAATGAAA CTCCACAAOT GOAGCAATAA 720 

CTTCCOCCTC TTCCTGCTAA TCAGOGCCTG CTOGGTCATC TCCCTCATCC TGGGTGGCCT 7 BO 

GCCTATCATG GGCTGGAACT GCATCAGTGC GCTGTCCAGC TGCTCCACCG TGCTGCCGCT B40 

CTACCACAAG CACTAT AT CC TCTTCTGCAC CACGGTCTTC ACTCTGCTTC 7GCTCTCCAT 900 

CGTCATTCTG TACTGCAOAA TCTACTOCTT GGTCAGGACT CGGAGCCGCC GCCTGACGTT 960 

COQCAAGAAC ATTTCCAAGG CCAGCCGCAG CTCTGAGAAG TCGCTGGCGC TGCTCAAGAC 1020 

CGTAATTATC GTCCTQAGOG TCTXCATOGC CTGCTGGGCA CCGCTCTICA TCCTGCTCCT 1080 

GCTGGATGTG GGCTGCAAGG TGAAGACCTG TGACATCCTC TTCAGAGCGG AGTACTTCCT 1140 

GGTGJTAQCT QTGCTCAACT CCGGCACCAA CCCCATCATT TACACTCTGA CCAACAAGGA 1200 

GATGOGTOGG GCCTTCATCC GOATCATGTC CTGCTGCAAG TGCCCGAGOG GAGACTCTGC 1260 

TGGCAAATXC AAGCGACCCA TCATCGCCGG CATGGAATTC AGCCGCAGC7V AATCGGACAA 1320 

TTCCTCCCAC COCCAGAAAO ACGAAGGGGA CAACCCAGAG ACCATTATGT CTTCTGGAAA 1380 

CGTCAACTCT TCTXCCTAGA ACTGGAAGCT GTCCACCCAC OGGAAGOGCT CTTTACTTGG 1440 

TCGCTGGCCA CCOCAGTGTT TGGAAAAAAA TCTCTGGGCT TGGACTGCTG CCAGGGAGGA 1500 

GCTGCXGCAA GCCAGAGGGA ggaaggggga gaatacgaac AGCCTGGTGG TCTCGGGTGT 1560 

TGGTGGGTAG AGTTAGTTCC TGTGAACAAT GCACXGGGAA GGGTGGAGKT CAGGTCCCGG 1620 

CCTQGAATAT ATATTCTACC CCCCTGGAGC TTTGATTTTG CACTGAGOCA AAGGTCTAGC 1680 

ATTGTCAAGC TCCTAAAGOG TTCATTTGGC OCCTCCTCAA AGACTAATGT OCCCATGTGA 1740 

AAGG3TCTCT TTGTCTGGAG CTTTGAGOAG ATGTTTTCCT TCACTTTAGT TTCAAACCCA 1B0O 

AGTGAGTGTG TGCACTTCTQ CTTCTlTAGG GATGCCCTGT ACRTCCCACA CCGCAGCCTC 1860 

CCTTCCCTTC ATACCOCTCC TCAAOGTTCT TTTACTTTAT ACTTTAACTA CCTGAGAGTT 1920 

ATCAGAGCTG GGGTTGTGGA ATGATCGATC ATCTATAGCA AATAflGCTAT GT1GAGTAOG 1980 

TAGGCTGTGG GAAGATGAAG ATGGTTTGGA GGTGTAAAAC AATGTCCTTC GCTGAGGCCA 2040 

AAGTTTOCAT GTAAGCGGGA TCCGTTTXTT GGAATTTGGT TGAAGTCACT TTGATTTCTT 2100 

TAAAAAACAT CTTTTCAATG AAATGTGTTA CCATTTCATA TCCATTGAAG COGAAATCTG 2160 

CATAAGGAAG COCACTTTAT CTAAATGATA TTAGCCAGGA TCCTTGGTGT CCTAGGAGAA 2220 

ACAGACAAGC AAAACAAAGT GAAAACCGAA TGGATTAACT TTTGCAAAOC AAGGGAGATT 2280 

TCTTAGCAAA TGAGTCTAAC AAATATOACA TCCGTCTTTC CCACTTTTQT TGATGTTTAT 2340 

TTCAGAATCT TGTGTGATTC ATTTCAAGCA ACAACATGTT GTATTTTGTT GTGTTAAAAG 2400 

TACTTTTCTT GATTTTTGAA TGTATTTGTT TCAGGAAGAA GTCATTTTAT GGATTTTTCT 2460 

AACCCGTGTT AACTTTTCTA GAATCCACCC TCTTGTGCCC TTAAGCATTA CTTTAACTGG 2520 

TAGGGAAOGC CAGAACTTTT AAGTCCAGCT ATTCATTAGA TAGTAATTGA AGATATGTAT 2580 

AAATATTACA AAGAAXAAAA ATATATTACT GTCTCTTTAG TATGGTTTTC AGTGCAATTA 2640 

AACCGAGAGA TGTCTTGTTT TTTTAAAAAG AATAGTATTT AATAGGTTTC TGACTTTTGT 2700 
GGATCATTTT GCACATAGCT TTATCAACTT TTAAACATTA ATAAACTGAT TTTTTTAAAG 

Beq ID NO: 2 Protein sequence 
Protein Accession #i NF_001391 

1 11 21 31 41 51 

1 1 I I I 1 

MGPTSVPLVK AHRfiSVSDYV NYDIIVRHXW YTGKUUSAD XEWSIKHTSV VPILICCFH 
LEETIFVLLTI WKTKKFHRFM YYFIGNLALB DLLAGVAYTA NT J J, S PATTY KLXPAQWFLR 
EGSKFVALSA 8VFSLLAIAI ERYITMLKMK LHNGQNNFRL FLLISACWVI SI.ILGGLPIM 
GKHCISALSS CBTVIiPI/tfttX HYILFCTTVF TLTiTifiH rVIIj YCRIY8LVRT HSRRLTFKKK 
ISKASRS8BSK SWVLIiKTVII VLHVFIAjCWA PI>FILLLLDV GCKVKTCDIL FHAS*FI.VIA 
vrJJSffEMFII YTLiTNKEMRR AFIR3MSCCK CPSQD3AGKF KRPIIAGMEP SRSK8DN3SH 
PQKDEGDNFE TIMSSGNVN9 33 

Sep: ID NO; 3 DMA seocuence 
nucleic Acid Accession NM_002205-1 
Coding sequence: 1..3149 



60 
120 
160 
240 
300 
360 



1 
I 

ATGGGGAGOC 
CGCCGACCCC 
GGCTTCAACT 
GGATTCTCAG 
CCCAAQGCTA 
TGGGGTQOCA 
CTGGAGTCCT 
TGGTTCGGGG 
AGCTGGCGCA 
QATAACTTCA 



11 

I 



CGCTSSTGCC 
TAGAOGCGGA 
TGQAGTTTTA 
ATACCAGCCA 
GCCCCACACA 
CACXGTCCAG 
CAACAGTTCG 
CAGAGAAGGA 
CCGGAATTCT 



21 
I 

GTCCCCTCTC 
GCTGCTGTTG 
GGCCCCAGCA 
COGGCCGGGA 
GCCAGGAGTG 
GTGCACCCCC 
CTCAGAGGGA 
AGCCCATGGC 
GCOVCTGAGC 
GGAGTATGCA 



31 
I 

CACGCCGTGC 
CTGCTSSTGC 
GTACTCTOGG 
ACAGACGGGG 
CTGCAGGGTG 
ATTGAATTTtJ 
GAGGAGCCTG 
TCCTCCATCT 
GACGCCGTGG 
CCCTGCCGCT 




TCAGTGTGCT 
GTGCTGTCTA 
ACAGCAAAGG 
TGGAGTACAA 
TGGCATGOGC 
GCACCTGCTA 
CAGATTTCAG 



CXOCTTCTTC 
GGTGGGAGCA 
CCTCTGTCCT 

GTCCllGCAG 
TCCACTGTAC 
CCTCTCCACA 
CTGGGCAGCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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GGACAGGGTT 
TTAGGTGGAC 
ATTGCAGAAT 
CGCCAGGCCA 
TTCAGTGGTG 
GGCTATGTCA 
CAGATGGCCT 
GATGACTTGC 
GAGGTGGGCA 
CTTACCCTCA 
SACCXGGACC 
GAGCAGGGAG 
CfiGGTTCTGC 
CGAGGAGGCC 
GTGGACAAGG 
ATCTTCCCCG 
GCCTGCATCA 
GGTTTCACAG 
CTGTTCCTGG 
CGAGAGGATT 
CTCTOGCOGA 
CACGGCCTCA 



CAGAATGTGG 
GCTGAGTACT 
TTTGCOGTGA 
GCCAGTCTGT 
ATCCAGTTTG 
TCCTTTCGGC 
GAGGCAGTGC 
GACCTGGGAC 
AGCCAGGGTG 
GTGACCAGAG 
GAGTTGGATC 
TCTGCTTCCT 
TGTGAGCTCG 
TGGGCCAAGA 
TACAAAGCCC 
CAGGTGGCCA 
ATCATGATCC 
TACAAGCTTG 



ACTGCCAAGG 
CAGGAAGCTA 
CTTATTACCC 
GTTCCATCTA 
ATGACACAGA 
CCATCCTTAA 
CCTACTTTGG 
TGGTGGGGGC 
GGGTCTACGT 
CTGGCCATGA 
AGGATGGCTA 
TAGTGTTTGT 
AGCCCCTGTG 
GAGACCTGGA 

ctgtggtaxa 
ccatcttcaa 

ACCTTAGCTT 
TGGAACTTCA 
CCTCCAGGCA 
GCAGAGAGAT 
TTCACATCGC 
GGCCAQCCCT 
ACTGTGGAGA 
AOCATGTGTA 
GTGAGGGTGG 
CAGOACTCGT 
ACCAGAGCCG 
GGGGTGGGCT 
ACTTCCAGAT 
TCTCOGTGGA 
TATTCCCAGT 
CTGCTGTCCA 
TGCTGGAACT 
TTACGGGACT 
CCGAGGGTTC 
CGGGACCTCA 
GGCCCCTGCA 



AGGCTTCAGT 
TTTCTGGCAA 
CGAGTACCTG 
TGATGACAGC 
AGACT T TGTT 
TGGCTCAGAC 
CTATGCAGTG 
ACCCCTGCTC 
CTACCTGCAG 
TQAGTTTGGC 
CAATGATGTG 
ATTTCCTGGG 
GGCAGCCAGC 
TGGCAATGGA 
CAGGGGCCGC 
CCCRGAGGAG 
CTGCCTCAAT 
QCrOGACTGG 
GGCAACCCTG 
GAAGATCTAC 
TCTCAACTTC 
ACATTATCAG 
AGACAACATC 
CCTGGGTGAC 
CGCCTATGAG 
CAGACACCCA 
CCTGCTGGTG 
TCGGTTTACA 
CCTCAGCAAG 
GGCTCAGGCC 
AAGCGACTGG 
CCATGT CTAT 
CAGCTGTCCC 
CAACTGCACC 
CCTGCACCAC 
GATCCTGAAA 
CCAAOAAGAG 



TGAAGATGCC CTACCGAATC 
CAGCTGTGCA ATGGACCAAG 
TAGCCATCCT GTTTGGCCTC 
GATTCTTCAA ACGCTCCCTC 
CAGCCACCTC TGATGCCTGA 



GCCGAGTrCA 
GGCCAGATCC 
ATCAACCTGG 
TACCTAGGAT 
GCTGGTGTGC 
ATTCGATCCC 
GCCGCCACAG 
ATGGATCGGA 
CACCCAGCCG 
CGATTTGGCA 
GCCATCGGGG 
GGCCCAGGAG 
CACACCCCAG 
TATCCTGATC 
CCCATCGTQT 
CGGAGCTGCA 
GCTTCTGGAA 
CAGAAGCAGA 
ACCCAQACCC 
CTCAGGAACG 
TCCTTGGACC 
AGCAAGAGCC 
TGTGTQCCTO 
AAGAATGCCC 
GCTGAGCTTC 
GGGAACTTCT 
TGTGACCTGG 
GTCCCTCA.TC 
AATCTCAACA 
CAGGTCACCC 
CATCCCCGAG 
BAGCTCATCA 
CAGGCTCTGG 
AOCAATCACC 
CAGCAAAAAC 
TGCCCGGAGG 
AGCCAAAGTC 
CAOCCATTTA 
CTGOCTCGGC 
GCAGAAGGCA 
CTGGTCCTAG 
C CAT AT QQCR 



CCAAGACTGG 
TGTCTGCCAC 
TTCAGGGGCA 
ACTCTGTGGC 
CCAAAGGGAA 
TCTACAACTT 
ACGTCAATGG 
CCCCTGAOGG 
GGATAGAGCC 
GCTCCTTGAC 
CrCCCTTTGG 
GGCTGGGCTC 
ACTTCTTTGG 
TGATTGTGGG 
CCGCTAQTGC 
GCTTAGAGGG 
AACACGTTGC 
AGGGAGGGGT 
TGCTCATCCA 
AGTCAGAATT 
CCCAAaCCCC 
GGATAGAGGA 
ACCTGCAGCT 
TGAACCTCAC 
GGGTCACCGC 
CCAGCCTGAG 
GCAACCGCAT 



ACTCGCAAAG 
TGAACGGTGT 
ACCAGCCTCA 
ACCAAGGCCC 
AAGGTCAGCA 
CCATTAACCC 
GGGAAGCTCC 
CTGAGTGTTT 
TGCAGTTGCA 



CCGTGTGGTT 
TCAGGAGCAG 
GCTGCAGACT 
TGTTGGTGAA 
CCTCACTTAC 
CTCAGGGGAA 
GGACGGGCTG 
GCGGCCTCAG 
CACGCCCACC 

TGGGGAGACC 
TAAGCCTTCC 



CTCCCTCACC 
GAACCCTGTG 
TGACTCCATT 
ACGGCGGGCA 
GAATGGGGCT 
TCGAGACAAA 
AOTGGACAGC 
CAAGGCTCAG 
GGAAGTGTTT 
TTTCCATGCC 
CCCTCCAGAG 
CTGTOACTAC 
GAAGGCAGGA 
TAAGAAAACC 
CGACGTGGTT 
CTCCAAGCCT 
GAAGGAGGAG 
CAGCTCCATT 
GCTCCTATAT 
AAAGGGCCTG 
AAGOCGCAGC 
CAGGCTGCGC 
TTTCCGAGTC 



Seq ID MO; 4 Protein sequence 
Protein Accession #i NP_Q02195.1 



a 
I 

MQSH.TEESPL 
GFSVEFYRPG 



lAESYYPBYIi 
GYVTILNGSD 
EVGRVYVYLC; 
QQGWFVFPG 
VDKAWYRGS 
GETVKLQLJ3W 
LSPTHIALNF 
GBQNHVYIjGD 
FAVNQSHLLV 



1 

HAVQtiRKFQPR 
TDGVSVLVGA 
BEPVEYKSLQ 
PCRSDPSMAA 
IMCVQGQLQT 
IRSLYNFBGE 
HPAGIEPTPT 



SQGVLEfcSCP 
SASSGPQILK 
YKAIiKMPTfRI 
YKLGFFKR6L 



PIVSA£A£1»T 
QKQKGGVRRA 
SLDPQAPVDS 
KNALNUTFHA 
CDLpGNPMKAG 
QVTlllGVSKP 
OALEGQQU^Y 
CPEAECFRLR 
LPRQIiPQKER 
PYGTAMEKAQ 



21 

I 

RRPEfLUT.T.T. 
PKftKTSQPGV 
WFGATVRAHG 
GQGTCQGGFS 
SQASSIYDDS 
QMASYFGYAV 
LTLTGEDEFG 
QfVXrQPlWAAS 
IFPAMFUPEE 
LPIiASHQATIt 
HGLRPALHYQ 
QNVGEGGAYE 
ASLW3GLKET 
EAVLFPVEDW 
VTKVTGLNCT 
CELGPLHQQE 
QVA.TAVQWTK 
LKPPATfiDA 



31 
I 

liLLPPPPRVG 
LQGGAVYIjCP 
SSILACAPLY 



AGCTGCCCCA 

gctatggcgt 
gtctactgat 
ccgocatgga 



41 



GFNLDAEAPA 
WGASPTQCTP 



AAAAGAGCGT 
CCCACTGTGG 
CTACATCCTC 
AAAAGCTCAG 



51 
I 

VLSGPPGSFP 



YliGYSVAVGE 
AATDVHGDGL 
RFGSBLTELG 



LGGPGSVFHQ 
FSGDDTEDFV 
DBLIiVGAPLL 
DLDQDGTODV 
HGGRDLDGNG 



RSCSLKGNPV 
TQTW>IQjNGA 
SX8RIEDKAQ 
AELRVTAPPE 



TNEPINPKGL 
SQ9LQLHFRV 
AKGBYGVPliW 



Sag ID NO; 5 DMA sequence 
Huclcic Acid Accession #: 
Coding sequence: l.,23 97 



NM_002211 . 1 



1 
I 

ATGAATTTAC 
CAAACAGATG 
GCAGGGCCAA 
TCTGCACGAT 
GAAAATCCCA 
GGAACAGCAG 

ttgcgattaa 

TATCCCATTG 
AA0X3TAAAAA 



11 
I 

aaccaatttt 

AAAATAGAXG 
ATTGTGGGTG 
GTGATGATTT 
GAGGCTCCAA 
AGAAGCTCAA 
GATCAGGGGA 
ACCTCTACTA 
GTCTTGGAAC 



21 
I 

CTGGATTGGA 
TTTAAAAGCA 
GTGCACAAAT 
AGAAGCCTTA 
AGATATAAAG 
GCCAGAGGAT 
GCCACAGACA 
CCTTATGGAC 
AGATCTGATG 



31 

! 

CTGATCAGTT 
AATGCCAAAT 



AAAAAGAAGG 
AAAAATAAAA 
ATTACTCAGA 
TTTACATTAA 
CTGTCTTACT 
AATOAAATGA 



RKDCREMKIY 
ILIiDCQBDNI 
AEYSGTjVKHP 
IQFDFQILGK 
PIX3PAVHHVY 
ELDPEGSLHE 
WAXT5XQHEH 
riHiATLPGlj 



41 
I 

CAGTTTGCTG 
CATGTGGAGA 
TACAGGAAGG 

ATGTAAOCAA 
TCCAACCACA 
AA7TCAAGAG 
CAATGAAAGA 
GGAGGATTAC 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 BO 
1440 
1500 
1560 
1620 
1680 
1T40 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
22 B0 
2340 
2400 
2460 
2520 
2560 
2640 
2700 
2760 
2820 
208O 

2 940 

3 000 
3 060 
3120 



DPVGTCYLST 
GQILSATQEQ 
AGVPXGNI/TY 
MDRTPDGRPQ 
AXGAPFGGET 
ypDLXVGSFG 
ASGKHVADST 
LUNBSEFRDK 
CV PDL Ql>EJVF 
GWFSSIiSCOT 
NUSKfiQSDW 



CQKREAPERS 
QPPSIiQCEAV 
liliGLLmii 



51 
I 

TGTGTTTGCT 
ATGTATACAA 
AATGCCTACT 
AGATGACATA 
CCGTAGCAAA 
GCAGTTGQTT 
AGCTGAAGAC 
CGATTTGGAG 
TTCGGACTTC 



60 
120 
180 
240 
300 
3 60 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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AGAATTGGAT 
GCTAAGCTCA 
AATGTGCTCA 
ATATCTGGAA 
TGTGGATCAC 
GGGTTTCACT 
TGTCACCTGG 

caccttgt cc 

TTTCAGCCTG 
TCTGCAAATT 
GAAGTCATTT 
TGCAAGAACG 
GGAGATGAGG 
GACAGCTTTA 
ATCTGTGAAT 
AATGGGACAT 
GAATGCAGCA 
AGTTCAGAAA 
AGGGATAATA 
GAT AG AT CCA 
TGCAACCCCA 
GCCAGCAACG 
ACAGATCCGA 
GCTGAGCATA 
TGCACACAGG 
CAGCOGGTCC 
TTCTATTTTA 
CCAGAGTGTC 
QTTCTTATTG 
AGGGAGTTTG 
CCTTATTTATA 



TTGGCTCATT 
GGAACCCTTG 
GTCTTACTAA 
ATTTGGATTC 
TGATTGGCTG 
TTGCTGGAGA 
AAAATAATAT 
AGAAACTBAG 
TTTACAAGGA 
CTAGCAATGT 
TGGAAAAOGG 
□GGTGAATGG 
TTCAATTTGA 
AAATTAGQCC 
GTGAATGCCA 
TTGAGTGTGG 
CAGATGAAGT 
TCTGCAGTAA 
CAAATGAAAT 
ATGGCTTAAT 
ACTACACTGG 
GACAGATCTQ 
AGTTTCAAGG 
AAGAATGTGT 
AATGTTCCTA 
AACCTGATCC 
CGTATTCAGT 
CCACTGGTCC 
GCCTTGCATT 
CTAAATTTGA 
AGAGTGCOGT 



TGTGGAAAAG 
CACAAGTGAA 
TAAAGGAGAA 
TCCAGAAGGT 
GAGGAATGTT 
TGGGAAACTT 
GTACACAATQ 
TGAAAATAAT 
GCTGAAAAAC 
AATTCAGTTG 
CAAATTGTCA 
AACAGGGGAA 
AAT TAG CAT A 
TCTGGGCTTT 
AAGCGAAGGC 
CGCGTGCAGG 
TAACAGTGAA 
CAATGGAGAG 
TTATTCTGGC 
TTQTGGAGGA 
CAGTGCATGT 
CAATGGCC3GG 
GCAAACQTGT 
TCAGTGCAGA 
TTTTAACATT 
TGTGTCCCAT 
GAATGGGAAC 
AGACATCATT 
ACTGCTGATA 
AAAGGAGAAA 
AACAACTGTG 



ACTGTGATGC 
CAGAACTGCA 
GTATTTAATG 
GGTTTCGATG 
ACACGGCTGC 
GGTGGCATTG 
AGCCATTATT 
ATTCAGACAA 
TTGATCCCTA 
ATCATTGATG 
GAAQOAGTAA 
AATGGAAGAA 
ACTTCAAATA 
ACGGAGGAAG 
ATCCCTGAAA 
TGCAATGAAG 
GACATGGATG 
TGCGTCTGCG 
AAATTCTGOG 
AATGGTGTTT 
GACTGTTCTT 
GGCATCTGCG 
GAGATGTGTC 
GCCTTCAATA 
ACCAAGGTAG 
TGTAAGGAGA 
AAGGAGGTCA 
CCAATTGTAG 
TGGAAGCTTT 
ATGAATGOCA 
GTCAATCOGA 



CTTACATTAG 
CCAGCCCATT 
AACTTGTTGG 
CCATCATGCA 
TGGTGTTTTC 
TTTTACCAAA 
ATGATTATCC 
TTTTTGCAGT 
AGTCAGCAGT 
CATACAATTC 
CAATAAGTTA 
AATGTTCCAA 
AGTGTCCAAA 
TAGAGGTTAT 
GTCC5CAAGTG 
GGCGTGTTGG 
CTTACTOCAG 
GACAGTGTGT 
AGTGTGATAA 
GCAAGTGTCG 
TGGATACTAG 
AGTGTGGTGT 
AOACCTaCCT 
AAGGAGAAAA 
AAAGTCGGGA 
AGGATGTTGA 
TGGTTCATGT 
CTGGTOTG3T 
TAATGATAAT 
AATGGGACAC 
AGTATGAGGG 



CACAACACCA 
TAGCTACAAA 
AAAACAGCQC 
AGTTGCAGTT 
CACAGATGCC 
TGATGGACAA 
TTCTATTGCT 
TACTGAAGAA 
AGGAACATTA 
CCTTTCCTCA 
CAAATCTTAC 
TATTTCCATT 
AAAGGATTCT 
TCTTCAGTAC 
TCATGAAGGA 
TAGACATTGT 
GAAAGAAAAC 
TTGTAGGAAG 
TTTCAACTGT 
TGTGTGTGAG 
TACTTGTGAA 
CTGTAAGTGT 
TGGTOTCTGT 
GAAAGACACA 
CAAATTACCC 
CGACTGTTGG 
TGTGGAGAAT 
TGCTGOAATT 
TCATGACAGA 
GGGTGAAAAT 
AAAATGA 



Seq ID NO i 6 Protein sequence 
Protein Accession #t NP 002202.1 



MNLQPIFWIG 
SARCDDLEAIi 
LRLRSGEPQT 



ISGNLDSPEG 
CSItENNMYTZp 
SANSSNVIQL 
GDEVQFEISI 
NGTFEOGACR 



11 
1 

USSVCCVFA 

KKKGCppnai 

FTI»KFKHAED 
TVMPYISTTP 
GFDAIMQVAV 
SHXTTOYPSTA 
IIDAYNSLSS 
TSKTKCPKKDS 
CNEGRVGRKC 



ASNCGICNGR 
CTQKCSYFNI 
PECPTQPDII 
PIYKSAVTTV 



PIVAGWAGI 



21 
I 

QTDEKRCt/KA 
ENPRGSKDIK 
YPIDLYYLMD 
AKLiRNPCTSE 
CGELIGWHHV 
HIjVQKIjSENW 
BVILENGKLS 
DSFKIHPLGF 
ECSTDETOJSE 
DRSNQIiX CGG 
TDPKFQGQTC 
QPVQPDPVSH 
VLIGLAI.LUC 



31 
I 

NAKSCGEBCIQ 
KNKNVTNSSK 
LSYSKKDDIiE 
QKTCTfiPFSYK 
TJtfJjVFSTTJA 

iqtifavtee 

EGVTISYKSY 
TKEVEVITjQY 
DHDAYCRKEW 
NGVCKCRVCE 
EMGQTCW3VC 
CKEKDVUDCW 
HKLIiHXXHDR 



41 
I 

AGFNCGWCTN 
GTAEKLKPED 
NVKSLGTDLM 
NVLSLTNKGE 
GFHFAGDGKX* 
PQPVYKKTiKH 
CFKGVNGTGE 
ICECECQSEG 
SSEICSMWGE 
CKFNYTGSAC 
AEHKECVQCR 
FYFTYSVNGN 



Seq ID NO: 7 DMA sequence 

Nucleic Acid Accession NM_002425 

Coding sequence: 26. .1453 



1 

\ 

AAAGAAGGTA 
AGTCTGCTCT 
TGCCCAGCAA 
AAAGGACAGT 



TCCTGACGTT 
TACATACAGG 
TGAGAAAGCT 
AGGAGAGGCT 
TGATGGCCCA 
TATTCACTTT 
CGTTGCTGCT 
TTTGATGTAC 
TGATGTGAAT 



11 
I 

AGGGCAGTGA 
GCCTATCCTC 
TACCTAGAAA 
AATCTCATTG 
AAGCTAGACA 
GGTCACTTCA 
ATTGTGAATT 
CTGAAAGTCT 
GATATAATGA 



GTGCTTCGAT 
TTGGCGAAGA 
CTCTCTTCCA 
TTTTAAAGGA 
AGGCATCCAT 
CAAGGAAAAG 
TAGCCAGTCC 
GCCTAAGGTT 
ACAGTTTGAG 
GTTACATTGC 
ATTATTCATC 



GATGATGATG 
CATGAACTTG 
CCACTCTACA 
QGCATTCAGT 
AAATCTGTTC 
GCCATCAGCA 
TCCCACTGGA 
TCATATTTGG 
AATGAGTTCT 
ACCCTGGGTT 
AAGAAAACAT 
ATGGAGCAAO 
G ATG CTOTAT 
TTTGACCCCA 



TAATGTATTA 



21 
I 

GAATGATGCA 
TGAGTGGGGC 
AGTACTACAA 
TTAAAAAAAT 
CTGACACTCT 
GCTOCTTTCC 
ATACACCAOA 
GGGAAGAGGT 
TCTCTTTCGC 
TGGCTCATGC 
AAAAATGGAC 
GCCACTCCCT 
ACTCATTCAC 
CTCTCXACGG 
CTTOGGGATC 
CTCTGAGGGG 
ACCCTGAACC 
ATGCTGCATA 
GGGCCATCAG 
TTCCTCCAAC 
ACTTCTTTGC 
GCTTCC CTAG 
TACAGGCATT 
ATGCCAGGAT 
AGGGGGAAGA 
TGAGCCAAAA 



3L 
t 

TCTTGCATTC 
AGCAAAAGAG 
CCTCGAAAAG 
CCAAGGAATG 



TGGCATGCCG 
TTTQGCAAGA 
GACTCCACTC 
AGTTAAAGAA 
CTACOCAOCT 
AGAAGATGCA 
GGGGCTCTTT 
AGAOCTCGCC 
ACCTCCCCCT 
TGAGATGCCA 
AGAATATCTG 
TQAATTTCAT 
TGAAGTTAAC 
AGGAAATGAG 
CATAAGGAAA 
AGOGGACAAA 
ACTAATAGCT 
TGGATTTTTC 
GGTGACACAC 
CAGATATGGG 
TGGXTAATTT 



41 
I 

GAGGACTCCA 
GATGTGAAAC 
CAGAAGTTCC 
CGCAAGCCCA 
AAGTGGAGGA 
GATGCTGTTG 
ACATTCTCCA 
CATGGAGACT 



TCAGGCACCA 
CACTCAGCCA 
CAGTTCCGCC 
GCCTCTACTG 
GCCAAOTGTG 
TTCTTTAAAG 
TTGATTTCTG 
AGCAGGGACA 
GTACAAGCAG 
ATTGATGCAG 
TACTGGAGAT 
GATGACTTTC 
TACTTCTTCA 
ATATTAAAGA 
TGTTTTTAAT 
TTCCTGCATG 



51 
I 

ACAAGGATCT 
AGTTTAGAAG 
TTGGGTTGGA 
GGTGTGGAGT 
AAACCCACCT 
ATTCTGCCAT 
GGCTGTATGA 
TTTACTCTTT 
TTTATGGAGA 
ATTTAT TOCT 
ACACTGAAGC 
TTTCGCAAGA 
AGGAAOCCCT 



600 
660 
720 
7B0 
640 
90 0 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



51 
1 

STFIjQEt3HPT 
ITQIQPQQLV 
NEMRRITSDF 
YFNEIiVGKQR 
GGIVLPNDGQ 
UPK£AVGTL 
KGRKCSNISI 
1PESPXCHEG 
CVCGGCVCRK 
DCSLDTSTCE 
APSKGEKKDT 
USVKVHWEK 



ACAGATATTT 
CATTTTGGCC 
CCGTTTTTAT 
GTTATCCAAG 
CTGTTTCTGA 
TTGATGAAAA 
CAGGAGTTGA 
GTGGATCATC 
GTAACAGCTG 
AAATCTAATA 
TTCTGTGACT 



€0 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
€60 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



951 



WO 03/042661 



PCT/US02/36810 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATQTTTCT GGAATTCTTC 1620 
ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 
ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 
CTT 

Seq ID NO: 6 Protein sequence 
Protein Accession ft: HP 002416 



1 
I 

MHXAFLVLLC 
KIQjGMQXFLG 
PDLPRDAVDS 
HAYFFGFGDY 
PTEIiAQPRIjS 
RGEYLFFKDR 
rRGNEVQAGY 
PRLIADDFPG 



11 
I 

LPVCSAYPI3S 
LEVTGKIiDTD 
AIEKALKVKE 



QDDVWGIQSI, 
YFWRRSHWKP 
PRGIHTLGFP 
VEPKVDAVLQ 



21 
I 

GAAKEEDSNK 
TLEVMRKPRC 
EVTPLTFSRL 
HTEDASQTWIj 
YGPPPASTEE 
EPEFKLISAF 
PTIRKJDAAV 
AFGFPYFFSG 



3X 

! 

DLAQQYLEKY 
GVPDVGHFSS 
YEGBADIMIS 
FliVAAHELGH 
PLVPTKSVP9 
WPSLPSYLDA 
SDKBKKKTYF 



41 

i 

YNLEKDVKQF 
PPGMPKWRKT 
PAVKEHGDFY 
SLGLPHSANT 
GSEMPAKCDP 
AYEVNSRDTV 
FAADKYWRFD 
SMVTHILKSN 



51 
I 

RRKDSNIiXVK 
HLTYRIVNYT 
SFDGPGH3IA 
EALMYPLYHS 
ALSPDAISTL 
FIFKGflEFHA 



SHLHC 



Seq ID NO 7 9 DMA aequence 

nuclei c Acid Accession #* XM_0581B9-2 

Coding sequence : 169 . , 774 



GAAGACCAGC 
AGTATTTCAC 
TATGAAGTGA 
AAGTGTGGAG 
AACATATTAT 
AACTACGTGT 
ACAGTTCTTC 
AGCAAAAAAT 
GGATACTGCC 
GATGGCTGGG 
TGGATTCAGT 
CTCATAACCC 
TCCAAGATAC 
ACAAAATGTr 
GACTTGAGGG 
TtTGCAGTCC 
AATATGCATC 
GAATCTGTAC 
TGCAACATTA 
AGGATTAABr 
ATAAAAAATA 



11 
1 

TCAGCTCTTC 
ACTGAQATTG 
AGGGCTCTGA 
GCTGCCTAAQ 
TGTATTTCCC 
GGTATTTTGA 
TGGTACTGGA 
ATGTGACACT 
TGGTCATCTC 
AGTATGCTTT 
GCCTGGAACC 
TCAGTGGGCT 
TGTGTGGAAG 
TTCCATTATC 
CAATATTGAA 
TCACTGCACA 
AGAAACTTCA 
TAGGGAAGTA 
T TTAAT ATTC 



21 
I 

AGTTGTTGAT 
TCQGCT □ CGG 
CCCTGGAAGT 
TTGTTTGCTG 
GAATGGGCAA 
AGGAATCTGT 
GAATAATAAC 
GCTGTCAATT 
TGCCTTGGGT 
TGAAGGCACT 
TGCACATGTT 
TCAAG^GATC 
CTATTCAGTG 
AAGACATGGC 
ATGATGGTGC 
TGCAAGTATA 
GAAATACTTC 
AATAAGAAIA 
TGGAAAATTG 



31 
1 

CATTGTCTAT 
GTATATTOCA 
GGTTCTAAGC 
ATTCC3GCTTG 
ACTTCCTATG 
TTCTCAGGCA 
AACTATAAAT 
ATCIITTCIT 
CTTGTCCAAG 
GCTGGACGTT 
GTGGAGTGGA 
ATCK3CCTCA 
ATCTTCCAGC 
CATCTATCTA 
TTTCTGCATT 

ccACCcrrcc 

TGCCCTTTGA 
TGAGAGAAAC 



AATQTTTTCT 
AAGAGTACTA CATGGTTGTA AAA 



41 
I 

TGTTCTCCAA 
ATTCCCOGTC 
AGGGCAAAAT 
CACTTTGGAG 
CATCCAGCAA 
TCATGATGCT 
GTTGCCAGAG 
COCTCGGAAT 
GGCCATATTG 
TCCTTACAGA 
ACAXCATTTT 
TCAGAGTAGT 
CTGGAATCAT 
AATATTATAT 
TGGTGTTTAT 
ATTTAGTATG 
TCAAACAAAT 
CXTTATGCAA 
AAAATTCTAA 
GATATTAAAA 



51 
I 

ACAGTAAACC 
TCCTCATQAA 
GGGGTCTCGG 
TATAATCGTG 
TAAACTCAOC 
IATAGTAACA 
TGAAAACTGC 
IGCTTTTTCT 
CCGCACCCTT 
TTC TAG CAT A 
ATTTTCCATT 
CATGCAACTA 
TTGAATAAGG 
CAACTGTGXR 
TTGTAAAAAA 
TTTTTTAAGT 
CCATTTCCAA 
ATATGTATAT 
ACTCAGAGGA 
AAAAAATTAA 



Seq ZD VOt 10 Protein gggggncg 
Protein Accession #i XP_05B1B9*1 

1 11 21 31 41 51 

1 1 I 1 1 I 

HGSRKOGGCEi SCLLlPIiALK SIIVNILLYF PMGQTSYASS NXLTNYVWYF EGICFSGIMM 
LIVTTVLLVI. ENNNEYKCCQ SEHCSKKYVT fclaSIIFSSIiG IAFSGYOTVI SALGLVQGPY 
CRTLDGNEYA FEGTAGRFLT DSSIWIQC&B PAHWEWNII LPSILITLSG LQVIICLI3IV 
VMQLSKILDG SY3VIFQPGI I 

Seq JD HQ; 11 DMA sequence 

Nucleic Acid Accession #: NM_002421,2 

Coding sequences 1 . .1409 



1 
I 

ATGCACAGCT 
CCAGCGACIG 
TACTACAACC 
GTTGAAAAAT 



11 
1 

TTCCTCCACT 
TAGAAACACA 
TGAAGAATGA 
TGAAGCAAAT 
TGAAGGTGAT 



GTCCTCACTG 
TACACGCCAG 
TGGAGTAATG 
ATATCTTTTG 
CTTGCTCATG 
GAAAGGTGGA 
GGCCATTCTC 
ACCTTCAGTG 
GGACGTTCCC 
AAGCTAACCT 
TTCFACATGC 
TGGCCACAAC 
CGGTTTTTCA 
CCCAAGGACA 
CTTTCTGAGG 
GATGAATATA 



ATTTGCCAAG 
TCACACCTCT 
TCAGGGGAGA 
CTTTTCAAGC 
CCAACAATTT 
TTGGACTCTC 
GTGATGTTCA 
AAAATCCTGT 
TTGATGCTAT 
GCACAAATCC 
TGCCAAATGG 
AAGGGAATAA 
TCTACAGCTC 
AAAACACTGG 
AACGATCTAT 



21 
1 

GCTGCTGCTG 
AGAGCAAGAT 
TGGGAGGCAA 
GCAGGAATTC 
GAAGCAGCOC 
TGGCTGGGAG 
AGCAGATGXG 
GACATTCACC 
TCATCGGGAC 



31 
I 

CTGTTCTGGG 
GTGGACTTAG 
GTTGAAAAGC 
TTTGGGCTGA 
AGATGTGGAG 
CAAACACATC 
GACCATGCCA 
AAGGTCTCTG 
AACTCTCCTT 



CAGAGAGTAC 
CCATTCTACT 
GCTAGCTCAG 
CCAGCOCATC 
AACTACGAIT 
CTTCTACCCG 
GCTTGAAGCT 



AACTTACATC 
GATATCGGGG 
GATGACATTG 
GGCCCACAAA 
CGGGGAGAAG 



CTTTGGCTTC 
AAAAACCTAC 
GGATCCAGGT 



GCTTACGAAT 
GTTCAGGGAC 
CCTAGAACZG 



TATCCCAAAA 



41 
I 

GTGTGGTGTC 
TCCAQaAATA 
GGAGAAATAG 
AAGTQACTGG 
TGOCTGATGT 
TGAOCTACAG 
TTGAGAAAGC 
AGGGTCAAGC 
TTGATGGACC 
ATGCTCATTT 
GXGTTGCGGC 
CTTTGATGTA 
ATGGCATCCA 
CCCCAAAAGC 
TGATGTTCTT 
TCARTTTCAT 
TTGCCGACAG 
AGAATGTGCT 
TGAAGCATAT 
CTAACAAATA 
TQATAGCACA 



SO 
120 
1B0 
240 
300 
360 
420 



€0 
12 0 
1B0 
240 
300 
360 
420 
480 
540 
£00 
660 
720 
780 
340 
900 
960 
1020 
1OB0 
1140 
1200 



60 
120 
1B0 



51 
I 

ACACAGCTTC 
CCTGGAAAAA 
TGGCCCAGTG 
GAAACCAGAT 
GGCTCAGTTT 
GATTGAAAAT 
CTTCCAACIC 
AGACATCATG 
TGGAGGAAAX 
TGATGAAGAT 
TCATGAACTC 
CCCTAGCTAC 
AGCCATATAT 
ATGTGACAGT 
TAAAGACAGA 

AGATGAAGTC 
ACACGGATAC 
CGATGCTGCr 



TGACTTTCCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10 BO 
1140 
1200 
1260 



952 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



GGAATTGQCC ACAAAGTTGA TGCAGTTTTC ATGAAAQATG GATTTTTCTA TTTCTTTCAT 
GGAACAAGAC AATACAAAlT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 

Seg ID NO: 12 Protein sequence 
Protein Accession ft: NP_CD2412.1 



1320 
1380 



1 11 21 31 41 

I I I I I 

MHSFPPLLLL LFWGWSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ 
VEKLKQMQSF FGIiKVTGKPD AETLrKVMKQP RCGVPDVAQF VLTEGNPHHE 
YTPDLPHADV DHAIEKAFQL HSHVTPLTFT KVSEGQADIM IBFVRGDHRD 
IJVEAFQPGPG IGGDAHFDED ERVTTDlNFREY NLHRVAAHED G3ISLQL3IIST 
TF3GDVQLAQ DDIDGIQAIY GR9QNPVQPI GPQTFKACDS KIiTFDAITTI 
FYMRTNPFYP EVKEOTIfiVF WPQLPNGLBA AYEFADRDEV RFFKCWKXWA 
PKDIYSSFGF PRTVKHIDAA i>SEENTGKTY FFVANKYWRY DEYKRSMDPG 
G1GHKVDAVF MKDGFFYFFH GTHQYKFPPK TKRII»TLQKA NSWFNCRKN 

Seq ID MO: 13 DMA sequence 

ITucleic Acid Accession ft: NM_002421.2 

Coding sequence: 1 , .1409 



51 

1 

VEKRRNSGPV 
OTHIiTYRIEN 
NSPFDGPGGN 
DIGALMYP9Y 
RGEVMFFKDR 
VQQQNVLHGY 
YPKMIAHDFP 



1 
I 

ATGCACAGCT 
CCAGCGACTC 
TACTACAACC 
GTTGAAAAAT 
GCTGAAACCC 
GTCCTCACTG 
XACAOGCCAG 
TGGAOTAATG 
ATATCTTTTG 
CTTGCTCATG 
GAAAGGTGGA 
GGCCATTCTC 
ACCTTCAGTG 
GGAOGTTCCC 
AAGCTAACCT 
TT CTACATG C 
TGGCGACAAC 
CGGTTTTTCA 
CCCAAGGACA 
CTTTCTGAGG 
GATGAATATA 
GGAATTGGCC 
GGAACAAGAC 
AATAfiCTGGT 



11 
1 

TTCCTCCACT 
TAGAAACACA 
TGAAGAATGA 
TGAAGCAAAT 
TGAAGGTGAT 
AGGGGAACCC 
ATTTGCCAAG 
TCACACCTCT 
TCAGGGGAGA 
CTTTTCRACC 

ccaacaattt 
ttggactctc 

GTGATGT TCA 
AAAATOCTGT 
TTGATQCTAT 
GCACAAATCC 
TGCCAAATGG 
AAGGGAATAA 
TCTACAGCTC 
AAAACACTGG 
AACGATCTAT 
ACAAAOTTGA 
AATACAAATT 
TCAACTGCAG 



21 
I 

AGAGCAAGAT 
TGGGAGGCAA 
GCAGGAATTC 
GAAGCAGCCC 
TCGCTGGGAG 
AGGAGATQTQ 
GACATTCACC 
TCATCX3GGAC 
AGGCCCAGGT 
CAGAGAQTAC 
CCATTCTACT 
GCTAGCTCAG 
CCAGCCCATC 
AACTACGATT 
CTTCTACCOG 
GCTTGAAGCT 
GTACTGGGCT 
CTTTGGCTTC 
AAAAACCTAC 
GGATCCAGGT 
TGCAGTTTTC 
TGATCCTAAA 
GAAAAATTAG 



31 
I 

CTGTTCTGGG 
GTGGACTTAG 
GTTGAAAAGC 
TTTGGGCTGA 
AGATGTGGAG 
CAAACACATC 



AAGGTCTCTG 
AACTCTCCTT 
ATTGGAGGGG 
AACTTACATC 
GATATCGGGG 
GATGACATTG 
GGCCCACAAA 
CGGGGAGAAG 
GAAGTTGAGC 
GCTTAOOAAT 
GTTCAGGGAC 
CCTAGAACTG 
TTCTTTGTTG 
TATCCCAAAA 
ATGAAAGATG 



41 
1 

GTGTGGTGTC 
TCCAGAAATA 
GGAGAAATAG 
AAGT0ACTGG 
TGCCTGATGT 
TGACCTACAG 
TTGAGAAAGC 
AGGGTCAAGC 
TTGATGGACC 
ATGCTCATTT 
GTGTTGOGGC 
CTTTGATGTA 
ATGGCATCCA 
CCCCAAAAQC 
TGATGTTCTT 
TCAATTTCAT 
TTGCCGACAG 
AGAATGTGCT 
TGAAG CAT AT 
CTAACAAATA 
TGATAGCACA 
GATTTTTCTA 
TTTTGACTCT 



51 
1 

ACACAGCTTC 
CCTGGAAAAA 
TGGCCCAGTG 
GAAACCAGAT 
GGCTCAGTTT 
GATTGAAAAT 
CTTCCAACTC 
AGACATCATG 
TGGAGGAAAT 
TQATGAAQAT 
TCATGCCCTC 
CCCTAGCTAC 
AGCCATATAT 
ATGTGACAQT 
TAAAGACAGA 
TTCTGTTTTC 
AGATGAAGTC 
ACACGGATAC 
CGATGCTGCT 
CTGGAGGTAT 
TGACTTTCCT 
TTTCTTTCAT 
CCAGAAAGCT 



Seq H» NO j 14 Protein sequence 
Protein Accession NP_JJ 02412.1 



1 
1 

MESFPPTjTjLD 
VEKIiKQMQEF 
YTPDLPRADV 
IiAHAFQPGPG 
TFSGDVQXAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 
I 

LFWGWGHSF 
FGlfKVTGKPD 
DHAIEKAFQIx 
IGGDAHFDED 
DDIDGIQAIY 
EVELNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WgHVTPI^TPT 
ERWTNKFREY 
GESQNFVO.PI 
WPQLPKGLEA 



GTRQYKFDPK 



31 
1 

VDTiVQKYIiEK 
RCGVPDVAQF 
KVSEGQADIM 
KLHRVAAHAJj 
GPQTPKACDS 
AYEFADRDBV 
FFVANXYWHY 
TKRII/TLQKA 



41 
I 

YYKLKNDGRQ 
VLTEGKPRWE 
ISPVRCSIHED 
GHSLGLSHST 
KLTFDAlTTI 
RFFKGNKYWA 



51 
I 



QTHLTYRXEN 
NSPFDGFSGN 
DIGALMYPSY 
RGEVHFFKDR 
VQGQNVLBGY 
YPKMIAHDFP 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #t FGENESH predicted 

Coding sequence t 141..15B0 



1 
I 

TCTGCGTGTG 
AGGCAAACAD 
CTCTACAGGC 
TCCGCGAGTT 
GCGGGATCCT 
CAGTTCTCAT 
GGGTGGACGT 
ACTTCATGTA 
CCGCCACCTA 
GCTATGAACC 
CGGCCCCTCC 
CTCGAAGCTG 
GGAAAAAGTA 



11 
I 

CCGGGGCTAQ 
AGGAGGGAAG 
CTGTGTCGCT 
CACTCGCCAC 
CACTGACGTC 
CGCCTGCAGT 
GCTCTCTCTG 
CACTTCGCGC 
TTTGCAGATG 
TCTGGGCATC 
ACCAGGTAGT 
CAGTCAAGGC 
CAAGTACATC 



21 

I 

GGGCTGGAAG 
GCGTCTTAGG 

TCCTCCGACG 
ACGCXGCTGG 
□OCTTCTTCT 
CCCGGGGGTC 
CTGCGCCTCT 
GAGCACGTGG 
TCCCTGCGCC 
CCCAGGCGCT 
CCCCCCAGTC 
GTGCTAAACT 



31 
I 

TCCTGGCTCT 
ACTGCCTGGA 
CCGCCGCCCC 
TGCTGGGCAA 
TTGGCGGGCA 
ATTCAATTTT 
CCGAAGCGAG 
CTCCAGGCAC 
TCCAGGCATG 
CCCTGGAAGC 
COGAAQGACA 



CTCAGGCCTC 



41 
I 

AGTTGCACCT 
TCCAGAGCAC 
GGAGGGAGCG 
CCTCAACGAG 
ACCCCTCAGA 

AGGCTTCGCC 
TGCACCAGCA 
CCACCGCTTC 
AGAACCCCCA 
CCCAGAOCCA 
TGACCCCAAG 
CCAAGCAGGG 



120 
1B0 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
7B0 
840 
900 
960 
1020 
10BO 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 
300 
360 
420 



SI 
I 

CGGAAGGAAA 
TTTCCTCGGC 
CTGGGCTACG 
CTGCGCCTGC 
GCACACAAGG 
GCGGGAGTCG 
CCTCTATTGG 
GTCCTAGCGG 
ATCCAGGCCA 
ACACCCCCAA 
CCTACTGAAT 
GCCTGCAACT 
AGCCTGGTCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



953 



WO 03/042661 



GGGAGAGAAG TTCTGGTCAA CCTTGCCCCC AAGCCAGGCT CCCCAGTGGA GACGAGGCCT 840 

CCAJGCAGCAO CAGCAGCAGC AGCAGCAGCA GTGAAGAAGG ACCCATTCCT GGTCCCCAGA 900 

GCAGGCTCTC TCCAACTGCT GCCACTGTGC AGTTCAAATG TGGGGCTCCA GCCA3TACOC 960 

CCTACCTCCT CACATCCCAG GCTCAAGACA CCTCTGGATC ACCCTCTGAA CGGGCTOGTC 1020 

CACTACCGGG AAGTGAATTT TTCAGCTGCC AGAACTGTGA GGCTGTGGCA GGGTGCTCAT ID 60 

CGGGGCTGGA CTCCTTGGTT CCTGGGGACG AAGACAAACC CTATAAGTGT CAGCTGTGCC 1140 

GGTCTTCGTT CCGCTACAAG GGCAACCTTG CCAGTCATCG TACAGTGCAC ACAGGGGAAA 1200 

AGCCTTAGCA CTGCTCAATC TGCGGAGCCC GTTTTAACCG GCCAGCAAAC CTGAAAAOGC 1260 

ACAGCCGCAT CCATTCGGOA GAGAAGCCGT ATAAGTGTGA GACGTGCGGC TCGCGCTTTG 1320 

TACAGGTGGC ACATCTGOGG GCGCROGTGC TGATCCACAC CGGGGAGAAG CCCTACCCTT 1380 

GCCCTACCTG CGGAACCCGC TTCCGCCACC TE3CAOAOCCT CAAGAGCCAC GTTCGCATCC 1440 

ACACCGGAGA GAAGCCTTAC CACTGCGACC CCTGTGGCCT GCATTTCCGG CACAAGAGTC 1500 

AACTGOGGCT GCATCTGGGC CAGAAACACG OAGCTGCTAC CAACACCAAA OTGCACTACC 1560 

ACATTCTOGG GGGGOCCTAG CTGAGCGCAG GCOCAGGOCC CACTTGCTTC CTGCGGGTGG 1620 

GAAAGCTGCA GGOCCAGGCC T TG CTTOCCT ATCAGGCTTG GGCATAGGGG TGTGCCAQGC 16 SO 

CACTTTGGTA TCAGAAATTG OCACCCTCTT AATTTCTCAC TGGGGAGAGC AGGGGTGGCA 1740 

GATCCTGGCT AQATCTGCCT CTGTTTTGCT GGTCAAAACC TCTTCCCCAC AAGCCAGATT 1800 

GTTTCTGAGG AGAGAGCTAG CTAGGGGCTG GGAAAGGGGA GAGATTGGAG TCCTGGTCTC 1860 

CCTAAGGGAA TAGCCCTCCA CCTGTGGCCC CCATXGC&TT CAGTTTATCT GTAAATATAA 1920 

TTTATTGAGG CCTTTGGGTG GCACCGGGGC CTTCATTCGA TTGCATTTCC CACTCCCCTC 1980 

TTCCACAAGT GTQATTAAAA GTGACCAGAA ACACAGAAGG TOAGATCACA GCTCTGCTGG 2040 

CAGAGATTAC TAGCCCTTGG CTCTCXCGTT TGGCTTGGGT ATTTTATATT ATTTCTGTCA 2100 

TAACTTTTAT CTTTAGAATT GTTCTTTCTC CTGTTTGTTT GCTTGTTAGT TTGTTTAAAA 21S0 

TGGAAAAAGG GGTTCTCTGT GTTCTGCCCC TGTAATTCTA GGTCTGGAAC CTTTATTTGT 2220 

TCTAGGGCAG CECTGGGAAC ATGCGGGATT GTGGAATTGG GTCAGGAACC CTCTCTGGTA 22SO 

TMCTGGATGT TGTAGQTXCT CTAGGAGTCT AGAAATGGAT ACAQACATTT CTCTGTTCTiT 2340 

CAAGGGTGAT AGGAACCATT ATGTTGAGCC CAAAATGGAA GTAATAATAA ATGCCTCCTG 2400 

QAGGCTGTGG GTGTGGGQGA TTCTGTATCT GGATTCCGTA TCACTCCAAC TGGAQGCTGT 2460 

GGGTGTGGGG GATTCTGTAT CTGGATTCCG TATCACTCCA AGTGGAGGCT GGCAGGTTTT 2520 

TCTGCAAGAT GGTCCAGAAT CTAAAATGTC CCATTAATCT GGTCACTTGG GTTTGGCTCT 2 5B0 

GCTGTATCCA TCTATAGTGG TAGAGACCCA CCAGGGCTCA AGTGGAGTCC ATCATCCTCC 2640 

CACGGGGGCC TGTTCTTAGC ACTGAGTTGA TCGCTCCATG GGGGAGAGAT CAGACATTCC 2700 

TTATCAGAGA TGATGTGACC TTTTCTGACT CTGCCCAGTC TCTATGAATG TTATGGCCTA 2760 

GGGAAGAATC ATGAAACTCT TTAGGTTGAT TAGATGGTAA ACAGTGTTAA CCGATCCTTT 2820 

ACTACAGAGG CATATQGGTT TGAATGTTAC CTGGGGTTCT CTCTATTGAG TTGAGCCCCT 2880 

SCTTCCTTTA GTGGGTTTTG GACATCTTCT GGCAAGTGTC CAGATGCCAG AACCTTCTTT 2&40 

TCCTCTAGAA GGGATGGTGC TTGGTAACCT TACCTTTTAA AAGCTGGGTC TGTGACCTGG 3D00 

TCTTCCCATC CCTGCATTCC TGTCTGGAAC CAGTGAATGC ATTAGAACCT TCCATAQGAA 3 060 

AAGAAAAGGG GCTGAGTTCC ATTCTGGGTT TGCTGTAGTT TGGTTGGGAT TATTGTTGGC 3120 

ATTACAGATG TAAAAGATTG ACTAGCCCAT AGGCCAAAGG CCTGTTCTAG TTGACCAAGT 3180 

TTCAAGTAGG ATTAAGAGGT TGGTTGAGGG GTGCAGTTTC TGGTGTAGGC CAGGTAGGTA 3240 

GAAAGTGAGG AAjCAGGGTTG CCTCTTGGCT GGGTGGAGTC TCTGAAATGT TAGAAGAAGC 3300 

GCTGAAGCCT TGATTGATAG TTCTGCCCCT TGTTGCCCTG OGGCTTATCT GATTATGGGA 3360 

CGAGGGTAGA AAGTAAGAAG CACTTTTGAA TTTGTGGGGT AGAACTTCAA CAATAAOTCA 342 0 

GTTCTAGTGG CTGTCGCCTG GGGACTAGTG AGAAAGCTAC TCTTCTCCCT CTTCCCTCTT 3480 

TCTCCCCATG GCCCCACTGC AQAATTAAAG AAGGAAGAAQ GGAAGGCGGA GGAGTCTATA 3540 

AGAAGGAATC ATGATTTCTA TTTAGCAGAT TGGATGGGCA GGTGGAGAAT GCCTGGQGGr 3600 

AGAAATGTTA GATCTTGCAA CATCAGATCC TTGGAATAAA GAAGCCTCTC TGYGCWRAAA 366G 
AAAAAAAAAA AAAAAA 



Seq H> NO i 16 Protein sequence 
Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

1 I I 1 I i 

MGSPAAPEGA LGYVREFTRH 8SBVLGNLNB LRLRGILTDV TLLVGGQPUt AHKAVLIACS 60 

GPPYSIPRGR AaVQVUVI»SI» PGGPEAROPA PLLDFMYTSR LRIiSPATAPA VLAAATYLQM 120 

EHWOACHRF IQASYEPLGI SLRPLEAEPP TPPTAPPPGS PBRSBGHPDP PTE8RSCSOG 180 

PPSPASFDPX ACNWKKYKYI VLN9QA3QAG BIiVGERSSGQ PCPQAHLPSG DEASSSSS93 240 

S3SSEEGPIP GPQ9H.LSPTA ATVQFKCGAP ASTPYLLTSQ AQDTSGSPSS RARFLPGSEF 300 

FSCQNCEAVA GCSSGLDSLV PGDKDKPYKC QLCRfiSFRYK GKLASHRTVH TGEKFYHC S I 360 

DGARFNRFAW JJCTH5RTHSG KKPTKCETCG SRFVQVARLH ABVI*IHTGEK PYPCPTCGTR 420 
FRHLQTLKSE VRIHTGSKPY HCDPCGLHFR HKSOtRIJlIA OKHGAAXNTK VHYHILGGP 



Seq ID KO: 17 DNA gggugngg 

Nucleic Acid Acce^fi ion #j XM_039209 

Coding sequence; 1 , .2049 



1 11 21 31 41 51 

I 1 1 I I I 

ATGCTGAAGA TGCTCTCCTT TAAGCMCTG CTGCtGQCOG TtSGCTCTGGG CTTCETTGAA 60 

GGAGATOCTA AGTTTGGGGA AAGAAACGAA GGGAGCGGAG CAAGGAGGAG AAGGTGCCTG 120 

AATGGGAACC CCCOGAAGCG CCTGAAAAGG AQAGACAGGA GGATGATGTC CGAGCTGGAQ 180 

CTGCTQAGTQ GGGGAGAGAT GCTGTGCGGT GGCTTCTACC CTOGGCTGTC CTGCTGCCTG 240 

GGGAGTGACA GCOCGGGGCT AGGGQGCCTG GAGAATAAGA TATTTTCTGT TACCAACAAC 300 

ACAGAATGTG GGAAGTTACT GGAGGAAATC AAATGTGCAC TTTGCTCTCC ACATTCTCAA 360 

AGCCTGTTCC ACTCACCTGA GAGAGAAGTC TTGGAAAGAG ACCTAX3TACT TCCTCTGCTC 420 

TGOJUVaACT ATTGCAAAGA ATTCTTTTAC ACTTGC03AG GCCATATTCC AGGTTTOCTT 480 

CAAACAACTG CGGATGAGTT TTGCT T TTAC TATGCAAGAA AAGAXGGTGG GTTGTGCTTT 540 

CCAGATTTTC CAAOAAAACA AGTCAGAGGA CCAGCATCTA ACTACTTGGA OCAGATGGAA 600 

GAATATGACA AAGTGGAAGA GATCAGCAQA AAGCACAAAC ACAACTGCTT CTGTATTCAG 660 

GA0GTTGJTGA GTGGGClGCG GCAGCCCGTT GGTQCCCTGC ATAGTGGGGA TGGCTOGCAA 720 

OGTCTCTTCA TTCTGQAAAA AGAAGGTTAT GTGAAGATAC TTACCCCTGA AGGAGAAATT 780 

TTCAAOOAGC CTTATTTGGA CATTCACAAA CTTGTTCAAA GTGGAATAAA GGGAGOAGAT 040 



954 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 
20 



GAAAGAGGAC 
GTGTCCTATA 
GTTGTGGAAT 



CCTGACGGCT 
GAAATGGATG 
ATGTGTAACG 
CCCCCCGAAG 
CCGACTGATA 
TCATCAGCCA 
TTAGAATTCA 
CAGTCAGAAA 
CTCCAGCAAA 
GGGTCCTQTA 
GAAGTTTACA 
AAAATTGTAa 
GGACAGACAC 
GGAAAGTQCT 
CCAGCATGTC 
TATCTTG3TC 
ATCACCTAG 



TGCTAAGCCX 
CCACCAACCA 
ACACAGTATC 
AAGTTGCAGA 
TTTTGTACAT 
GGTTAAGTGA 
TGCCTTATTC 
TGTTTGCTCA 
TAAACATCAA 
GAATTCTACA 
AGCCATTCAG 
GATTGTATGG 
GTCCTGTGAC 
GAGGCTACTT 
TTTTATCAAG 
ATCCCAAAAG 
TGACTTCAGA 
GCTGCAGTCC 
GTCATGGAGG 
CTCAATGTQA 



CGCATTCCAT 
AGAACGGTGG 
CAGAAAAAAT 
ACTCCACAGA 
CATTCTTGGT 
TTTCACAGGC 
CATACCAAGG 
TGGGCTCCAC 
TTTAACGATA 
GATAATAAAG 
TAATGGTCCT 
AAGCTACGTG 
AAAGCAGTGG 
TTCCGGTCAC 
CAGTAAAAGT 
AJCCTTTAATG 
GTGCTCCAGG 
AGGCTGGGAG 

ACAAGTGGAC 



CCCAATTACA 
GCTATOGGGC 
CCACACCAAG 
AAGCATCTGG 
GAXGGGATGA 
TCAGTGCTAC 
AGCAACCCAC 
GATCCAGGCA 
CTGTGTTCAG 
GGGAAAGATT 
TTGGTTGGTG 
TTTGGAGATC 
CAAGAAAAAC 
ATCTTGGGAT 
ATGACCCAGA 
CCTGAGGAAT 
CTCTGTCGAA 
GGGGACTTCT 
AGACCGAACA 
AGAAACATCC 



AGAAAAATGG 
CTCATGACCA 
TTGATTXGAG 
GAGGACAACT 
TTACACTGGA 
GGCTGGATGT 
ACTTCAACAG 
GATGTGCTGT 
ACTCCAATGG 
ATGAAAGTGA 
GATTTGTATA 
GTAATGGGAA 
CACTCTGTCT 
TTGGAOAAGA 
CTCACAATGG 
GCAGAGC-CAC 
ACGGCTACTG 
GCAGAACTGC 
AGTGCCTCTG 
GCAGAGTGAC 



AAAGTTGTAT 
CATTCTTAGG 
AACAGCCAGA 
GCTCTTTGGC 
TGATATGGAA 
GQACACAGAC 
CACCAACCAG 
GGATAGAGAT 
AAAAAACAGA 
GCCATCACTT 
CCGGGGCTGC 
TETCCTAACT 
GGGCACTAGT 
TGAACTAGGT 
AAAACTCTAC 
GGTACAACCT 
CACCCCCACG 
AAAATGTGAG 
TAAAAAAGGA 
CAGGGCAGAC 



90Q 
960 
1020 
1000 
1140 
12 00 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16B0 
1740 
18D0 
I860 
1920 
1980 
2040 



25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



Seq ID WO: 18 Protein sequence 
Protein Accession # : XP_039209 



1 
I 

MMCMLSFKIiIi 
XjI»8GGFMLCG 
SLFHSPEREV 
PDFPRJCQVRG 
ELFILEKEGY 
VSYTTNQKRW 
EDGFLYIIIiQ 
PPEVFAHGLH 
LEFKPFSNGP 
GSCRGYFSGH 
AQTUIfiECSR 
YW3PQCEQVD 



11 
I 

LlLAVAIjGFFE 
GFYPR&SCCI* 
LERDliV L PLl* 
PASNYLDQME 
VKILTPEGEI 
AIGPHDHILR 
DGMITLDDME 
DPGRCAVDRH 
LVGGFVYRGC 
ILGFGEOGDG 
LCHNGYCTPT 
RNIBRVTRAD 



21 
I 

GDAXFQERNE 
RSDSPGLGRTi 
CKDYCKKFFY 
EYDKVEEISR 
FKEPYLDIHK 
WEYTVGHKN 
BMDGLEDFTG 
PTDHJ1NLTI 
QSEHIiYGSYV 
BVYIfcSSSKS 
GKCCCSPGWE 
IT 



31 
I 

GSGARRRRCL 
ENKIPSVTNN 
TCRGHTPGFL 
KHKHKCFCIQ 
LVQSGIKGGD 
FHQVDEjRTAR 
SVLRIjDVDTD 
IjCSDBNGKNR 
FGDRNGHFLT 
MTQTHNGKTjY 
GDFCH.TAKCE 



41 
I 

NGNPPKRIiKR 
TECGKLUBBI 
QTTADEFCFY 
EWSGLRQPV 
ERGLLSliAFH 
VFXEVAKJjHR 
MCNVPYfilPR 
SSARILQIIK 
LQQSPVTKQW 
KIVDPKRPLM 
PACRHGGVCV 



51 
I 

RDERMMSQLE 
KCALCSPHSQ 
YARKDGGLCF 
GALHSGDGSQ 
PNYKKNGKLY 
KHLGGQLLPG 
SNFHFNSTNQ 
GKDYESEPSL 
QEKPIiCLGTS 
PEECRATVQP 
EPNKCIiCKKG 



Sect 3D NO: 19 DMA sequence 

Nucleic Acid Accession #t NM_oi433i-2 

Codtiig sequence: 1..1506 



1 
I 

ATGGTCAGAA 
AACGGGAGGC 
AAGAGGAAAG 
GGAATCTTCA 
ACCATCTGQA 
GGAACAACTA 



GTGATATCCC 
CCTGAACTTG 
AGCATGAGTG 

gcaattctqa 

TTTAAAGACG 
TATGGAATGT 
AACCCTGAAA 
TATGTGCTGA 
AATGCAGTGG 
ATCTTTGTTG 
TTATTCTAXG 
CGCAAGCACA 
TTCTCTGGAG 
GGGCTGGCAG 
TTCAAGGTGC 
CTTT0CCTCT 
GTCCCTGCGT 
TCAGAGAAAA 
TTATQAACTA 
TTTTTACTTC 
CAGTTATTTT 
AACTCTATGT 
GTCTCTGATA 
TCTCtACAAC 
7ATATATGGG 
TTTTCAATTC 
ATTTTACATT 
GCTTTAATGG 
TTAAAGAAGA 
AAAAATCCXT 



11 
I 

AGOCTGTTGT 
TGCCTTCCCT 
TCACTTTACT 
jCCTCTCCTAA 
CGGTGTGTGG 
TAAAGAAATC 
ttgtacgagt 
TGGCATTTGG 
CX3ATCAAGCT 
TCAGCTGGAG 
TAATTATAGT 
CGTXTTCAGG 
ATGCAIATGC 
AAAOCATTCC 
CAAATGTGGC 
CAGTGACCTT 

TTGDQTCTCG 
CTOCTCTACC 
ACCTCGACAQ 
TTGCTGGGCT 
CACTGTTCAT 
ATTCGGACCC 
ATTATCTCTT 
TAACCAGAAC 
ATGGACTTGA 
ATTTTCTGAA 
TATTCATATA 
AGTTATAGAA 
CCTACCTATT 
ATATGTTAGC 
TTTTGTAAAG 
TGAAAAAAAG 
GACATTGCAT 
ATTATACCCA 
GTTTCTAGGG 
QAGAATTTAT 



21 
I 

GTCCACCATC 
GQGCAACAAG 
GAGGGGAGTC 
GGaOGTGCTC 
GGTCCTGTCA 
TGGAGGTCAT 
CTGGGTGGAA 
ACGCTACATT 
CATTACAGCT 
CGGCCGGATC 
CCCTGGAGTT 

aagagattca 
tggctggttt 

CCTTGCAATA 
CTACT11ACG 



31 
I 

TCCAAAGGAG 
GAGCCACCTG 
TCCATTATCA 
CAGAACACGG 
CTATTTGGAG 
♦CACACATATA 
CTOCTCATAA 
CTGGAACCAT 
GTGGGCATAA 
CAGATTTTCT 
ATQC3U3CIAA 
AGTATTAOGC 
TACCTCAACT 
TGTATATCCA 
ACGATTAATG 



CTTTGGCTCC 
AGAOGGTCAC 
AGCTGTTATT 
TCTTTTGAAT 
GATT TATCTT 
CCCAGCTTTG 
AT TTAGTACA 
TATTATATGG 
ATTACAAATA 
GATCTTGGCA 
AGTCTAGAGA 
TTTTAGCATA 
AGTGAATATG 
GGGGTTAGGA 



ATGGTTTTAC 
CATACATCAT 
TGCTTCCCCT 
GAGCACTTTG 
GCTACTGTTT 
TATGTCAGAT 



ATGAACGGTG 
CTTCCAGAAA 
GTTTTGCACC 
TTC3CTC3VGTT 
CGATACAAAT 
TTTTCCTTCA 
GGGATTGGCT 
GACAAGAAAC 
ATACTGGAAG 
ATCTGCCCAA 
ATTACAACTT 
TTCGAACTAA 
CAGTTATTCT 
GAAAAGACTA 
ACCTTCAAAT 
ACACXACAGA 
GATTATGGCA 
TAGATACCAA 



41 

i 

GTTACCTGCA 
GGCAGGAGAA 
TTGGCACCAT 
GCAGOGTGGG 

TTTTGGAAGT 
XACGC CCTGC 
TTTTTATTOW 
CTGTAGTGAT 
TAACCTTTTG 
TTAAAGGTCA 
GGTTGCCACT 
TTGTTACTQA. 
TGGOCATTGT 
CTGAGGAGCT 
ATTTCTCATT 
GTGTGTTTGC 
TGCZCTQCAT 
CTTTGACAAT 
TTGCCAGGTG 
GCCCAGATAT 
CATGCCTCTT 
TCGTCATCAC 
CCAGGTGGTT 
TTGTACCAGA 



ATGAGACACA 
GTTtmTCAT 



TGGTGATAAA 
TTTCTAAGAA 
ATGAGTCGCA 
GACAATTACT 
TGAAGACTGA 
TGTCTATACT 
AAGAGGAGAG 
TTTAGATAAC 
ACTGGGGATT 
TCCAGGAGTT 
TCATTATCAG 



SI 
1 

GGGAAATGTT 
AGTGCAGCTG 
CATTGGAGCA 
CATGTCTCTG 
TGCTGAATTG 
CTTTGGTCCA 
AGCTACTGCT 
ATGTGAAATC 
GGTCCTAAAT 
CAAGCTCACA 
AACGCAGAAC 
GGCXTTTTAT 
ABAAGTAGAA 
CACCATTGGC 
GCTGCTTTCA 
AGCAGTTCCG 
TGTCTOCAGG 
GATTCATGTC 
GATAATGCTC 
GCTTTTTATT 
GCATOOTCCT 
CATGGTTGCC 
TCTGACTGGA 
TAGAATAATG 
AGAAGATAAG 
AAAATAGGGA 
CAAAAGOAGT 
ATTTAGTTAT 
CAATTCTTGA 
ATGTGGTCAT 
GATTTTTCTG 
GTGAAAAGTG 
AAAGAAATTT 
AAACACTCAT 
GTTGAATACA 
ATGTT T AAGT 
GAAGTTTTAG 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
430 
S40 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 , 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



955 



WO 03/042661 



PCT7US02/36810 



5 

10 
15 
20 
25 
30 



TTATCTGTCA 
AGCAAGAGTT 
TACCCCTGAT 
TGAGAGAAAT 
CTACATGCAA 
TGAATTTTOA 
CTTCAGATGA 
AAOAAATGTC 
ATCTAGGCTT 
CTGATAAGAA 
GTTTTGCCAG 
GCACTTTGGG 
CAACATGGAG 
GCTGGTAATC 
GAGGTTGCAG 
CCATCTCCAA 



TTTTTTTTTT 
AGTTTGGTAT 
GAGTCTATCT 
AACCAACAAA 
TGTTAGTAAT 
CAGTTTGTGC 
AACTGTCCAG 
GCTGTAAATA 
TGTCAGTAAT 
GAAAATTGAA 
TATTAGAAAA 
AGGCTGAGQG 
AAACCCCATC 
TCAGCTATTG 
TGAGCCAAGA 



TCACATCAGT 
TAAATCCTCA 
AAACATATGC 
GAAGATGTTC 
TCTGAAGTTT 
ATTTTCTTTA 
ATTAATTAGG 
AGATTTACAA 
TTCCACACCT 
ATGAGAATCT 
TACTGTGAGC 
GGTGGATCAC 
TCTACTAAAA 
AGGAGGCTGA 
TTOCACCACT 



TTGATCAGGA 
TTAGAACAAC 
ATTTTAAGCC 
AAAATAATAG 
TTTAAATTTA 
TACATTTTAT 
AAAAGGCATA 
CTGATGTTTC 
TAATTATCAT 
GTGGATAAGT 
CGGGCATGGT 
CIGAGGTCGG 
ATACAAAATT 
GGCAGGAGAA 
GTACTCCAGC 



AAGTGTATAA 
CACCTGTTTC 
TTCAAATTAC 
TOC CATATCT 
TGGCTATTTT 
ATTCTTCTGT 
TATTAACATA 
TAGAAAATTT 
TCAACTTGCA 
GTTTGTGTTC 

GGCTTACATC 
GAGTTCTAGA 
AGCTGGGCAT 
TTGCTTGAAC 
CTGGGTGACA 



CACATCTTAG 
ACTAATAACT 
ATTATCAAGA 
QTAATCATAT 
TACACGATGA 
TAAAATATCT 
AAAATTGCAA 
CCACTTCTAT 
AAAGAGACAA 
AGAAGATGTT 
TGTAATCCCA 
CCAGGCTGAC 
GGTGGCACAT 
CCGOGAGGCG 
AAGTCAGACT 



Seq id NO: 20 Protein sequence 
Protein Accession = np_Q55146.1 



I 

mvrkfwsti 

GIFISPKOVIj 
LPAFVHVWVE 



YGMYAYAGWF 
NAVAVTFSER 
RKHTPIiPAVI 
FKVPXFIPAL 
SEKITRTTjGI 



11 
I 

SKGGYLQGKV 
QNTGSVGMSL 
XJjI irpaata 

QIFLtTPCKLT 
YLNFVTEEVE 
LLGNPSLAVP 
VIiRFLTMlML 
F8FTCLFMVA 
JIiEWPSSDK 



21 
I 

KGHliPSIiGNK 
TIWTVCGVLS 
VISLAFGRYI 
A1LIIIVPGV 
NPEXTIPIiAI 
IFVALSCFGS 
F8GDLDSLUI 
LSLY6DPFST 
L 



31 
I 

EPPGQEKVQL 
LFGALSYAEL 
WBFFFIQCEI 
MQLIKGQTQNf 
CISMAITIGV 
MNGGVPAV3R 
FLSFARWLFI 
GIGFVITLTG 



41 

I 

KRKVTX.1*RGV 
GTTIKKSQOH 
PELAIKLITA 
FKDAFSGRDS 
YVLTNVAYFT 
liFYVASREGK 
GLAVAGL1YL 
VPAYKLFIIW 



2280 
2340 
2400 
2460 
2520 
2560 
2 £40 
2700 
2760 
2820 
2080 
2940 
3000 
3060 
3120 



51 
I 

SIIIGTIIGA 
YTYILHVFGP 
VGITWMVIiN 
SITKUPLAFY 
TINAEELW^S 
IiPEILBMIHV 
RYKCFDMHRF 
DKKPRWFRIM 



€0 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO; 21 DMA gggufence 
35 Nucleic Acid Accession #: NM_002422.2 
Coding sequence: 64,. 1497 

1 11 21 31 41 51 

40 AGAAGGAGGC AGGCAAGACA GCAAGGCATA GAGACAACAT AGAGCTAAGT AAAGCCAGTG 60 

GAAATGAAGA GTCTTCCAAT OCTACTGTTG CTGTGCGTGG CAGTTTGCTC AGCCTATCCA 120 

TTGGATGGAG CTGCAAGGGG TGAGGACACC AGCATGAACC TTGTTCAGAA ATATCTAGAA 160 

AACTACTACG ACCTCAAAAA AGATGTGAAA CAGTTTGTTA GGAGAAAGGA CAGTGGTCCT 240 

GTTGTTAAAA AAATCCGAGA AATGCAGAAG TTCCTTGGAT TGGAGGTGAC GGGGAAGCTG 300 

45 GACTCCGACA CTCTGOAGGT GATGCGCAAG CCCAOQTGTG GAGTTCCTGA TGTTGGTCAC 360 

TTCAfiAACCT TTCCTGGCAT CCCGAAGTGG AGGAAAACCC ACCTTACATA CAGGATTGTG 420 

AATTATACAC CAGATTTGCC AAAAGATGCT GTTGATTCTG CTGTTGAGAA AGCTCTGAAA 480 

GTCTGGGAAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA QGCTGATATA 540 

ATGATCTCTT TTGCAGTTAG AGAACATGGA GACTTTTACC CITTTGATGG ACCTGOAAAT 600 

50 GTTTTGGCCG ATGCCTATGC CCCTGGGCCA GGGATTAATG GAGATGCCCA CTTTGATGAT 660 

GATGAACAAT GGACAAAGGA TACAACAGGG ACCAATTTAT TTCTCGTTGC TGCTCRTGAA 720 

ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTG AAGCTTTGAT GTACCCACTC 780 

TATCACTCAC TCACAGACCT GACTCGGTTC CGCCTGTCTC AAGATGATAT AAATGGCATT 840 

CAGTCCCTCT ATGGACCTCC CCCTGACTCC CdOAOACCC CCCTGGTAOC CACGGAACCT $00 

55 GTCCCTCCAO AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTOCTT TGATGCTGTC 36Q 

AGCACTCTGA GGGGAGAAAT CCTGATCTTT AAAGACAGGC ACTTTTGGCG CAAATCCCTC 1020 

AGGAAGCTTG AACCTGAATT GCATTTGATC TCTTCATTTT GGCCATCTCT TCCTTCAGGC 1080 

GTGGATGCCG CATATGAAGT TACTAGCAAG GACCTCGTTT TCATTTTTAA AGGAAATCAA 1140 

TTCTGGGCCA tcagaggaaa tgaggtacga gctggaxacc caaoaggcat CCACACCCTA 1200 

60 GGTTTCCCTC CAACCGTGAG GAAAATCGAT GCAGCCATIT CTGATAAGGA AAAGAACAAA 1260 

ACATATTTCT TTGTAGAGQA CAAATACTGG AGATTTGATG AGAAGAGAAA TTCCATGGAG 1320 

CCAGGCTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 1380 

GTTTTTGAAG AATTTGGGTT CTITTATTTC TTTACTGGAT CTTCACAGTT GGAGTTTGAC 1440 

CCARATGCAA AGAAAGTGAC ACACACTTTG AAGAGTAACA GCTGGCTTAA TTGTTGAAAG 1500 

65 AGATATGTAG AAGGCACAAT ATGGGCACTT TAAATGAAGC TAATAATTCT TCACCTAAGT 1560 

CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TGACTCGAGT CACACTCAAG 1620 

GGAaCTTGAG OGTGAATCTG TATCTTGOOG GTCATTTTTA TGTTATTACA GGGCATTCAA 1680 

ATGGGCTGCT GCTTAGCTTG CACCTTGTCA CATAGAGTGA TCTTTCCCAA GAOAAGGGGA 1740 

AGCACTCGTG TGCAACAGAC AAGTGACTGT ATCTGTGTAG ACTATTTGCT TAT TTAATAA 1800 

70 AGACGATXTG TCAGXTGTTT T 

Seq ID MOt 22 Protein sequence 
Protein Accession ffr NP_0O24l3 

75 1 11 21 31 41 51 

1 1 I I I 1 

MKSIjPIUjUj CVAVC 9 AYP h DGAARGKDTG HNXjVQKYT ,KN YYDE>EKDVKQ FVRRKDSGPV 60 

VKKIREMQKF LGLEVTGKUD SDTLEVMRKP RCGVPDVGKP RTFPGXPKHR KTHLTYRIVN 120 

YTPDI»PKDAV DSAVEKAtSV WREVTPLiTFS RI*YEGEADIM ISPAVREHGD FYPFDGPGKV 180 

80 IiAHAYAPGPG UIGDAEFDDD EOWTKDTTQT NI*FJLVAftHEI GH6LGLFKSA KTEALMYPLY 240 

ESIjTDIiTRPR LSQDDINGIQ SLYGPPPD8P ETPLVPTEPV PPEPGTPANC D P ALSFDAVS 300 

TUtGErLIFK DRKFWRKSIiR KLEPBLHIilS SFWPSLPSGV DAAYEVT3KD L.VFIFKGCTOF 360 

WAIRGNEVRA GYPRGZH.TLG PPPXVRKIDA AI8DKEKNKT YPFVEDKYWR FDEKRNSMBP 420 
QFPKQIAEDF PQIDSKIDAV PEEFGFFYFF TG8SQLBFDP NAKKVTHTIiK SUSWLNC 

956 



WO 03/042661 



PCTYUS02/36810 



Seq ID NO: 23 UNA sequence 
Nucleic Acid Accession #r NM_D06528 
Coding sequences 57., 764 

1 11 21 31 41 51 

\ I I i I ! 

GOCGCCAQCC3 GCTTTCTCGG ACGCCTTGCC CAGCGGGCCG CCCGACCOCC TGCACCATGG 60 

ACCCCGCTCG CCCCCTQQGG CTGTCGATTC TGCTGCTTTT CCTGAOGGAG GCTGCACTGG 120 

GCGA1GCT8C TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCXCCTG CCOCTAGACT 1B0 

ACGGACCCTG CCGGGDCCTA CTTCTCCGTT ACTACTACGA CAGGTACACG CAGAGCTGCC 240 

GCCAJSTTCCT gtacgggggc tgcgagggca ACGCCAACAA TTTCTACACC TGGGAGGCTT 300 

GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG 360 

TGGACGACCA GTGIGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

GTGAAAAATT CTTTTCCGGT GGGTGTCACC GGAACCGGAT TGAGAACAGG TTTCCAfiATG 4 BO 

AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA S40 

AAGATGAGGG ACT3TGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600 

CCTGTGATGC TTTCACCTAT ACTGGCTGTG GAGGGAATGA CAATAACTTT GTTAGCAGGG 660 

AGGATTGCAA ACGTQCATGT GCAAAAGCTT TGAAAAAGAA AAAGAAGATG CCAAAGCTTC 720 

JAJ GCTTTGCCAG TAGAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC TTAATATGTC 78D 

ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGGTTGT ATCTGAAGAA TAATATGACA 840 

GCATGAGGAA ACAAATCAlT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 900 

TTCA AAAATT TGGATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA GTCTACCATT 960 

TTTAATTTAT GOTTCAACTG TTTGTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC 1020 

AAATATGACT CACTCATTTC TTGGGGTCGT ATTCCTGATT TCAGAAGAGG AT CATAACTG 10 8 O 

AAACAACATA AOACAATATA ATCATGTGCT TTTAACATAT TTGAGAATAA AAAGQACTAG 1140 
CC 



10 



15 



25 



Seq ID HO i 24 Protein sequence 
3\J Protein Accession fix NP_006519 

1 11 21 31 41 SI 

1 1 1 I ] I 

MDPAHPLGLS ILLJjFLTEAA LGDAAQEPTG NNAEICLIiPI. DYGPCRALLL RYYYDRYTQS 60 

3 J CRQFliYGGCE GNANNFVTWE ACDDACWRIB KVPKVCRIiQV SVDDQCEGST EKYFFNL6SM 120 

TCEKFFBGGC ERHRIENRFP DEATCMGFCA PKKIPSFCY9 PKDEGLCSAN VTRYYFKPRY 180 
RTCDAFTYTG CGGNDNNFVS REDCKHACAK ALKKKKKMPK LRFABRlftKI RXKQF 

Beq ID NOt 25 DNA sequence 
4U Sucleic Acid Accession # = NM_0054Sa.l 
Coding sequence; 1..2S2S 

1 11 21 31 41 51 

I I I I 1 I 

*tD ATQGCTTOCC CGCGGAGGrC CGGGCAGCCA GGGCGGCCGC CGCOGCCGCC AC0GCCGCCC 60 

GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTGCTGC CTCTGGCGCC CGQGGCCTGG 120 

GGCTGGGCGC GGGGOGCCCC CCGGCOGCCG OCCAGCAGCC CGCX3QCTCTC CATCATGGGG 180 

CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGGQGTGT GCTOOODGCC 240 

n GTGGAACTGG CCATCGAGCA GATCCQCAAC GAGTCACTCC WSCfiCCCCTA CTTCCTCGAC 300 

DU CTGCX3GCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTAOGATGCA 360 

ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC 420 

ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACS 480 

CCTGTTCIAG CCGATAAGAA AAAATACOCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 540 

GCGGTGAATC CAGOCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG GOO 

CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AQTTCTGTAT 660 

GGCGAGGACA TTGAGATTTC AGACADOGAG AGCTTCTCCA ACX3ATCCCTG TACCAGTGTC 720 

AAAAAGCTGA AQGGGAATGA TGTGCGGATC ATCCTTGGCC AGTTTGACCA OAATATGGCA 7 B0 

GCAAAAGTGT TCTGTTGTGC ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC 840 

^ ATTCCGGGCT GGTACGA0CC TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC 900 

OU CGCIGCCTCC GGAAGAATCT GCTTGCTGCC ATGGAGGGCT AOAITGGC0T GGATTTCGAG 960 

CCOCTGAGCT OCAAGCAGAT CAAGACCATC TCAGGAAAGA CTCCACAGCA GEATGAQAGA 1020 

GAGTACAACA ACAAGCGGTC AGGCGTGGGG CCCAGCAAGT TCCAGGGGTA CGCC7A0GAT 1080 

GGCATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC 1140 

^_ OGGCAGCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 1200 

OD AATGCCAIGA ACGAGACCAA CTTCTTCGGO GTCACGGGXC AAGTTGTATT CCGGAATGGG 1260 

GAGAGAATGG GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 1320 

GAGTACAACG CTGTGGOCGA CACACTQGAG ATCAXCAATG ACACCATCAG GTTCCAAGGA 13 B0 

TCCGAAGCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT 1440 

CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 1500 

/U TTCTTCAACA TC AAGA ACCG GAATCAGAAG CTCATAAAGA TGTCGAGTCC AXACATGAAC 1S60 

AAC CTTATCA TCCTTGGAGG GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 1620 

GGATCCTTTG TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 1680 

ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGC3\A AGAOCTGGAG AGTCCACGCC 1740 

„^ ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 1800 

/-> GTGGGGGGCA TGCTQCTGAT CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC I860 

CTGCGAAGGA CAGTGGAGAA GTACAQCATO OAGCCGGAGC CAGCAGGACG GGATATCTCC 1920 

ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC I960 

TATGCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA OACCCX3CAAC 2040 

Qn GTCAGCATCC CCGCACTCAA OGACAGCAAG TACATCGGGA TGAQTGTCTA CAAOGTGGGG 2100 

OU ATCATGTGCA TCATCGGGGC CGCTGTCICC TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160 

TTCIGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 2220 

GTGCCGAAGC TCATCACCCT GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC 2280 

CAGTTCACTC AGAATCAGAA GAAAGAAOAT TCTAAAAOGT CCACCTOGfQT CACCAGTGTG 2340 

AACCAAGGCA GCACATCCCG OCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAA3G 24DD 
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40 

45 
50 
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AAGATCACAG 
GAAAAGACCA 
GGAAACTTCA 
AATCCCCAGC 
GATATAAACl" 
CACGCCTACC 
CCCACCGCCA 
CTGTAA 



AGCTGGATAA 
CCTACA.TTAA 
CTGAGAGCAC 
TACAGTGGAA 
CTCCAGAACA 
TCCCATCCAT 
GCCCCCGCCA 



AGACTTGGAA 
ACAGAACCAC 
AGATGGAGGA 
CACAACAGAG 
CATCCAGCGT 
OGGAGGCGTG 
CAGACATGTG 



GAGGTCACCA 
TACCAAGAGC 
AAGGCCATTT 
CCCTCTCGAA 
CGGCTGTCCC 
GACGCCAGCT 
CCACCCTCCT 



TGCAGCXGCA 
TCAATGACAT 
TAAAAAATCA 
CATGCAAAGA 
TCCAGCTCOC 
GTGTCAGCCC 
TCCX3AGTCAT 



GGACACACCA 2460 

CCTCAACCTG 2S20 

CCTCGATCAA 2580 

TCCTATAGAA 2640 

CATCCTCCAC 2700 

CTGCGTCAGC 2760 

GGTCTCGGGC 2820 



9eq ID HQ: 26 Protein sequence 
Protein Accession #s NP_005449.l 



3. 
I 

MASPRRSGQP 
LMPIVTKEVAK 
IKYGPNHIiMV 
AVNPAIIiKIiXi 
KKLKGNDVP.T 
RCLRKNI>LAA 
GIWVIAKTLQ 
ERMGTIKPTQ 
IiYSlI»SALTI 
GSFVSEKTFE 
VGGMLLIDLC 
YAYKGLUOaF 
FCIVALVIIF 
NQASTSELBG 
GNFTESTDGG 
HAYLPSIGGV 



11 
I 

GRPPPPPPPP 
G9IGRGVI.PA 
FGGVCPSVTS 
KHYQVTKKVGT 
1LGQFDQNMA 
WEGYIGVDFE 
RAMEILHASS 
FQESREVKVG 
LGMIMASAPL 
TLCTVRTWIIi 
IIiICHQAVPP 
GCFLAWETHN 
CSTITLCfcVF 



KAHjKHHLDQ 
DASCVSPCV6 



21 
I 

AHLLIiltUiLP 
VELAIEQIRMT 
IIAESLQGMK 
LTQDVQRFSE 
AKVFOCAYES 
PltSSKQIKTZ 
RHQRIQDFNY 
EYNAVADTLE 
PFNIKMHHQK 
TVGYTTAFGA 
IJ&RTVEKYSM 
VSIPALNDSK 
VFKIiITIiRTtt 
KITBUDKDIiE 
NPQLQWNTTE 
PTA3PRHRHV 



31 

i 

LliliPLAPGAW 
ESLLRPYFLD 
ItVQIi9FAATT 
VFNDLTGVkY 
NMYG9KYQWI 
SGKTPQQYER 
TDHTLGRIXL 
IINDTIRFQG 
LIKMSSPYMH 
MFAKTWRVHA 
S PDPAGRD I & 
YIGMSVYNVG 
PtJAATQNRRF 
EVTMQLQDTP 
PSRTCKDPIB 
PPSFRVMVSG 



41 
1 

GWAHGAPRPP 
LKLYDTECDN 
PVLADKKKYP 



IPGWYEPSHW 
EYHNKRSGVG 
MAMNETNFFG 
SEPPKDKTII 
NLIILGGJ4L6 
IFKNVKMKKK 
IRPLLEHCEN 
IMCIJGAAVS 
OjFTQNQKKED 
EKTTYIKQNH 
DINSPEHIQR 
Zi 



51 
I 

ESSPPLSIMG 
AKGLKAFYDA 
YFFRTVPSDN 
SFSNDPCT3V 
EQVHTSANSS 
PSKFHGYAYD 
VTGQWPRUG 
LBQliRKTSIaP 
YASIFI»FGLD 
IIKDQKIlLVI 
THMTIWLGIV 
FIiTRDQPNVQ 
SKTSTSVTSV 
YQELWDILHI3 
RLSLQLPIIiH 



60 
120 
180 
240 
300 
360 
420 
480 
540 
£00 
660 
720 
780 
840 
900 



Seq ID NOs 27 DMA sequence 

Kucleic Acid Accession #: MM_0004S0.1 

Coding sequencer 117.. 1949 

1 11 21 31 41 51 

111111 

CCTGAGACAG AGGCAGCAGT GATACCCACC TGAGAGATCC TGTGTTTGAA CAACTGCTTC 
CCAAAACGGA AAGTATTTCA AGCCTAAACC TTTGGGTGAA AAGAACTCTT GAAGTCATGA 
TTGCTTCACA GTTTCTCTCA GCTCTCACTT TGGTGCTTCT CATTAAAGAG AGTGGAGCCT 
GGTCTTACAA CACCTCCACG GAAGCTATGA CTTATGATGA GGCCAGTGCT TATTGTCAGC 
AAAGGTACAC ACACCTGGIT GCAATTCAAA ACAAAGAAGA GATTGAGTAC CTAAAJCTCCA 
TATTGAGCTA TTCACCAAGT TATTACTGGA TTGGAATCAG AAAAGTCAAC AATGTGTGGG 
TCTGGGTAGG AACCCAGAAA CCTCTGACAG AAGAAGCCAA GAACT3GGCT CCAGGTGAAC 
CCAACAATAG GCAAAAAGAT GAGGACTGCG TGGAGATCTA CATCAAGAGA GAAAAAGATG 
TGGGCATGTG GAATGATGAG AGGTGCAGCA AGAAGAAGCT TQCCOTATGC TACACftGCTG 
CCTGTACCAA TACATCCTGC AOTOGCCACG GTGAATGTGT AGAGACCATC AATAATTACA 
CDTCGCAAGTG 1GACCCTGGC TTCAGTGGAC TCAAGTGTGA GCAAATTGTG AACTGTACAG 
CCCTGGAATC CCCTGAGCAT GGAAGCCTGG TTTGCAGTCA CCCACTGGGA AACTTCAGCT 
ACAATTCTTC CTGCTCTATC AGCTGTGATA GGGGTTACCT GCCAAGCAGC ATGGAGACCA 
TGCAGTGTAT GTCCTCTGGA GAATGGAGTG CTCCTATTCC AGCCTGCAAT QTGGTTGAGT 
GTGATGCTGT GACAAATCCA GCCAATGGGT TCGTGGAATG TTTCCAAAAC CCTGGAAGCT 
TCCCATGGAA CACAACCTGT ACATTTGACT GTGAAGAAGG ATTTGAACTA ATGGGAGCCC 
AGAGOCTTCA GTGTACCTCA TCTGGGAATT GGGACAACQA GAAGCCAAGG TGTAAAGCTG 
TGACATGCAG GGCOGTCOGC CAGCCTCAGA ATGGCTCTGT GAGGTGCAGC CATTCCCCTO 

ctggagagtt caccttcaaa tcatcctgca ACTTCACCTG TGAGGAAGGC ttcatgttgc 

AGGBACCAGC CCAGGTTGAA TGCACCACTC AAGQGCAGTO GACACAGCAA ATCCCAGTTT 
GTGAAGCTTT CCAGTGCACA GCCTTGTCCA ACCCCGAGCS AGGCIACATG AATTGTCTTC 
CTAGTGCTTC TGGCAGTTTC CGTTATGGGT CCAGCTGTGA GTTCTCCTGT GAGC3W3GGTT 
TTGTGTTGAA GGGATCCAAA AGGCXCCAAT GTGGCCCCAC AGGGGAGTGG GACAACGAGA 
AGCCCACATG TGAAGCTGTG AGATGCGATG CTGTCCACCA GCCCCOGAAG GGTTTGGTGA 
GGTGTGCTCA TTCCCCTATT GGAGAATTCA CCTACAAGTC CTCTTGTGCC TTCRGCTOTG 
AGGAGGGATT TGAATTATAT GQATCAACTC AAjCTTGAGTG CACATCTCAG GGACAATGGA 
CAGAAGAGGT TCCTTCCTGC CAAGTGGTAA AATGTTCAAQ CCTGGCAGTT CCGGGAAAGA 
TCAACATGAG CTGCAGTGGG GAGCC03TGT TTGGCACTGT GTGCAAGTTC ©CCTGTOCTG 
AAGGATGGAC GCTCAATGGC TCTGCAGCTC GGACATGTGG AGCCACAGGA CACTGGTCTG 
GOCTGCTACC TACCTGTGAA GCTCCCACTG AflTCCAACRa? TCCCTTGGTA GCTGGACTTT 
CTGCIGCTGG ACTCTCCCTC CTGACATTAG CACCATTTCT CCTCTGGCTT OOGAAATOCT 
TACGGAAAGC AAAGAAATTT GTTCCTGCCA GCAGCTGCCA AAGCCTTGAA TCAGACGGAA 
GCTACCAAAA GCCTTCTTAC ATCCTIXAAG TTCAAAAGAA TCAGAAACAG GTGCATCTGG 
GGAACTAGAG GGATACACTG AAGTTAACAG AQAT!AGATAA CTCTCCXCGG GTCTCTGGOC 
CTTCITGCCT ACTATGCCAG ATGCClTTAT GQCTGAAACC GCAACACCCA TCACCACTTC 
AATAGATCAA AGTOCAGCAG GCAAGGACGG CCTTCAACTG AAAAGACTCA GTGTTCCCTT 
TCCTACTCTC AGGATCAAGA AAGTGTTGGC TAATGAAGGG AAAGGATATT TTCTTCCAAG 
CAAAGGTGAA GAGACCAAGA CTCTGAAATC TCAGAATTCC TTTTCXAACT CTCCCTTGCT 
CGCTGTAAAA TCXTOGCACA GAAACACAAT ATTTTGTGGC TTTCTTTCTT TTGCCCTTCA. 
CAGTGTTTOG ACAGCTGATT ACACAGTTGC TGTCAlAAGA ATGAAXAATA ATTATCCAGA 
GTTTAGAGGA AAAAAATGAC TAAAAATATT ATAACTTAAA AAAATGACAG ATOTTGAATG 
CCCACAGGCA AATGGATGGA GGGTTGTTAA TGGTGCAAAT CCTACTGAAT GCTCTGTGCG 
AGGGTTACTA TGCACAATTT AAXCACTTTC ATCCCTATGG GATTCAGTGC TTCTTAAAGA 
GTTCTTAAGG ATTGTGATAT TTTTACTTGC ATTGAATATA TXATAATCTT CCATACTTCT 
TCATTCAATA CAAGTGTGGf AGGGACTTAA AAAACTTGTA AATGCTGTCA ACTATGATAT 
GGTAAAAGTT ACTTATTCTA GATXACCCCC TCATTGTTTA TTAACAAATT ATGTTACATC 
TGTTTTAAAT TTATTTCAAA AAGGGAAACT ATTGTCCOCT AGGAAGGCAT GATGTTAACC 

958 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 

iboo 
1B60 

1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
25B0 
2640 
2700 
2760 
2B20 



WO 03/042661 



AGAATAAAGT TCTGAGTGTT TTTACTACAG TTGTTTTTTG AAAACATQGT AGAATTGGAG 2BB0 

AGTAAAAACT GAATGGAAQG TTTGTATATT GTCAGATATT TTTTCAGAAA TATGTGGTTT 2940 

CCACGATGAA AAACTT C CAT GAGGCCAAAC GTTTTGAACT AATAAAAGCA TAAATGCAAA 300D 

CACACAAAGG TATAATTTTA TGAATGTCTT TGTTOGAAAA GAATACAGAA AGATGGATGT 3060 

GCTTTGCATT CCTACAAAGA TGTTTGTCAG ATGTGATATG TAAACATAAT TCTTGTATAT 3120 

TATGGAAGAT TTTAAATTCA CAATAGAAAC TCACCATGTA AAAGAGTCAT CTGGTAGATT 31B0 

TTTAACGAAT GAAGATGTCT AATAGTTATT CCCTATTTGT TTTCTTCTGT ATGTTAGGGT 3240 

GCTCTGGAAG AGAGGAATGC CTGTGTGAGC AAGCATTTAT GTTTATTTAT AAGCAGATTT 3300 

AACAATTCCA AAGGAATC7C CAGTTTTCAG TTGATCACTG GCAATGAAAA ATTCTCAGTC 3360 

AGTAATTGCC AAAGCTGCTC TAGCCTTGAG GAGTGTGAGA ATCAAAACTC TCCTACACTT 3420 

CCATTAACTT AGCATGTGTT GAAAAAAAAA GTTTCAGAGA AGTTCTGaCT GAACACTGGC 3430 

AACGACAAAG CCAACAGTCA AAACAGAGAT GTGATAAGOA TCAGAACAGC AGAGGTTCTT 3540 

TTAAAGGGGC AGAAAAACTC TGGGAAATAA GAGAGAACAA CTACTGTGAT CA3GCTATGT 3 6 DO 

ATGQAATACA GTQTTATTTT ClTTGAAATT GTTTAAGTGT TGT AAAT AT T TATGTAAACT 3660 

GCATTAGAAA TTAGCTGTGT GAAATACCAG TGTGGTTTGT GTTTOAOTTT TATTQAGAAT 3720 

TTTAAATTAT AACTTAAAAT ATTTTATAAT TTTTAAAGTA TAT ATT T ATT TAAGCTTATG 3780 
TCAGACCTAT TTGACATAAC ACTATAAAGG TTGACAATAA ATGTGCTTAT GTTT 

Seq ID NOs 28 Protein segueace 
Protein Accession ff : HP_000441.1 

1 11 21 31 41 51 

I I I I I 1 

KIASQFliSAI* TL.VU1.IKESG AWSYNT9THA MTYTJEASAYC QQH.YTHLVAI QNKEEIEYXN 60 

SILSYSPSYY VJIGIRKVMNV WVWVGTO.KPL TEEAKNWAPG EFMNXQXDED CVEIYIXREK 120 

DVGMHNDERC 6KKKLALCYT AACTNTSCSG HGECVETiNK YTCKCDPGFS GLXCEQXVNC 180 

TALESPEHGS LV CSH PIX3N F SYW6SCSISC DRGYUSSME TMQCMSSGEW SAPIPACNW 240 

ECDAVTflPAtf GFVECFQNPG SFPWNTTCTF DCEEGPELMG AQSLQCTSSG NWDNEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGEPTFKSS CNFTCEEGFM LQ3PAQVECT TQGQWTOOIP 360 

VCEAFQCTAL SNPERGYMNC USA3GSFRY GSSCEFSCEQ GFV&KGSKRL QCGPTGEttDK 420 

EKPTCEAVRC DAVHQPPKGL VRCAHSPIGH FTYKSSCAFS CKEGFELYGS TQLECTSQGQ 480 

WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VPGTVCKFAC PEGWTUK3SA ARTCGATGHW S40 

SGLliPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWIiRK CLRXAKKFVP AfiSCQSLESD 600 
GSiTOKPSYII, 

Seq ID NO: 29 DHA sequence 

Nucleic Acid Accession ft: NM_007036 

Coding sequence : 56 * - 610 

1 11 21 31 41 51 

I I ) I 1 I 

CTTCCCACCA GCAAAGACCA CGACTGGAGA GOCGAGCCGG AGGCAGCTGG GAAACATQAA 60 

GAGCGTCTTQ CTBCTGACCA CGCTCCTCGT GCCTGCACAC CTGGTGGCCG CCTGGAGCAA 120 

TAATTATGCG GTGGACTGCC CTCAACACTG TGACAGCAGT GAGTGCAAAA GCAGCCOGCG 180 

CTGCAAGAGG ACAGTGCTCQ ACGACTGTGG CTGCTGCCGA GTGTGCGCTG CAGGGCGGGG 240 

AGAAACTTGC TACCGCACAG TCTCAGGCAT GQATGGCATG AAGTGTGGCC CGGGGCTGAQ 3 00 

GTGTCAGCCT TCTAATGGGG AGGATCCTTT TGGTGAAGAG TTTGGTATCT GCAAAGACTG 360 

TCCCTACGGC ACCTTCGGGA TGGATTQCAG AGAGACCTGC AACTGOCAGT CAGGCATCTG 420 

TGACABGGG9 ACGGGAAAAT GCCTGAAATT CCCCTTC3TC CAATATTCAG TAACCAAGTC 4 B0 

TTCCAACAGA TTTGTTTCTC TCACGGAQCA TGACATGGCA TCTGGAGATG GCAATATTGT 540 

GA3AGAAGAA GTTGTGAAAG AGAATGCTGC CGGGTCTCCC GTAATGAGGA AATGGTTAAA 600 

TCCACGCTGA TCOCGGCTGT GATTTCTGAG AGAAGGCTCT ATTTTOGTGA TTGTTCAACA 660 

CACftGCCAAC ATTTTAGGAA CTTTCTAGAT ATAGCATAAG TACATGTAAT TTTTGAAGAT 720 

CCAAATTGTG ATGCATGGTG GATCCAGAAA ACAAAAAGTA GGATACTTAC AATCCATAAC 780 

ATCCATATGA CTGAACACTT GTATGTGTTT QTTAAATATT CGAATGCATG TAGATTTGTT 840 

AAATGTGTGT OTATAGTAAC ACTGAAGAAC TAAAAATGCA ATTTAGGTAA TCTTACATGG 900 

AGACAGGTCA ACCAAAGAGG GAGCTAGGCA AAGCTGAAGA CCGCAGTGAG TCAAATTAGT 960 

TCTTTGACTT TGATGTACAT TAATGTTGGG ATATGGAATG AAGACTTAAO AGCAGGAGAA 1020 

GATGGGGAGG GGGTGGGAGT GGGAAATAAA ATATTTAGOC CTTGCTTGGT AGGTAGCTTC 10 B0 

TCTA3AATTT AATTGTGCTT TTTTTTTTTT TTTGGCTTTG GQAAAAGTCA AAATAAAACA L140 

ACCAGAAAAC CCCTGAAGGA AGTAAGATGT TTGAAGCTTA TGGAAATTTG AGTAACAAAC 1200 

AQCTTTGAAC TGAGAGCAAT TTCAAAAGGC TGCTGATGTA GTTCCCGGGT TACCTGTATC 1260 

TGAAGGACGG TTCTGGOGCA TAGGAAACAC ATACACTTCC ATAAATAGCT TTAACGTATG 1320 

CCACCTCAGA GATAAATCTA AGAAGTATTT TAOGCACTGG TGGTTTGTGT GTGTATGAAG 13 B0 

GTAAATATTT AT AT ATTTT T ATAAATAAAT GTGTTAGTGC AAGTCATCTT CCCTACCCAT 1440 

ATTTATCATC CTCTTGAGGA AAGAAATCTA GTATTATTTG TTGAAAATGG TTAGAATAAA 1500 

AACCTATGAC TCTATAAGGT TTTCAAACAT CTGAGGCATG ATAAATTTAT TATCCATAAT 1560 

TATAGGAGTC ACTCTGGATT TCAAAAAATG TCAAAAAATG AGCAACAGAG GGACCTTATT 1620 

TAAAGAXAAG TGCTGTGACT TCGGTGAATT TTCAATTTAA GGTATGAAAA TAAGTTTTTA 1680 

GGAGGTTTGT AAAAGAAGAA TCAATTTTCA GCAGAAAACA TGTCAACTTT AAAATATAGG 1740 

TGGAATTAGG AGTATATTTG AAAGAATCTT AGCACAAACA GGACTGTTGT ACTAGATGTT 1800 

CTTAGGAAAT ATCTCAGAAG TATTTTATTT GAAGTGAAGA ACTTATTTAA GAATTATTTC 1860 

AGTATTtACC TGTATTTTAT TCTTQAAGTT GGCCAACAGA GTTGTGAATG TGTGTGGAAG 1920 

GCCTTTGAAT GTAAAGCTGC ATAAGCTGTT AGGTTTTQTT TTAAAAGGAC ATGTTTATTA 1980 

TTGTTCAATA aaaaaqaaca agatac 

8eq TD NO: 30 Protein sequence 
Protein Accession NP_008967_i 

1 11 21 31 41 51 

I 1 I I 1 1 

MKSVMjLTTL LVPAELrVAAN SUNYAVDCPO HCDfiSECKSS PHCSCRTVLDD OGCCRVCAAG 60 

HGETCYRTVS GMEX3MKCGPG LRCQPSWGKD PFGEEFQICK DCPYOTFGMD CRETCUCQSG 120 

ICDRGTGKCIi KFPFFQYSVT KS5WRFVSLT EHDMASGDGH IVREEWKEH AAQSFVMRKM 180 
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Seq ID NOr 32 DMA sequence 
Hucleic Acid Accession #s NM_0O0963 
Coding sequence i 135*.l949 

1 11 21 31 41 51 

\ I I I I 1 

CAATTGTCAT ACGACTTGCA GTGAGCGTCA GGAGCACGTC CAGGAACTCC TCAGCAGOGC 60 

CTCCTTCAGC TCCACAGCCA GACGCCCTCA GACAGCAAaG CCTACCCCCG CGCCGCGCCC 120 

TGCCCGCCGC TCGGATGCTC GCCCGCGCCC TGCTGCTGTG OGOGGTCCTG GCGCTCAGCC 180 

ATACAGCAAA TCCTTGCTGT TCCCACCCAT GTCAAAACCG AGGTGTATGT ATGAGTGTGG 240 

GATTTGACCA GTATAAGTGC GATTGTACCC GGACAGGATT CTATGGAGAA AACTGCTCAA 300 

CACCGGAATT TTTGACAAGA ATAAAATTAr TTCTGAAACC CACTCCAAAC ACAGTGCACT 360 

ACATACTTAC CCACTTCAAG GGATTTTGGA ACGTTGTGAA TAACATTCCC TTCCTTCGAA 420 

ATGCAATTAT GAGTTATGTC TTGACATCCA GATCACATTT GATTGACAGT CCACCAACTT 480 

ACAATGCTGA CTATGGCTAC AAAAGCTGGG AAGCCTTCTC TAACCTCTCC TAT TAT ACTA 540 

GAGCCCTTCC TCCTGTGCCT GATGATTGCC CGACTCCCTT GGGTGTCAAA GGTAAAAAGC €00 

AGCTTCCTGA TTCAAATGAG ATTGTGGAAA AATTGCTTCT AAGAAGAAAG TTCATCCCTG €60 

ATCCCCAGGG CTCAAACATG ATGTTTGCAT TCTTTGCCCA GCACTTCACG CATCAGTTTT 720 

rCAAGACAGA TCATAAGCGA GGGCCAGCTT TCACCAACGG GCTGGGCCAT GGGGTGGACT 760 

TAAATCATAT TTACGGTGAA ACTCTGGCTA GACAGCGTAA ACTGCGCCTT TTCAAGGATG 840 

GAAAAATGAA ATATCAGATA ATTGATGGAG AGATGTATCC TCCCACAGTC AAAGATACTC 900 

AGGCAGAGAT GATCTACCCT CCTCAAGTCC CTGAGCATCT ACGGTTTGCT GTGGGGCAGG 960 

AGGTCTTTGG TCTGGTGCCT GGTCTGATBA TGTATGCCAC AATCTGGCTG CGGGAACACA 1020 

ACAGAGTATG CGATGTGCTT AAACAGGAGC ATCCXGAATG GGGTGATGAG CAGTTGTTCC 1080 

AGACAAGCAG GCTAATACTG ATAQGAQAGA CT AT T AAGAT TGTGATTGAA GATTATGTGC 13.40 

AACACTTGAG TGGCTATCAC TTCAAACTGA AATTTGACCC AGAACTACTT TTCAACAAAC 1200 

AATTCCAQTA CCAAAATCGT ATTGCTGCTG AATTTAACAC CCICTATCAC TGGCATCCCC 1260 

TTCTGCCTGA CACCTTTCAA ATTCATGACC AGAAATACAA CTATCAACAG TTTATCTACA 1320 

ACAACTCTAT ATTGCTGGAA CATGGAATTA CCCAGTTTGT TGAATCATTC ACCAGGCAAA 1380 

TTGCTGGCAG GGTTGCTGGT GGTAGGAATG TTCCACCCGC AGTACAGAAA GTATCACAGG 1440 

CTTCCATTGA CCAGAGCAGG CAGATGAAAT ACCAGTCTTT TAATGAGTAC CGCAAACGCT 1500 

TTATGCTGAA GCCCTATGAA TCATTTGAAG AACTTACAGG AGAAAAGGAA ATGTCTGCAG 15 60 

AGTTGGAAGC ACTCTATGGT GACATCGATG CTGTGGAGCT GTATCCTBCC [ZTTCTGGTAG 1620 

35 AAAAGCCTCG GCCAGATGCC ATCTTTGGTG AAACCATGGT AGAAGTTGGA GCACCATTCT 1680 

CCTTGAAAGG ACTTATGGGT AATGTTATAT GTTCTCCTGC CTACTGGAAG CCAAGCACTT 1740 

TTGGTGGAGA AGTGGGTTTT CAAATCATCA ACACTGCCTC AATTCAGTCT CTCATCTGCA 1800 

ATAACGTGAA GGGCTGTCCC TTTACTTCAT TCAGTGTTCC AGATCCaGAG CTCATTAAAA I860 

CAGTCACCAT CAATGCAAGT TCTTCCCGCT CCGGACTAGA TGATATCAAT CCCACAGTAC 1920 

TACTAAAAGA ACGTTCGACT GAACXGTAGA AGTCTAATGA TCATATTTAT TTATTTATAT 19 bo 

GAACCATGTC TATTAATTTA ATTATTTAAT AATATTTATA TTAAACTCCT TATGTTACTT 2040 

AACATCTTCr GTAACAGAAG TCAGTACTCC TGTTGCGGAG AAAGGAGTCA TACTTGTGAA 2100 

GACTTTTATG TCACTACTCT AAAGATTTTG CTGTTGCTGT TAAGTTTGGA AAACAGTTTT 2160 

TATTCTQTTT TATAAACCAG AGAGAAATGA GTTTTGACGT CTTTTTACTT OAATTTCAAC 2220 

TTATATTATA AGAACGAAAG TAAAGATGTT TGAATACTTA AACACTATCA CAAGATGGCA 2280 

AAATGCTGAA AGTTTTTACA CTGTCGATGT TTCCAATGCA TCTTCCATGA TGCATTAGAA 2340 

GTAACTAATG TTTGAAATTT TAAAGTACTT TTGGTTATTT TTCTGTCATC AAACAAAAAC 2400 

AGGTATCAGT GCATTATTAA ATOAATATTT AAATTAGACA TTACCAGTAA TTTCATGTCT 2460 

ACTTTTTAAA ATCAGCAATG AAACAATAAT TTGAAATTTC TAAATTCATA GGGTAGAATC 2S20 

ACCTGTAAAA BCTTGTTTGA TTTCTTAAAG TTATTAAACT TGTACATATA CCAAAAACJAA 2580 

GCTGTCTTGG ATTTAAATCT GTAAAATCAG ATGAAATTTT ACTACAATTG CTTGTTAAAA - 2640 

TATTTTATAA GTGATGTTCC TTTTTCACCA AGAGTATAAA CCTTTTTAGT GTGACTGTTA 2700 

AAACTTCCTT TTAAATCAAA ATGCCAAATT TATTAAGGTG GTGGAGCCAC TGCAGTGTTA 2760 

TCTCAAAATA AGAATATTTT GTTGAGATAT TCCAGAATTT GT TTATATGG CrGGTAACAT 2B20 

GTAAAATCTA TATCAGCAAA AGGGTCTACC TTTAAAATAA GCAATAACAA AGAAGAAAAC 2680 

CAAATTATTG TTCAAATTTA GGTTTAAACT TTTGAAGCAA ACTTTTTTTT ATCCTTGTGC 2940 

ACTOCAGGCC TGGTACTCAG AJTTTOCTAT GAGGTTAATG AAGTACCAAG CTGTGCTTGA 3000 

ATAACGATAT GTTTTCTCAG ATTTTCTGTT GTACAGTTrA ATTTAGCAGT CCATATCACA 3060 

- TTGCAAAAGT AGCAATGACC TCATAAAATA CCTCTTCAAA ATGCTTAAAT TCATTTCACA 3120 

OU CATTAATTTT ATCTCAGTCT TGAAGCCAAT TCAGTAGGTG CATTGGAATC AAGCCTGGCT 3180 

ACCTGCATGC TGTTCCTTTT CITTTCTTCT TTTAGCCATT TTGCTAAGAG ACACAOTCTT 3240 

CTCATCACTT CGTTTCTCCT ATTTTGTTTT ACTAGTTTTA AGATCAGAGT TCACTTTCTT 3300 

TGGACTCTGC CTATATTTTC TTACCTGAAC TTTTGCAAGT TTTCAQGTAA ACCTCAGCTC 3360 

AGGACTGCTA TTTAGCTCCT CTTAAGAAGA TTAAAAOAQA AAAAAAAAGG CCCTTTTAAA 3420 

65 AATAGTATAC ACTTATTTTA AGTGAAAAGC AGAGAATTTT ATTTATAGCT AATTTTAGCT 3480 

ATCTGTAACC AAOATGOATG CAAAQAGQCT AGTGCCTCAG AQAQAACTGT ACGGGGTTTG 3540 

TGACTGGAAA AAGTTACGTT CCCATTCTAA TTAATGCCCT TTCTTATTTA AAAACAAAAC 3600 

CAAATGATAT CTAAGTA3TT CTCAGCAATA ATAATAATGA CGATAATACT TCTTTTCCAC 3660 

ATCTCATTGT CACTGACATT TAATGGTACT GTATATTACT TAATTTATTG AAGATTATTA 3720 

TTTATQTCTT ATTAGGACAC TATGGTTATA AACTGTGTTT AAGCCTACAA TCATTGATTT 3780 

TTTTTTGTTA TGTCACAATC AGTATATTTT CTTTGGGGTT ACCTCTCTOA ATATTATGTA 3840 

AACAATCCAA AGAAATGATT GT ATT AAGAT TTGTGAATAA ATTTTTAGAA ATCTGATTGG 3900 

CATATTGAGA TATTTAAGGT TOAATGTTTG TCCTTAGGAT AQK3CCEATGT GCTAGCCCAC 3960 

AAAGAATATT GTCTCATTAG CCTGAATGTG CCATAAGACT GACCTTTTAA AATGTTTTGA 4020 

GGGATCTGTG &ATGCTTCGT TAATTTGTTC AGCCACAATT TATTOAQAAA ATATTCTGTG 4080 

TCAAGCACTG TGGGTTTTAA TATTTTTAAA TCAAACGCTG ATTACAGATA ATAGTATTTA 4140 

TATAAATAAT TDAAAAAAAT TTTCTTTTOG OAAGAGGGAG AAAATOAAAT AAATATCATT 4200 

AAAGATAACT CAGGAGAATC TTCTTTACAA TTTTACGTTT AGAATGTTTA AGGTTAAGAA 4260 

AGAAATAGTC AATATGCTTG TATAAAACAC TGTTCACTGT TrrrTTTAAA AAAAAAACTT 4320 

GATTTGTTAT TAACATTGAT CTGCTGACAA AACCTGGQAA TTTGGGTTGT GTATGCGAAT 4380 

GTTTCAGTGC CTCAGACAAA TGTGTATTTA ACTTATGTAA AAGATAAGTC TGGAAATAAA 4440 
TGTCTBTTTA TTTTTGTACT ATTTA 
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Seq ID NO: 32 Protein sequence 
Protein Accession #: ISP_000954 

1 11 21 31 41 51 

111)11 

MLiARALIlLCA VIAI^HTAtfP CCSHPCQNRG VCMSVGFDQY KCDCraTGFY GENCSTPEFI* SO 

TRI1Q.FI.KPT PNTVHYILTH FKGFWEFWNN IPFLRNAIMS YVLTSRSHLI DSPPTYHADY 120 

GYKSWEAFSN LSYYTRALPP VPDDCPTPLG VKGKKQLPDS NEIVEKLI^LR RKFIPDPQGS ISO 

NrtMFAFFAQH FTHQFFKTDE KRGPAFTNGL GHGTOLNHIY GETLARQRKL HLFKDGKMKY 240 

QIIDGEMYPP TVKDTQAEMI YPPQVPEHI.R FAVGQEVFGit VPGIMMYATI WLREHNRVCD 300 

VLXQEHPBWG DEQI>FQTSRIj ILIGBTIKTV IEDYVOHLSG YHFKLKFDPE LLFNKQFQYQ 360 

MRIAAEFNTIj YHHEPI.I*PDT FQIHDQKYMY QQFIYNNSIL LEHGITQFVE SFTRQIAGHV 420 

AGGRNVPPAV QKVSOASIDQ SHQMKYOSFN HYRKRPWLKP YESFHKLTGS KliMSAELEAL 4B0 

YGDIDAVELY PAIiI.VF.KPRP DAIFGETMVE VGAPffiLKGL MGNVIC8PAY WKPSTFGGEV 540 

QFQIINTASI QSJblCNNVXG CPFTSFSVPD PELIXTVTIN ASSSRSGLUD IHPTVIiliKER 600 
SIEL 

9eq ID WO: 33 DNA aeguenCfe 

Nucleic Acid Accession NM_0O15O8-l 

Coding sequence: 1 - - 13 SI 

1 11 21 31 41 51 

1 I 1 I I I 

ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCOCAAA TCATTGATCA CAGTCATGTC GO 

CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 120 

TTCGTGATGG GCCTTCTGGG GAACAGCGTC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 1B0 

AAAGGATACT TGCAGAAGGA GGTQACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 240 

TTGGTOTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300 

ACGTCCAGCT ACACCCTGTC C7TGCASGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360 

GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGCaCT ACATCGCCAT CTGTCACCCC 420 

TTCAGGTACA AGGCTOTGTC SGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 480 

GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 540 

GTGAACGTGC CCAGCCACCG GG3TCTCACT TGCAAOCGCT CTAGCA0OCG CCAOCACGAG 600 

CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACC&ACCTCT CCAGCCGCTG OACCGTGTTC 660 

CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720 

ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCC&GA AGGQCTCGCT GGCCGGGGGC 780 

ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGGGAAGAGA GCAGGACCGC CAGGAGGCAG 640 

ACCATCATCT TGCTGAGGCT GATTGTTGTG ACATTGGCCG TATQCTGQAT GCCCAACCAG 900 

ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGG 960 

GCBTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020 

CGGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 1080 

TGCCGCCTGT CGClGCAGCA OGCCAACCAC GAGAAGCGCC TGCGCOTACA TGCGCACTCC 1140 

ACCACCGACA GOGCCCGCTT TGTGCAGC3C CCGTTGCTCT TOGCGTCCCG GCGCCAGTCC 1200 

TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC COAGCCCClAG 1260 

TCfAAGTCCC AGTCATTGAG TCTOGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320 
AATTCTQCTO CAGAGAATGG TTTTCAGGAG CATGAAGTTT GA 

Seq ID NO: 34 Protein sequence 
Protein Accession #= NP_001499.1 

1 11 21 31 41 51 

I I I I I I 

WASPSLPGSD CSQIICHSHV PEFEVATWIK ITIiILVYIiII FVMGLLGNSV TIRVTQVLQK 60 

KGYLQKEVTD HMVSLACRDI LVFLIGMPME FYSIIWNBIiT TSSYTJjSCKL HTFLFEACSY 120 

ATI*HVUI1*S PSRYIAICHP FRYXAVSGPC QVKLLIGFVW VTSALVALPL LFAKGTBYPI. L80 

VKVPSERGLT CNRSSTRHHE QPETfiNMSIC TNLSSRMTVP Q3SIPGAWV YLWLIiSVAF 240 

HCWNmQVLH KSQKG9IAGG TRPPO/LRKSE SESSRTAKHQ TIIFLRLIW TLAVCWMPWQ 300 

IRRIMAAAKP KHDWTR8YFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVOVLC 360 

CRLSIjQKftNH BKRIiKVBAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420 
SK6QSLCLBS LSPNSGAKPA NSAAEHGFQB HEV 

Seq- ID UOs 35 DNA sequence 

HMcleic Acid Accession ft; KM_0 06475.1 

Coding sequence* 2 n . .2538 

1 11 21 31 41 51 

1 1 1 I I I 

AACAGAACTG CAACGGAGAG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 

TTGCTGCTTA ttgttaaocc tataaacgcc aacaatcatt atgacaaoat cttggctcat 120 

AQTCGTATCA GGGGTCGGGA CCAAGGOCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 180 

ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 

AAAACGACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 

TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT QGQAOCCACC 360 

ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 

TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTOA TATCCGTAGA 4 B0 

GGT3TGGAGA GCAACGTGAA TGTTGAATTA CTGAATCCTT TACATAGTCA CATGATTAAT 540 

AAGAGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT QTATAACAAT 600 

TTGGGGCTTT TCATTAACCA TTATOCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 

ATCCATGGGA ACCAGATTGC AACAAATGGT GTTQTCCATG TCATTGACCG TOXGCTTACA 720 

CAAATTGGPA CCTCAATTCA AOACTTCATT GAAGCAGAAO ATGACCTTTC ATCTTTTAGA 780 

GCAOCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 

TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 900 

GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 

TCTGAGTCTA TTATQGGAGG AGCAGTCTTT QAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
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GGATGTGACG 
GTGACAAATA 
CAAGTTATTG 
GGCTTGGCAT 
GCATTTTCTG 
CACATATTGA 
ATOGGAGGCA 
TGCATGGAGA 
ATCAAGCCAG 
TTCCTCAGCC 
ACATTATTTG 
CTQATACGGG 
TTCATTGGAA 
AAAATC^TTC 
TCTGACATCA 
OACACACCTG 
CAAATTAAGT 
AAAATTATAA 
ATTATCAAAA 
AOACTGATTA 
AAATCACACCA 
GAACGAATCA 
ACAGAAGAAA 
GAAGQTGOTG 
ACACCCGTGA 
6RAGGTOQTT 
AATAACCTGA 
ACAAAGAAGC 
QAAACATQAG 
CACCTTACAC 
AGAAAAATCC 
CCATTGAAAA 
TATCTCTCCA 
TCAAAAGGCT 
CAACTAATTT 
TCTCAAAGGT 
ATTCAGAAAA 



GTGACAGTAT 
ATGGTGTGAX 
AGCTGGCTGG 
CTGCTCTGAG 
ATGATACTCT 
AAGTAAAAGT 
AACAGCTCAG 
AAGGGAGTAA 
CAGAGAAATC 
TACTTGAAGC 
TGCCAACCAA 
ACAAAAATGC 
AAGGATTTGA 
TGAAAGAAGT 
TGACRACAAA 
TTGGAAATGA 
TTGTTCGTGG 
CCAAAGTTQT 
CTGAAGGACC 
AAGAAGGTGA 
AAATCATTGA 
TTACAGQTCC 
CTCTGAAGAA 
ATGGTCATTT 
GGAAGTTGCA 
CTCAGTGAAA 
CCTTAGAAAA 
AATCATCAAA 
GGAAATTGTG 
CCTTTTTCAT 
TTGTCACCAG 
GACCGAGCCT 
TGGGAAGCTA 
TTGCACATTT 
TGTACTCTCA 
TTCAATAAAA 
ACTCAAGATT 



AACAGTAAAT 
CCATTTGATT 
AAAACAGCAA 
GCCAGATGGA 
CAGCATGGTT 
TGGCCTTAAT 
AGTCTTCGTA 
GCAAGGGAGA 
CCTCCATGAA 
TGCAGACITG 
TGATGCTTTT 
TCTTCAAAAC 
AjCCTQGTGTT 
AAATGATACA 
TGGTGTAATT 
TCAACTGCTG 
TAGCACCTTC 
GGAACCAAAA 
CACACTAACA 
AACAATAACT 
TGGAGTGCCT 
TGAAATAAAA 
ATTGTTACAA 
ATTTGAAGAT 
AGCCAACAAA 
ATCCAAAAAC 
TTGTGAGAGC 
TAATTCTGAA 
GAGTTAGCCT 
CTTGACATTA 
ATTCATTACA 
TGTATGTATG 
AGTTATAAAA 
CTATATGAGT 
GAATGTTTGT 
CCATTTTTCA 
TAAGTTAAAA 



GGAATCAAAA 
GATCAGGTCC 
ACCACCTTCA 
GAATACACTT 
CAGCGCCTCC 
GAGCTTTACA 
TATCGTACAG 
AACGGTGGGA 
AAGTTAAAAC 
AAAGAGCTCC 
AAGGGAATGA 
ATCATTCXTT 
ACTAACATTT 
CTTCTGGTQA 
CATGTTGTAG 
GAAATACTTA 
AAAGAAATCC 
ATTAAAGTGA 
AAAGTCAAAA 
GAAGTGATCC 
GTGGAAATAA 
TACACTAGGA 
GAAGAGGTCA 
GAAGAAATTA 
AAAQTTCAAQ 
CAGAAAAAAA 
CAAGTTGACT 
CACAAATTTA 
CCTGTGGTAA 
AAAGTTCTGG 
ATTCAAATCG 
TTATGGATAC 
ATAGGTGCTT 
GGGTTTACTG 
CATATflCTTC 
GATATAAAGA 
AGTGGlTTGG 



TGGTGAACAA 
TAATTCCTGA 
CGGATCTTGT 
TGCTGGCACC 
TTAAATTAAT 
ACGGGCAAAT 
CTGTCTGCAT 
TTCACATATT 
AAGATAAGCQ 
TGACACAACC 
CTAGTGAAGA 
ATCACCTGAC 
TAAAGACCAC 
ATGAATTGAA 
ATAAACTCCT 
ATAAA.TTAAT 
CCGTGACTGT 
TTGAAGGCRG 
TTGAAGGTGA 
ATGGAGAGCC 
CTGAAAAAGA 
TTTCTACTGG 
CCAAGGTCAC 
AAAGACTGCT 
GTTCTAGAAG 
TGTTTATACA 
TCAGGAACTG 
ATATTTTTTT 
AGGAATTGAA 
CTAACTTTGG 
AAGAGTTGTG 
ATAAAATGCA 
GGTGTACAAA 
GTAAATTATG 
TTGCAATGCA 
GAATTACTTC 
ACTTGGGAA 



AAAGGATATT 
TTCTGCCAAA 
GGCCCAATTA 
TGTGAATAAT 
TCTGCAGAAT 
ACTGGAAAOC 
TGAAAATTCA 
CCGCGAGATC 
CTTTAGCACC 
TGGAGACTGG 
AAAAGAAATT 
ACCAGGAGTT 
ACAAGGAAGC 
ATCAAAAGAA 
CTATCCAGCA 
CAAATACATC 
CTATACAACT 
TCTTCAGCCT 
ACCTGAATTC 
AATTATTAAA 
GACACGAGAA 
AGGTGGAGAA 
CAAATTCATT 
TCAGGGAGAC 
ACGATTAAGG 
ACCCTAAGTC 
AAACATCAGC 
TTCTGAATGA 
GAAAATATAA 
AATCCATTAG 
AACTGTTATC 
CGCAAGCCAT 
ACTTTTTATA 
TTATTTTTTA 
TATTTTTTAA 
AAATTGAGTA 



Seq 133 NOi 36 Protein sequence 
Protein Accession. 4t: NP_006466-1 



1 

i 

MIPFT*PMFSIj 
KNWYKKSICG 
KLRBBIEGKG 
NGMriPSMVN 



21 

1 

ANNHYDKILA 
PGYMRMEGHK 
AWDNLDSDIE 
NGWTVNCAR 
SALGRDGHFT 



MKYHIIMTLQ 
IDQVLIPDSA 
VQHIiKLILQ 
ENGAIHIPRE 



FKEIFVTVYT 
TKVEHGEPII 
QBEVTKVTKF 



11 
1 

LLLLXVNPrM 
QKTTVLYECC 
S h'FY FAP SKTi 
WLGLFINHYP 
RAAAtTBDXI. 
CSESIMGGAV 
KQVISlxAGKQ 
HHIUCVKVOL 
IIKPAEKSLH 
ILUUDKNALQ 
ESDIMTTNGV 
TKIITKWEP 
KKYTKIIDGV 
lEGGDGHIiFB SBBIKRlZiQG 



31 
I 

HSRIHGRDQG 
GCPAVLPJDH 
RGLESNVNVE 
I IHGNQI ATK 



QTTFTDLYAQ 
NELYNGQILE 



NIILYBLTPG 
IHWDKLLTP 
KXXVIEGSLQ 



IGCDGDSITV 
LGLASALKPD 
TIGGKQLKVF 
TFXSDLEAAD 
VFIGKGFEPG 
AETPVQNEQJ, 
PIUCTEGPTL 
EERIITGPEI 
DTPVHKT/QAU 



41 
I 

PNVCALQQIL 
VYGTLGIVGA 
LLNALHSHMI 
GWHVlDRVIi 
KLPRGVLBRF 
KG1KMVHKKD 
GEYTLLAPVN 
VYRTAVCI2H 
LKELLTQPGD 
VTMILKTTQG 



KfTRISTGGG 
KKVQGSRHRli 



Seq id NO: 37 DMA sequence 

Jtaclelc Acid Acceooion #i HM_002416 

Coding sequence: 4.D..417 



ATCCAATACA 
TTCCTCTTGG 
AAGGGTCGCT 
GACCTTAAAC 
AAGAATGGAG 
AAGTGGGAGA 
AAGAAAGTTC 
ACCACTTCAC 
TTCCAARGGA 
ATTACTCTGA 
TTGTTAAAGG 
TTAAGGCCAT 
CTCACAACAG 
tATCTGAGGC 
TGAAATTGAG 
CTACAJGOCCT 
ACTGGAGATC 
TTCCATCXTG 



CATATGCTCT 



11 
I 

GGAGTGACTT 
GCATCATCTT 
GTTCCTGCAT 
AATTTGCCCC 
TTCAAACATG 
AACAGGTCAG 
TGAAAGTTCG 
CAATAAGTAT 
GGATGGCATA 
AATTGTAACT 
CTATGATTGT 
GATTTTAGCA 
CTGCCTGGAA 
ACATGTCAGC 
CTGGACCTCA 
CACACACAAT 
ACCAGTGTGT 
CGCGCTCAGG 
TTCTCCATCC 
GATTTATCTG 



21 
I 

GGAACTCCAT 
GCTGGTTCTG 
CAGCACCAAC 
AAGCOCTTGC 
TCTAAACCCA 
OCAAAAGAAA 
AAAATCTCAA 
TCTGTGTTAA 
TAATACAAAG 
AAAGTTAGAA 
CTTTGTTCTT 
ATACCCATGT 
GAGCAGCCCT 
AAQTCCTflAG 
CCAAGCTGCT 
GTGTCTGAGA 
GGCTTTCAGA 
CTGACCACTT 
TACCACAATG 
AGTCAACTCC 



31 
I 

TCTATCACTA 
ATTGGAGTGC 
CAAGGGACTA 



GATTCAGCAG 
AAGCRAAW3A 
CGTTCTOGTC 
AAATGTTCTA 
GCTTATTAAT 
AGTTGATTTT 
CTACCACCCA 
CTACACAGAT 
AGGCrTCCAC 
CCTGTTAGCA 
GTGGCCATCA 
GATTCATGCT 
GOCTCCTOTC 
rATTTCTTTT 
CAGTGCCTTT 
TTTCTCATCT 



41 
I 

TGAAGAAAAG 
AAGGAACCCC 
TCCAOCTACA 
TTGAAATCAT 
ATGTGAAGGA 
ATGGGAAAAA 
AAAAGAAGAC 
TTTTAATTAT 
TTGACTAGAA 
AAGAATCCAA 
CGAGTTGAAT 
GTTCAOCCAA 
GTACTGCAGC 



1080 
liao 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
X6B0 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
222D 
2280 
2340 
2400 
2460 
2S20 
25B0 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



51 
1 

GTKKKYFfiXC 
TTTQfRYSDAS 
NKEMLTKDIiK 
TQIQT9XQDP 
MGOKVASEAL 



NAFSDDTLSM 



WTLFVPTNDA 
SK1FLKEVND 
IQ1KFVRGST 
FRLIKE&ETJ 
ETEETLKKLL 



ACCTCTGTAT 
GATTGTTATT 
TGGCTTTGGA 
TGTTCCCCTT 
CTTCTCTCCA 
TGTCCCCAAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



51 
I 

TGGTGTTCTT 
AGTAGTGAGA 
ATCCTTGAAA 
TGCTACACTG 
ACTGATTAAA 
ACATCAAARA 
TACATAAGAG 
ACCGCTATCA 
AATTTAAAAC 
AOGTTAAGAA 
TTCATCATGC 
CCACATCCCA 
CTCCAGAGAG 
CCAAGCAGTT 
TTGAATCAGC 
GGGTATCACC 
AQCCATGTGA 
TGCTTCAXTC 
GTCfCnCCTGT 
ACCCCACAGA 



60 
120 
1B0 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



962 



WO 03/042661 



AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260 

AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 

CACATGGGTG AACACTCAAT GGTTAAC£AA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1360 

AGATTGTCAG CTCCTTGAGG GCAAfiAGCCA CAGTATATTT OCCTGTTTCT TCCACAGTGC 1440 

CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA IS 00 

TGGCAACCAG ACCATTGTCT CAGAGGAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG IS CO 

CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAjC TCACEACAGG GACCAGGGAT 1620 

GATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAQAA 1680 

GCACGTGGTA AAACACTTGC GGATATTCTG GaCtGTTTTT AAAAAATATA CAGTTTACCG 1740 

AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCftGTGAC CAACCTTTTC 1800 

CCAACCATAC AAAAATTCCT TTTCCCGAAG OAAAAGGGCT TTCTCAATAA GCCTCAGCTT I860 

TCTAAGATCT AACAAQATAG CCACCGAGAT CCTTATOGAA ACTCATTTTA GGCAAATATG 192 0 

AQTTTTATTG TCCGlTTACT TGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACCA 19BD 

TCTCCCATGA AGAAAGGOAA CGGTGAAGTA CTAAGCGCTA GAGOAAGCAG CCAAGTCGGT 2040 

TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 

GGAGGTTCA3 TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 

CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 2220 

TCCCACCCGA ACGTCTTATC TAATCATGAA ACTOCCTAGT TCCTTCATGX AACTTCCCTG 2280 

AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 

GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTOACACA CACGTTATAA 2400 

TCATTTATCA 1ATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 

AAACAGTATT GACTTBTATA CCTTGTAATT TGAAATATTT TCTTTOTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 

Seq IB NOt 38 Protein sequence 
Protein Accession ; NP__O02407 

1 11 21 31 41 51 

I 1 1 I I I 

MKKSGVLPLI, GIILLVLIGY QGTPWRKGR CfiCIS-nffQGT IHLQgT iKTTfl >K QFAPSPSCEK 60 

IEI1ATLKNG VQTCWIPDSA DVKEblKKHE XQVSQKKKQK NGKKHQKKKV liKVRKfiQRSR 120 
QKKTT 

} 

Seq ID HO: 39 DNA sequence 

Nucleic Acid Accession HMJJ06G70 

Coding sequence; 85,-1347 

1 11 21 31 41 51 

I I I I I I 

CCOGCTCGCG CCCTOOGGGC OCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTOCCAGCCC 60 

AGCTCGGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCGGGG 120 

GACGGGCGTC TGCGGCTGGC GCGACTAGCG CTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180 

TCTCCCAOCT CCTCHGCATC CTCCITCTCC TCCTCGGOGC COTTCCTGGC TTCCGCCGTG 240 

TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 300 

CGCACAGTCA AGTGCGTTAA CCGCAAXCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 360 

GTGCGGAACC TCTTCCTTAC CGGCAACCAG CTOGCCCTGC TCCCTGOGGG OGCCTTCGCC 420 

CGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GOGGCAGGCG CCTGGAGGAG 480 

GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTOGACCT CAGCCACAAC 540 

CCACTGGCCG ACCTCAGTCC CTTOGCXTTC TCGGGCAGCA ATQCCAGCGT CTCGGCCOCC 600 

AGTCCCCTTG TGGAACTGAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660 

CGGAGCTTCG JVGGQCATGGT GGTGGCGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720 

CGOCGCTTGG AGCTGGCCAG CAACCACTTC CTXTACCTGC CGCGGGATGT GCTGGCCCAA 780 

CTGCCCAOCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGW3CCT GACCTACGTG 840 

TCCTTCOGCA ACCTGACACA TCTABAAAGC CTCCAOCTGG AGGACAATGC CCTCAAGGTC 900 

CTTCACAATG GCACCCTGGC TGAGTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960 

AACAATCCCT GGGTCTQCGA CXGCCACATG GCAGACATGG TGAOCTGGCT CAAGGAAACA 1020 

GAGGTAGTQC AGGGCAAAGA GCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080 

GTCCTCTTGG AACTCAACAG TGCTQACCTG GACTGTGACC OGATTCTTCC CCCATCCCTa 1140 

CAAACCTCTT ATGXCTTCCT GGGTATTQTT TTAGCCCTGA TAGGGGCTAT TTTCCTCCTG 1200 

GTTXTGTATT TGAACCQCAA GGGGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 1260 

AGGGATCACA TGGAAGGGTA TCATTACAGA tCATGAAATCA ATGOGGACCC CAGATTAACA 1320 

AACCTCAGTT CTAACTCGGA TGTCTGAGAA ATATTAGAQG ACAGACCAAG GACAACTCTG 1380 

CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440 

TAOATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 1500 

YTTCTOGGTG TGTTCTGTTA ATGTAAGACS ATGAACAGTT GTGTATAGTG TTTTACCCTC 1S60 

TTCTTTTTCT TGGAACTCCT GAACAGGTA7 GGAGGQATTT TTCAGGTTTC AGCATGAACA 1620 

TGQGCTTCTT GCTGTCTGTC TCTCTCTCAG TACAGTTCAA GGTGTAGCAA GTOTACCCAC 1680 

ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT TATTCATAAA 1740 

TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGOAGAAAAT AATTGCATCC TATAAACTGC 1800 

CTGGAGAGGT TAGCAOGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA I860 

AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 1920 

TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATQAAAATGT ACTGATTTTT 1980 

TTTTAATAAA CTGCATCGAG ATCGAACGGA CTGAATTGTT AAAAAAAAAA AAAAATAAAQ 2040 
ATTCTTAAAA GAA 

Seq ID NO± 40 Protein segrience 
Protein Acceeoion #: UP_qo6661 

1 11 21 31 41 51 

I ! I 1 I I 

MFGGCSRGPA AGDGRLRLAR LALVlxLGWVS S&SPTSSASS FSSSAPPXAS AV8AQPPLPD 60 

QCPAUCECSB AA&TVKCVNR NLTEVPTDLP AYVRHLFl,XG WQIAVLPAGA FARRPPLAEI* 120 

AAUfliSGSRI* DEVKAGAFKH LPSLRQLDLS HUJPIxADLiSPP AFSGSNASVS APSPIiVEbrL 180 

HHIVPPEDER QHRSPEOMW AALLAGRALQ GUIRLELASN HFIiYTiPRDVL AQLPSLRHU3 240 



963 



WO 03/042661 



PCT/US02/36810 



L SNMS L»VSl»T YVS FRNI»THIa ESfiFTiKDWAL KVLHNGTLAE LQGLPHIRVF LDNNPWVCDC 300 

HHADWVTWUK ETEWQGKDR LTCAYP E KMR NRVLLBLNSA DLDCDPILPP SLQTSYVFLG 360 
IVLAliIGAlF IiLVL,YLNRKG ZKKHMHNIRD ACRDHMEGYH YRYElNADPR LTNLS6NSDV 

5 Seq ID NO-. 41 DMA sequence 

Nucleic Acid Accession tts Eos sequence 
Coding sequence: 1..92? 

„ 1 11 21 31 41 51 

10 | I I I 1 I 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCOGGGGAOG GGOGTCTGOG GCTGGCGCGA GO 

CTAGCGCTOG TACTCCTGGG CTGGOTCICC TCGXCTTCTC CCACCTCCTC GGCATCCTCC 120 

TTCTCCTOCT GQGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 180 

CAGTGCCCCG CGCTGTGOGA GTGCTCCQAG GCAGCGCGCA CAQTCAW3TG CGTTAACCGC 240 

15 AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 

AAGCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 

AGCCTCAGGC ACCTGGACTT AAGTAATAAT TGGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 

CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 

ori AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 

2\J CCCTGGGTCT GDGACTGCCA CATGGCAOAC ATGGTQACCT GOCTCAAGGA AACAGAGGTA 600 

GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCOGGAAA AAATGAGGAA TCGGGTCCTC 660 

TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCOCCATC CCTGCAAACC 720 

TCTTATGTCT TCCTGGGTAT TGTTTTAGOC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 

TATTTGAACC GCAAGOGOAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 040 

25 CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGGATGTCCT CGAGTGA 



30 



35 



Seq ID WOi 42 Protein sequence 
Protein Accession #x Eos sequence 

1 11 21 31 41 SI 

1 I I I I I 

MPOGCSRGPA AGDGRLHLAR LALVLLGWVS S9SPTSSASS FSSSAPFDAS AVSAQFPIiFD 60 
QCPALCBCSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG KQLASNHFLY LPRDVLAQLP 120 
SIiRHI»DLSNH SLVSIjTYVSF SNLTHJjESLH lednauotlh nqtiaelgql FHIRVFLDNN 1B0 
FWVCDCHMAD HVTWLKETEV VQGKDRUTCA YPEKMRNRVL LBLNSADLDC DPILPPSLQT 240 
SYVFLGIVIiA ZiXGAlFliliVZ* YLNRKGIXKW MHNIRDACRD HMSGYHYSYE IKADPRIiTNL 300 
SSNSDV&B 

40 Seq ID WO: 43 DMA sequence 

Nucleic Acid Accession #1 XMJD58173 
Coding sequence; 68. .340 

1 11 21 31 41 SI 

45 1 1 1 I I I 

AGCGCCTTGC CXTCTCT TAG GCXTTGAAGC ATlTTrGXCT GTGCTCCCTG ATCTTCAGGT 60 

CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120 

TGCCCAGAAT CCGACAACAG CTCCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180 

TGATGAAGCC OCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTOC 240 

3U TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TrTTACCCAA 300 

A1GGGTTGGG GATCTCCCGA ATOOTAOAGT GTGTCCCTGA GATGGAATCA GCTTGAGTCT 360 

TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATC CAACTA CTTACCTTGC 420 

CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 
GCGAGCTAAC AT 



55 



70 
75 
80 



Seq ID NO: 44 Protein sequence 
Protein Accession #: NP 477521 



1 11 21 31 41 SI 

50 1 I ] J 1 1 

MKFLAVLVItlr GVSIPLVSAQ WPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60 
ATTAASTTAR KDIFVLPKWV GDLPNGRVCP 

- Seq ID NO: 45 UNA sequence 

65 Nucleic Acid Accession #s NM_0 00 0 95.1 
Coding sequence: 26. .2299 

1 11 21 31 41 51 

I I I i I 1 

CAGCACCCAG CTCCCCGCCA COGCCATGGT CCCOGACACC GCCTGCGTTC TTCTGCTCAC 60 

CCTGGCTGCC CTCGGOGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGATGCTT CQGGAACTGC AGGAAACCAA CGCGGGOCTG CAGGAGGTGC GGGACTGGCT 130 

GOGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACCGGOCT ACOCAGCGTG CGGCCCCTGC TCCACTQCGC 300 

GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 

CCCCTGCCCC GCGGGCTTCA CGGGCAACGG ClCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

CGCCCACCCC TGCTTCOCCC GAGTOCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGQGGCTGG CTTTOGCCAA 540 

GGCCAACAAG CAGGTTTGCA OGGACATCAA COAnTGTGAG ACCGGGCAAC ATAACTGCGT 600 

CCCCAACTTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCOGCCCGT GCCA0CCC33G 660 

CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GOGCaGCQCA CAGCGCTTCT GCCCCGACGG 720 

CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGOGTGTGT CGCGTTGGCr GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 

AGACGGCTTC CCGGACGAGA AGCTGOGCTG CCCGGAGCCG CAGTGOOGTA AGGACAACTG 900 

964 



WO 03/042661 



CGTGACTGTG CCCAACTCAG GGCAGGAGQA TGTGGACCGC GATQGCATOG GAGACGCCTTG 960 

CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

GAAJCCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATBCGTGCG ACAACTGCGG 10SO 

6TCCCAGAA6 AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 12 00 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 

GAAGAGCAAC CCGGATCAGG CGGATGTGGA CGACX2ACTTT GTGGGAGATG CXTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC CAGGAGGACT CAGAOCA0GA TQGCCAGGGT GATGCCTGCG AGGACGACGA 1440 

CGACAATGAC GGAGTOCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCGGGCCA IS 00 

GGAGGACGOG GACAGGOACG GCGTGQGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

GGCCTTCCAG ACAGTCGTQC TGGACCOGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGGGACCCAG GCCTGGCTGT 1*740 

GGGTTACACT GCCTTCAATG GOGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

OGATGACGAC TATGCGGGCT TCATCTTTGG CTAOCAGGAC AGCTCCAGCT TCTACGTGGT I860 

CATGTGGAAG CAGATGOAGC AAACGTATTG GCAGGCGAAC CDCTTCCGTG CTGTGGCCGA 192 0 

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAQGTGCGG CTGCTGTGGA AGGACCCGCG 20 40 

AAACOTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGGCTACATC AGGGTQGGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 

CTTGQACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCSCC AGGAGAACAT 2220 

CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 22S0 

TCAGCTGOGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GGGGCTGQAT BGGGGCTCTG CACCCAGCCC AAGGGGTGQC CGTCCTGAGG GGGAAGTOAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG XGTGCAGGG 

Sec| ID NO: 46 Protein sequence 
Protein Accession #: NP_000OBfi.l 

1 11 21 31 41 51 

I 1 I I 1 I 

MVPDTACVLli LILAALGASG QCQSPLGSDL GPQMLRKLQE TNAALQDVRD Wl^RQ^VBEIT 60 

FLKSITVMHCD ACGMQQSVRT GLPBVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

KGSHCIDVNE OflAHPCFPRV RCXHTSFGFR CEACPPGYSG PTHQGVG1AP AKAHKQVCTD 180 

INECETGQHN CVFKSVCINT RG6FQCGFCQ PGFVGDQASG CQRGAQHFCP EGSPSECHEK 240 

ADCVIiKRDGS RSCVCRVGHA GKGILCTRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 

EDVDRDG1GD ACDPDADGDG VPNHKDNCPL VRNPDQSNTD EDXWGEACDN CHSQKNDDQK 360 

D1DQDGRGOA CDDDIDGDRI RNQADNCPRV PNSCQKDSDG DGIGDACDHC PQKSKPDQAD 420 

VDHDFVGDAC DSDQDQDGDG HQD8HDNCPT VPNSAQEDSB HDGQGDA03D DDDNDGVPDS 480 

KDKCRLVPnP GQEDADRDGV GDVCQUDFDA DKWDKIDVC PENAEVTI/TD FRAFQTWLD 540 

PEGDAQIDPN VfWLNQQRSI VQTMMSDPGL AVGYTAFNGV DFEGTFHVUT VTDDDYAGFI 600 

FOYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KBSTGPGEQI* RNAliWHTGDT 660 

ESQVRUliWKD PRNVGWKDKK GVftWPLQHRP QVGTIKVRFY EBPELVADBN WLDTTMRGG 720 
RLGVFCFSQE HIIWANLRYR CNDTIPEDYE THQIjROA 

Seq ZD NO i 47 DMA sequence 

Nucleic Acid Accession #? HMJJ01S65.1 

Coding sequence: 67-. 3 63 

1 11 21 31 41 5a 

111111 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTOCAGTCTC 60 

AGCACCATGA ATCAAACTGC GATTCTGATT TGCTC3CCTTA TCTTTCTGAC TCTAAGTGGC 120 

ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180 

CCTOTTAATC CAAGGTCTTT AGAAAAACTT GAAATTAXTC CTGCAAGCCA ATTTTGTCCA 240 

CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA .300 

TOGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTASCAAOQ AAATGTCTAA AAGATCTCCT 360 

TAAAACCAGA GGGGAGCAAA ATGGATGCAG TGCTTOCAAG GATGC3ACCAC ACAGAGGGTG 420 

CCTCTCCCAT CACTTGCCTA CATGGAGTAT ATOTCAAGGC ATAATTGTTC TTAGTTTQCA 480 

GTTACACXAA AAGGTOACCA ATGATGGTCA CCAAATCAGC TGCTACTACI CCTGTAGGAA 540 

GGTTAATGTI CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 

GCTCTACTQA GGTGCTATGT TCTTAGTGGA TGTTCTGADC CXGCTTCAAA TATTTCCCTC 660 

ACCimCCCA TCTTCCAAGG GTACEAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 

TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATC1G CTTTTTAAAG AATGCTCTTT 780 

ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTAGCTA B40 

CATACAATTC CAAACACATA CAGGAAGGTA GAAATATClG aaaatgtatg tgtaagtatt 900 

CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 

TTTCAGTGTA CATGGAATAA CATGTAAITA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 

TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 10 BO 
TTTTCAAATA AAAATQAGGT ACTCTCCTGG AAATATTAAG 

Seq ID NO: 48 Protein sequence 
Protein Accession #s 2^^001556.1 

1 11 21 31 41 51 

I 1 1 I I I 

KNQTAIlilCC LIFIiTLSGtQ <5VPX,BRTVRC TCISISNQPV HPRSUEKLEI IPASQFCPKV 60 
EIIATHKKKG KKRCLWPESK. AlKNIiLKAVS KBMBKRSP 

5eq ID HO: 49 DNA aequence 

Hucleic Acid Accession ft: XM_057014 

Coding sequence: 143.. 874 
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a 
I 

GGOAGGGAGA 
CGCGGCGGAG 
CGCTGCCCGG 
CCGCGGOCTC 
CCCCAAGGGG 
AATGTGCTTA 
CATTCCGGGT 
TCTGAGGOAA 
ATTGAATTAT 
AAATAQTGCT 
CTGTCAGCGT 
AGCTATAATT 
CACTTCTTCT 

TTCTOSCATC 
TT1TTTTATT 
CATCTGAATG 
TTTAAATCTA 
TGGTTAGAAT 



TGTACAATTT 
CAACCTTAAA 



11 
I 

GAGGCGCGCG 
CCAGACGCTG 
CAGCCGGGAG 
CTGCTGCTCC 
AAGCAAAAGG 

ACACCTGGGA 
AGCTTTGAGG 
GGCATAGATC 
CTAAGAGTTT 
TGGTATTTCA 
TATTTGGAOC 
GTGGAAGGAC 
ACT TGT TCAG 
ATTATTGAAG 
ATGCCTTGOA 
AAAAGCAAAG 
GCATTATTCA 
ACTTTCTTCA 
TTTTTCTCTT 
GTAAATGTTA 
AAAAAAAAAA 



21 
I 

GGTGAAAGGC 
ACCACGTTCC 
CCATGCGACC 
TGCTGCTGCA 
CGCAGCTCOG 
CAGGAGTGCC 
TCCCAGQTP3 
AGTCCTGGAC 
TTGGGAAAAT 
TGTTCAGTGG 
CATTCAATGG 
AAGGAAGCCC 
TTTGTGAAGG 
ATTACCCAAA 
AACTACCAAA 
ATGGTTCACT 
CTAAATATGT 
TTTTGCTTCA 
TAGTCACATT 
AGTATAGCAT 
AGAATTTTTT 
AAAA 



31 
I 

GCATTGATGC 

CCAGGGCCCC 
GCTGCCDGCG 
GCAGAGGGAG 
TGGTCGAGAC 
GGATGGATTC 
ACCCAACTAC 
TGCGGJU5T3T 
CTCACTTOGG 
AG CTTQAATQ T 
TGAAATGAAT 
AATTGGTGCT 
AGGAGATGCT 
ATAAATGCTT 
TAAATGACAT 
TTACAQACCA 
ATCAAAAGTG 
CTCTCAACCT 
TTTTAAAAAA 
TTATATCTGT 



41 
I 

AGCCTGCGGC 

GCOGCCTCCC 
CCOTCGAGCG 
GTGGTGGACC 
GGGAGCCCTG 
AAAGGAGAAA 
AAGCAGTGTT 
ACATTTACAA 
CTAAAATGCA 
TCAGGACCTC 
TCAACAATTA 
GGATTAGTGG 
TCTACTGGAT 
TAATTTTCAT 
TTTAAATAAQ 
AAGTGTGATT 
GTTTCAATAT 
ATAATTTGGA 
ATATAAAAGC 
TAAATAAAAA 



Bl 
1 

GGCCTOGGAG 
TCCAGCTCCQ 
CGCAGOGGCT 
C3CTCTGAGAT 
TGTATAATGG 
GGGCCAATGG 
AGGGGGAATG 
CATGGAGTTC 
AGATGCGTTC 
GAAATGCATG 
TTCOCATTGA 
ATATTCATCG 
ATGTTGCTAT 
GGAATTCAGT 
TTGCTACCTC 
TTTATGTAIA 
TCACACTGTT 
TTTTTTTAGT 
ATATTGTTGT 
TACCAATCTT 
TTATTTCCAA 



60 
120 

iao 

240 
300 
360 
420 
4.80 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1D80 
1140 
1200 
1260 



Seq ID NO: SO Protein sequence 
Protein Accession fli XP_D57014 



1 11 21 31 41 51 

) 1 I I 1 1 

MRPQGPAA3P QPJbRQ^T ,T,1 ,T, KLQLPAPSSA SEIPKGKQKA QT.RQREWDL YKGMCLQGPA 60 

GVPGRDGSPG ANGIPGTPGI PGHDGFKGEK SWTFHYKQCS WSfiLNYGIDL 120. 

GKIASCTFTK MRSNSALRVL FfiGSLRIiKCR NACCQRWYFT EHGAECSGPL PIEAIIYLDQ 160 

GSPEMNBTIN IKRT8SVEGL CEGIGAGLVD VAIHVGTCfiD YPKGDA6TGW N8VSRIIIEE 240 
TiPK 

Seq id no i 51 DNA sequence 

Nucleic Acid Accession #2 nm_020974 

Coding sequence: 81.. 3 080 

1 11 21 31 41 51 

I I I I I I 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG COGCCACCGC CGGCACTCCG COGCCTCTGC 60 

OCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGSCCHCAA CCGTCCCGGG GCGGCCTGGG 120 

CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCOGTCCCGC 180 

CGGGTCGGGG CCGTGCOGCG GGGCCGCAGG AGGA7GTAGA TGAQTOTGCC CAAGGGCTAG 240 

ATGACTGCCA TGCCGAOGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 

AGCCTGGCTA CCAAGGGGAA GGCA GGCAG T GTGAGQACAT CGATGAATGT GGAAATGAGC 360 

TCAATGGAGG CTGTGTCCAT QACTGTITGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 

TTGATGGCTT GATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 

AGAACAATGG OGGCTGCCAG CATAOCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCIGCT 540 

GCAAGGAGGG GTTTTTCCTG AGTGACAATG AGCACACCTG CATTCACCGC TCGGAAGAGG 600 

GCCTGAGCTG CAlGAATAAG GATCACGGCT GTAGTCACAT CTGCAAQGAQ GCCCCAAGGG 660 

GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

TCTIGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGAjCQAT ACAGCCGATG 780 

GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCAGAC AGATGGGAGG AGCTGCCTTG 840 

AGGGAOAGGA. CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 

ATAAAOSGGT GAAACGGCGG CTGCTCATGG AAACGlGTGC TGTCAACAAT GGAGGCTGTG 960 

AOCGCACCTG TAAGGATACT TCGACAGGTG TCCACTBCAG TTGICCTGTT GGATTCACTC 1020 

TCCAGTTGSA TGGOAAOACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGQAGGTT 1080 

GTGATCATTT CTGCAAAAAC ATCGTBGGGA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140 

AATTATTAAC AGATOAGAAG TCTTQCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 

GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 

OOCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 

AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAQGG GCTCCTQCCC ACRAGTGTGT 1440 

CACCCCGTGT GTCCCTGCAC TGCQGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 

GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 

AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GT7JTCCCGAG GGTCTGCGAC 1620 

CAGCACXACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCXAGGTA AACCTTACA.T 1680 

GCAGCTCTGG CAAGCAAGTC CCAGGAGCGC CTGGCOGACC AAGCACCGCT AAGGAAATGT 1740 

TTATCACTGT TGAGTTTGAG CTXGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1000 

TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA I860 

AGGOCGTCGA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GAOGTGGCTA 1920 

AAAAGCCTCC CAGAACATCT GAACGCCAGG CAOAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 

CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAOAACGCT 2040 

GCATTTTATG TCCAAATGGA AOCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 

GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCOCAGA AGCTTGGAAT ATGTCTGAAT 2160 

GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACGT TGCCAGCTCT 2220 

GTGGCCTGGG CACGTTCCAG CClGAAGCTG GTGGAACTTC CTGCTTCCCC TGTGGAGGAG 2260 

GCCTTGCCRC CAAACATCAG GGAGCTACXT CCTTTCAGGA GTGTGAAACC AGAOTTCAAT 2340 

GTTCACCTGG ACATTTCTAG AACACCACCA CTCACCGATG TATTCGTTOC CCAGTGGGAA 24O0 

CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG OCCAGGAAAT ACTACQACTG 2460 

ACTTTGATOG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 



966 
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GAGATTTCAC 
AGTGTACGTG 



CCAATTCTGT 
CCAGGTCAAA 
TCCAGGTCCC 
GAGATGG ORG 
TCAAGGCTCT 
AGTCCCGAGA 
TTTTGAGACC 
GGITGGTGGG 
CCGTATCAGT 
GAACTTGGTT 
CAGCTTCTCA 
TTGGTCAGCC 
TGTAGTGGAA 
CCGGCCCTCT 
CAAGAGGGGA 
ACTCAGTTTC 
AGTTCTAAGC 
AGCACTTCTG 



TGGGTACATT 
GACCATCAAC 
CATAGAGOAC 
GACAACATAT 
GAAGCTGTGQ 
ATACGTGACA 
GCTCTATGCA 
GTTTGATGTC 
GATGTTTCCA 
TTACAAATGA 
ACAGAGCTGT 
GACTCATTAG 



CTGCTGTGGG 
TAGGTGAGAC 
AGGAGGOCAC 
CTAAGC3GAGC 
GGGAAGGAGA 
TCCACAQCCT 
AGTGCTCGTG 
GAGACAT 



GAATCCCCAA 
CCACCCCCCA 
GACTGTGQGG 
GAAACCTGCC 
ATTCAGTTCA 
TATGATGAGG 
TCTGAGAACC 
CTGGCCCATC 
AGATCGTTCA 
CTCAGCCCAC 
CTTCCTTTCTG 
AGTTCAATTT 
AGCATOGTGG 
CGGATGTCTT 
TCACCTGTCC 
AGAATAAGCT 
CCTCTGCAOT 
CCCCTGCAGG 
TCTCCAGCCT 
AAAAAAAAAA 



ACTACCCAGG 
AGCGCCGCAT 
ACTATCTGGT 
AGACCTACGA 
AGTCCAATGA 
ACTACCAC3GA 
ATCAGGAAAT 
CCCAGAACTA 
TCCGATTGCT 
GTGCCACTCA 
CATGTCAGCA 
TTATAGATAA 
ATGTAGACTG 
GGATAGATCA 
TTCTGGGGTC 
GCTTATTCTG 

cgtgtgcagg 
ctccctccac 

GTGTGATACA 
GCAGAAAGAA 



CAATTACCCA 
CCIGATOGTG 
GATGOGGAAA 
ACGCCCCATC 
AGGGAACAGC 
ACTCATTGAA 
ACTTAAGOAT 
TTTCAAGTAC 
ACGTTCCAAA 
ATACAAATGT 
CAGTCGGGTA 
TACAGATATT 
AQAATGGCTT 
OGGGCTGGCT 
TTACTCCTCC 
AAACTTCAGC 
CTCTGACCAG 
CCACCTTGAG 
AGTTTGATCC 
TTAGAAATAA 



GCCAACACCG 
GTCCCTGAGA 
ACCTCTTCAT 
GCCTTCACCT 
GCTAGAGGGr 
GACATAGTTC 
AAGAAACTTA 
ACAGCCCAGG 
GTGTCCAGGT 
TCTGCTATAG 
TTGCTGCCTC 
TTGGTAAATT 
TGAGTGGCAT 
GAGCTGGACT 
TCAAGGAGTC 
TTCCTCTAGC 
GCAGAACAGG 
ACCTGGGAGG 
CAGGAACTTG 
ATAAAAACTA 



Seq ID NO i 52 Protein sequence 
Protein Accession NP_066025 



1 
1 

MGVAGRtfRPG 
LCQWTPTSYK 
KDGHNClxDVD 
DHGCSHICKE 
PQYKMHTDGR 
3TGVHCSCPV 
SCQDVDECSLr 
TVQSYECQGH 
SSDVTTIRTS 
FGAPGRPSTP 
FHLQL8GMNL 
TFQNEEGQMT CKPCPRPGUfi GALKTPEAWN 
PEAGRT8CFP CGGGLATKHQ GATSFQDCET 
KHNCVSCPGH TTTDFDGSTM ITQCKMRRCG 
PBPKRRJLIV VPEIFLPIED DCGDYLVMRK 
IQPKSNBGMS ARQFQVPYVT YDEDYQHKEE 
IiAHPQNYFKY TAQESREMFP RSFIRLLRSK 

Seq ID NO* S3 DKA sequence 
Nucleic Acid Accession tti NM_01<C 
Coding sequence: 1S7,.1479 



11 

1 

AAWAVIiLttLL 
CSCKPGYQGE 
ECliE5HIGGCQ 
APRGSVACEC 
SCLEREDTVL 
GFTIiOLDGKT 
DRTCDHSCIN 
PGYKIiHWNKK 
VTFKLNBGKiC 
KEWFITVEFB 
DVAKKPPRTS 



21 
I 

IlLPPLLLIiAG 
GRQCEDIDEC 
HTCVNVMG9Y 
RFGFBLAKNQ 
EVTESNTT3V 
CKDIDEOQTH 
HPGTFACAOJ 
DCVEVKGLLP 
SIiKNAKIiFPE 
liETNQKSVTA 



31 
I 



GNELNGGCVH 
ECGCKBGFFI* 
RDCILTCNHG 
VJX3DKRVKER 
NGGCDKFCKN 
RGYTLYGFTH 
TSVSPRVSDH 
QLRPALPEKE 
SCDLSCIVKR 
QGHAEKQCVS 
M8ECGGLOQP 
RVQCSFGHFY 
GELGDFTGYI 
tssshsvtty 
DIVRDGRIjYA 
VSRFXHPYK 



41 

! 

QPQEDVDECA 
DCLNIPGNYR 
SDHQEXC2HR 
NGGCQHSCDD 
LU4ETCAVNN 
IVGSFDCGCK 
CGDTNKCS1N 
CGKSGGGDGC 
SSVKESFRYV 
TEKRLRKAJR 
CRAGTYYDGA 
GEYSADGFAP 
NTTTHRCIRC 
ESPNYPGNYP 
ETCQTYERPI 
SFJJHQE ILKD 



1 
1 

GGGACAGGGC 
GTCCTGOCTG 
CCTACTTCAG 
TTCGTGTGTC 
GJVJ03TCGGCA 
AACAAATXrC 
ATTGCAAGTA 
CGACAGCGCT 
GCCCGCCTCG 
TTOCTCCATG 
CTGTATGCCC 
ATGGACACAC 
GAGTXCAOCT 
CAGTACACCA 
TACACTAGAT 
ACCTACGTTC 
GATTCAGTCC 
ATGATCGGGT 
TACCTGGGGA 
TACAGTTCCT 
GTCAGTATTA 
GCCAGCATTG 
GACAAGTTCA 
CAAAACCOCA 
GCCAATGTAT 
TGCCATAGGT 
ACCCACATCC 
ATGAAGCTCC 
TAGAAACATC 
TTACRAATGT 
AAGAATGGGA 
TATACATGTT 
GAGTAACATT 



11 
I 

TGAGGATGAG 
TAGGCCTGAA 
CCCCTTGGTG 
TGAGTCTCTT 
GAAGTGACAA 
TCAGGOCCAA 
TCTCTAGCAT 
GGATGGACCA 



21 
1 

GAGAACCCTQ 
GGACTTGCCC 
TGAGCAGCTT 

cactgagagg 
gctttccctg 
ttttggtgga 



31 



GOGAC5CCAGA 



AAGTCACTGT 
TCAGAATCAC 



GGCTGAGAGG 
TAGAQCGGTA 
TGGTCTrACA 
CTTCCACTTT 



CCCGCACTTC 
TCTGCTTTAG 
TACAGCAGAT 
CTAATATCAl 
AAATTTOCAG 
AGTTTGTCTT 
GTAATGTTGA 
TTTACTGGGC 
CTTCAACAGG 
AATGGTQCTA 
AACCATTGTT 
AGrCCATTCC 
ACTCAGGGCT 
AGGAGACCAT 
TTTTACTAAA 
TTCTAGTTlX 



GCGGCTGGTG 
CTGGGTGCCA 
GGGAAACAOG 
GACAACTGTT 
GTTGCAGCTG 
GAACGACTCT 
TTTGACCTTA 
GTTTGAGCTT 



CTGCATTGGA 
TCTTO CCAAC 
CTTTGXGTTT 
GGCAGCCAAA 
CAACAGCTCC 



CTCAACATGA 
ATGTGCATCC 
CCTGGCTTTG 
GAACCOGTAC 
AACATGGACT 
TTTGAAGGCA 
GATAC TTACA 
CTCATCCGCC 
G CATGTAA CA 
GAAAGCTGGG 
GTGGGTGGAC 
GTCAGCAGAtT 
CGGAGGAATG 
TEGTCCTGGG 



41 

! 

AGACCGTGCC 
TCAACAACEA 
ACTACAGCCT 
AGGGGAGTCA 
AOAACCTCAC 
AGATAGCGCT 
ACACAGCCAC 
ACAAGAGCTT 
TTGTGGAGTC 
TCTTCTCCRA 
TGGATCTGTC 
GCIATGATGG 



CCGAGAAAAG 
TCACTATTCC 
ATACTACATG 
ACAAGATAAT 
CAAGTGACTG 
CTAAGCTGTG 
TCTTTCATCT 
GTTTATTCGG 
TGGGTAACCC 
TCTCTGCAGT 
TGTTTCTGGT 



ACCAACTGCT 
GGGGCCTTGC 
GATAGGGGQA 
ATCTCCAGCT 
GACTAC3VSTG 
ATGGGCAGGA 
AAACTACTGT 
TATTTTXGAG 
QATGTAAATG 
AAATAATATT 
TAGAAGTCCT 
TAATCAAGGA 
TGaCTCCCTG 
TCAAGTGTCA 
GCTTATAAAA 
TAAAATGAAA 



OGCAGCAGGA 
TTCTGTATTT 
TTTCATTTTG 
TGTTATCAAT 
TCATCAAGGC 
TftGAATATGC 
CAACAAAGQA 
TTAAACGGAA 
ACTTGACAAT 
TTGTTGATTA 
TTOCTTTGAT 
TCAATGTTAA 
GTATTTTAGG 
TGAGTCTTTC 
AG CAT TAT AG 
CATTOCCATG 
GTTTGCATTT 
GAAGTTGTTT 
TACATTGTTG 
TATGGGCTTA 



2560 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
34B0 
3540 
3600 
3660 
3720 



51 
I 

CGLDDCHADA 
CTCFDGFMLA 
SBBGLSCMttK 
TADGPECSCH 
GGCDRTCKDT 
KGFFXJbTDEK 
NGGCQQVCVU 
FbRCHSGIHL 
NLTCfiSGKQV 
TIiRKAVHREQ 
RERCILCPUG 
CQtiCAlGTFQ 
PVGTYQPEFG 
ANTECTWTIN 



KKLIKALFDV 



51 
I 

TTGCGGGGAA 
CCTGGTGATT 
CCACTTGGCC 
GTTCAACGTC 
AGCAGGATAT 
GACTCTGGAC 
CATATACCTC 
CACTCTGOAT 
CAAGAAGTCC 
TGGCACGGTC 
TAAATACC5CC 
AAATGATGTG 



GACAGGAAAT 
CATTTTGGAA 
GATCTCTCTC 
GACCACACTG 

catcgatgtg 

AGTTGCTCAC 
AGTAGAAGAA 
GATCAGCTTT 
GAAAACCAGC 
TTTCACAATT 

ttttatgcta 

ATTTCTTGCA 
CCAAGTGTGC 
TGCTCAAAGA 
QRTCTTGTAA 
GAGCCCAAGA 
ACCTGATATA 
CTAAAGTAAC 
CCTATTTAGG 
TGTCZAATTCA 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7S0 
840 
900 
960 



60 
120 
100 
240 
300 
360 
420 
4BO 
540 
600 
660 
720 
780 
840 
BOO 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16 BO 
1740 
1BO0 
I860 
1920 
1980 
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TTGGAAGTCA 
ATACCACAAC 
AGTGAAOAAA 
GGATTCCCCA 
TTTATTATTA 
GGAAATGGAA 
AGGCXTGCAG 
AAAGTACTGG 
GAGCAACACT 
TGCTCTAATG 
GGCTTAGCTT 
TGAGTGGCTT 
AACAATGAGA 
CCAGCCAGAG 
AGTGTGCCAG 
TTAATTTCTT 
ACCAAGGCTC 
CTTTTCCCAT 
ACAAAATATT 
TATGAGOCAA 
AACCCCACTT 
AATATACCAT 



ATGCACTAAC 
AG AAT TAT CC 
AAATTAGTAG 
TACTGGAAGG 
TACACACATC 
GATTTTTTTG 
AATXGAGTCC 

GTTGACT cag 
CTCCCAGTGG 
ATCAGGAATG 
AAOTAAACTT 
ATGCCGCATG 
CACGTTACAG 
ATGCACATTC 
GGTAAAGGCT 
AGCXTCCTGT 
TAAAAGATGA 
TATACTTCTC 
TCTAACGGGA 
TCATATTTGT 
AAGCATTQTT 
ATTAGCTACC 



TCAATACCAA 
CCAA1TTCCA 
ATCAACAATC 
ACTCTGAGGC 
CATCCTAAAC 
TAACTTGTTC 
ATTTTCTAGC 
AGAGTCGCTG 
CAGATCCCCT 
ATGCTTATTA 
GGCTTTGCTC 
AGCAGGAGCG 
AACCTATGTT 
CTCGGCCAGT 
TCCAG7ITCAG 
TCTAATAAAT 
TTTCCCTTCT 
ACAATTCAGT 
ATGGGTGGGA 
GATTTTTTAA 
TXTATATAAA 
CACG 



GATGAGTTTT 
ATAAGTCCTA 
TAAACAAATC 
TTTATTCCCC 
TATACTAAAG 
TAGAAGTCTT 
TGCCTTTATT 
TCATTCIGTC 
GTATCATTCC 
GAAAACAAAC 
AGATCCCTGA 
TGCTGGCXXZT 
CAGGTTGCGG 
CTCAGCCAAC 
CCrCAGTTAT 
GCACGGCTTT 
GTAACTCCCT 
TTCTATGAGT 



TAAATAATGA 
TCATTQAAAA 
CCTCGGTTCT 
CACTATGCAT 
CCCXTTTCCC 
AATATGGGCT 
CACATAGTGA 
ATTGCTGCTA 
AAGAGGAGCA 
TGCXTGACCC 
TCCTTCCAGC 
GAGTACTGAA 
GTGAGCTGCC 
AGTAOCAAAA 
TTTAGACAAT 
ACCTTTCCIG 
AGAGCCACAG 
T.TQATCACCT 
AAAGAGATGA 



AAAAAGTTTA 
AACAAXGATA 



AAGATGTGAA 



ATATTATTTA 
TTCAAATATA 
AAGATACAAT 
ATCTTATCAT 
ATGCATGGAT 
GTTGCCAT6A 
TGGGGTACTA 
CTCTAACACT 
TTCATCCCTT 
AGGAACAAGT 
TGGTCTGCTC 
CTTTCTGAGT 
CTCTCCAAAT 
GTGATTTTTG 
CTCGCCATCT 
TCAGAAATAA 
GTTCTCATTC 
GATTTTTTTA 
AATGTGGTTG 
TCTGTTCTGA 
CTGTGAAATA 



Seq IS HO: 54 Protein sequence 
Pxoteiri Acc«esion #s NP 05S026 



X 

I 

MHYSIHLAFV 
VQIAX/TfcDIA 
YiVEEiOCSFL 
WGYDGNDVF.F 
NVliYFIIiSTY 
CFIKAIDVYL 
SFKHKISFAS 
I.FPL3FMLAH 



11 
I 

CLSLFTERMC 
9ISSISESNM 
HBVTVGtmfcl 
TWLrRGND3VR 
VFSTFLWLS 
GICF3FVFOA 
IEISSDNVDY 
VFYWAYYMYF 



21 31 

I \ 
IQGSQFNVEV GRSDKLSIiPG 
DYTATIYLRQ HWMDQRLVFE 
RkPSNGTVIiY AIiRITTTVAC 
GLEHLULAQY T1BRYFTLVT 
WSFWr9LD8 VPARTCIGVT 
IiLEYAVAHYS 6LQQMAAKDR 
SDL.TMKT9DK FKFVFREKMG 



41 
1 

FEHKTAGYNK 
GaJKSFTLDAR 
HMDLSKYPHD 
RSQQETGNYT 



51 

I 



GTTKEVEEVS 
RIVDYPTIQN 



Seq ID NO: 55 UNA sequence 

Nucleic Acid Accession #s XM_DB40O7 

Coding g equence : 13B . . 24 0 5 



1 11 21 

I 1 t 

CTCGTGCCGA ATTCGGCACG AGACCGCGTG 

CCAOTGOaCC CGTGTGGAAC caaacctgcg 
GCGGAGACGA AGGCGCAATG GCGAGOAAGT 
TCTCTGTCAC AAATCCCCTT CATGAACTAA 
AAATTAGTCC GAATTGGGAA TCTGGCATTA 
ATCATCTACA ACAGGTTTTC TACCGCTATG 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG 
ACCAGGACCA TCACTCAGAC CACGAGCATC 
ABCATGACTC AGACCACGAG CATCACTCTG 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC 
GT AAAGATCC TAGAAACAGC CAGGGGAAAG 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA 
TCTCTGAAGG AACTCACTTT CTAGAGACAA 
CCAAAGATGT AAGCAGCTCC ACTCCACCCA 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA 
GAAACACAAA TGAAAATCCT CAGGAGTGTT 
GCATGGGCAT CCAGGTTCCG CTGAATGCAA 
TCAACCAAAT TGATGCTAGA TCTTGTCTGA 
CTCCAAAGAC CTATTCATTA CAAATAGCCT 
TCAOTTTCCT GTCTCTGCTG GGGGTTATCT 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG 
TACACCTTCT TCCACAXTCT CATGCAAGTC 
CAATGGAAAT GAAAAOAGQA CCACTTTTCA 
GTGCCTATTT TGATTCCACG TGGAAGGGTC 
TTCTTOTTQA ACATGTCCTC ACATTGATCA 
AGAAGAAACC TGAAAATGAT GATGATGTGG 
CTCAACTTTC AACAAATGAG GAGAAAGTAG 
GAG CAG ACT C ACAAGAGOGC TCCCACTTTG 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC 
GGTGCAAGAA TAAATGCCAT TCACATTTCC 
TTCACCACCA TCATGACTAC CAT CATATTC 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG 
TGacCTGGAT GGTQATAATG GGTGATGGCC 
GTGCTGCTTT TACTGAAGGC TTATCAAGTG 
ATGAGTTGCC TCATGAATTA GGTGACTTTG 
AGCAGQCTGT C CT TT AT AAT GCATTGTCAG 
GAATTTTCAT TGGTCATTAT GCTGAAAATG 
GCTTATTCAT GTATGTTGCT CTGGTTGATA 
GTGACCATGG ATGTAGCCGC TGGGGGTATT 
GTTTTGGAAT TATGTTACTT ATTTCCATAT 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
27fi0 
2820 
28S0 
2940 
3000 
3060 
3120 
3180 
3240 



L VEFIiHVPD T 
TOTCKLQIiES 
RLVLQFBLRR 
GSRTSLPKTW 

itwiihssis 

PSNVDHYSKL 



60 
120 
180 
240 
300 
360 
420 



31 41 51 

I I I 

TTCGCGCCTG GTAGAGATTT CTOGAAGACA 
CGCGTGGGCG CGCCGTGGGA CAACGAGGCC 
TATCTGTAAT CTTGATGCTG ACCTTTGOCC 
AAGCAGCTGC TTTCCOCCAG ACCACTGAQA 
ATGTTGACTT GGCAATTTCC ACAOGGCAAT 
GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 
ATAAGATTAA AAGAATCCAT ATACACCATG 
ACTCAGACCA TGAGCBTCAC TCAGACCATG 
ACCATGATCA TCACTOCCAC CATAATCATG 
TTTGOCCAGA CCATGACTCA GATAGTTCAG 
GAGCTCACCG ACCAGAACAT GCCAGTGGTA 
GTGAAGTGAC CTGAACTGTG TACAACACTG 

tagagactcc aagacctgga aaactcttcc 
gtgtcacatc aaagagccgg gtgagccggc 

GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 
TCAAXGCATC AAAGCTACTG ACATCTCATG 
CAGAOTTCAA CTATCTCTGT CCAGCCATCA 
TTCATACAAG TGAAAAGAAG GCTGAAATCC 
GGC3XTGGTGG TTTTATAGCC ATTTCCATCA 
TAGTGCCTCT CATGAATCGG GTGTTTTTCA 
CCGTTGGGAC TTTGAGTGG1 GATGCTTTTT 
ACCACCATAG TCATAGCCAT GAAGAACCAG 
GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 
TAACAGCTCT AGGAGGCCXG TATTTCATGT 
AACAATTTAA AGATAAGAAG AAAAAGAATC 
AGATTAAGAA GCAGTTGTCC AAGTATGAAT 
ATACAGATGA TCGAACTGAA GGCTAlTTAC 
ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 
AGGAAGTCTA CAATGAATAT GTACCCAGAG 
ACGATACACT CGGCCAGTCA GACGATCTCA 
TCCATCATCA CCACCACCAA AACCACCATC 
AGGAGCTGAA AGATGCOGGC GTCGCCACTT 
TGCAGAATTT CAGCGATGGC CTAGCAATTG 
GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 
CTGTTCTACT AAAGGCTGQC ATGACCGTTA 
CCATGCTGGC GTATCTTGGA ATGGCAACAG 
TTTCTATGTG GATATTTGCA CTTACTGCTG 
TGGTACCTGA AATGCTGCAC AATGATGCTA 
TCTTTTTACA GAATGCTGGG ATGCTTTTGG 
TTGAACATAA AATCGTGTTT CGTATAAATT 
TAAAAAGTTG TCATAGTTTC AGTAGOTCAT 

968 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
9O0 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
150 0 
1560 
1620 
X680 
1740 
1600 
I860 
1920 
1980 
2040 
2100 
2X60 
2220 
2260 
2340 
2400 
2460 



WO 03/042661 



AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA OTTA3TGGGT TTTGTGATTT 252 0 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTAOGTTTTA AT AT TTAAGT 2 580 

TATTCTATCT TQGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 

TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 

TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2 7 GO 

TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2B2 0 

AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 

AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 

CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA CTAATTAGTG TACRTTTAAC TTTGTATAAT ACAGAAATCT 3060 

AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 

TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 

TATTQCCAAG TXATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCXGGTTTA 3240 

CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 

TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 

GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
OATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

9eq ID NO: 56 Protein sequence 
Protein Accession, ft; XP^_084D07 

1 11 21 31 41 51 

I 1 I I 1 I 

MARKLSVIIiI I/TFALgVTNP LHEUCAAAFP CTTEKIBPaW ESGINVDLAI JBTRQYHLQQL 60 

FYRYGENDI8& SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHEH. HSDHEBHSDH 120 

EHHSDHDHHS HHNHAASGXN KHKftLCPDHD SDSSGKDPRCJ BQGKGAHRPE FASGHEWKD 280 

SVSASBVT6T VYNTVSEGTH FE.ETIKTPHP GB2.FPKDVSS STPPSVTSKS HVSRLAGRKT 240 

NESVSEPHKG FMYSRNTNEN PQECPNASKL LTSHGMGIQV P1MATEPNYL CPAIINQIDA 300 

RSCIrlHTSKK KAEIPPKTY3 LQIAWVGGF2 AISIISFI^L LGVILVPLMN RVFFKFLL9F 360 

LVAIJWGTLS GDAFLHLLPH 8HASHHHSHS KEEPAMEMKR GPI*P 9HL SSQ UIEE9AYFDS 420 

TWKGr,TALGG IiYFMFLVEHV LTLIKQFKDK KKKNQKKJPElJ DDDVEIXKQL SKYESQLSTX 480 

EEKVDTDDRT EGYI.RADSQB PSHFDfiQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKHKC 540 

HSHFHDTLGQ SDDIilHHHHD YHHIIiHHHHH QNHHPKSHSQ RYSHEELKDA GVATLAWMVI 600 

MGDGLHHFSD G2AIGAAFTE GtSSGLSTBV AVFCHKLPHE 1VGDFAVKLKA GMTVKQAVLY 660 

NALSAMLAYL GMATGIFIGH YAENVSMWIF AIiTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RHGYFFLQUA GMLLGFG1MI, LISIFEHKTV FRENF 

Seq 2D NO: 57 DNA sequence 

Nucleic Acid Accession #t HM_oi54i9.1 

Coding sequence: 1,, 8487 

1 11 22 31 41 51 

I I I 1 i 1 

ATQOCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT TTGGGGCCAT 60 

CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCGCAG CGAGGTCCAC 120 

TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 1B0 

AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240 

TTGOAGCTAC TTATGATTCA OGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAQA 300 

GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AflCTACAAOA AGCTGAGAGT GATCACAGGA 360 

CAGACCCTOC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 420 

TTTATCCACC CTCAAGCXTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 480 

AATCTCCTCC ACCAGCTGCA CGCCAGCAOC TTCTCCACGT TCACATTTTT GGATTATTTC S40 

AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600 

AGCATGCTTC GGAACATGCG GCTTCTGGAG AATCTTTACT TOCAGGGAAA TCCGTGGACC 660 

TGCGATTOTO AGATGAGATa OTTlTTGGAA TGGGATGCAA AATCGAGAGG AATTCTGAAG 720 

TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 780 

AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 840 

GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 900 

GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960 

AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 1020 

GATQTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 1080 

GTTGOCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 1140 

ATAOCATACT ACAGTGAAGT TCCOGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 1200 

OCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACAGGT 1260 

GTGAGAGCCC AGATTCTTGC AGAACCAQAA TGGGTCATGC AGCCATCCAT AGATATCCAG 1320 

CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 1380 

CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG 1440 

CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 1500 

TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAQA TGGCTCCATC 1560 

CTGAAAGU3C CCATOGATGA CCCAGACW3C AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 1620 

AGGATCAAGT CCATGGAGCC ATCTQACTCA GGCTTGtACC AGTGCATTGC TCAAGTGAG6 1680 

GATGAAAIGG ACCGCATGGT ATATAGGGTA CrTGTGCAGT CTCCCTCXJAC TCAGCCAGCC 1740 

GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 1B00 

GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT I860 

GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 2920 

GTCCAAGTCA GTGATAGTGQ TTACTACVOA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 1980 

CATTTTACGG TGGGAATCAC AGTOACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040 

AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 2100 

QGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160 

GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220 

AGAAGAAAGC TGAAACTCTG GAAGCATTOG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280 

GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCQ 2340 

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400 

GTACCOCOGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAQAABT CACACCACCT 2460 
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TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAAOCAG TGCTGAAGAA 2520 

TCCTCAGCAG ATOTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2380 

GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 

AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACXGAGGA GATAACTTCC 2700 

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAAOCA 2760 

TCTCCTACIC TGCACACATT AGACACAGTC TATGAAAAGC CCACZCCATGA AGAGACGGCA 2820 

ACAGAGGGTT GGTCTGCAGC AGATGT TGG A TCGTCACCAG AGOCCACATC CAGT GAGTAT 2880 

GAGCCTCCAT TGQATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2940 

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATQA AAGAAOACaC CTTTGCACAC 3000 

CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTOCAGXA CATCACAGTT ATTTGAGGAT 3060 

TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120 

ATCCACCTTG TQAAAAQTAG TCTAAGCACT CAAGACRjCCT TACTGATFAA AAAGGGTATG 31SQ 

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGQGAGACCC CACACACTCC 3240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 

GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360 

CTAOACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 342 0 

ACCATGAGCA CTCAOCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480 

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3340 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCIG 3600 

GT TCCTACAG CTTGGGTGGA TAACACAGT T AATACCCCCA AACAGTTQGA AATGGAGAAG 3660 

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720 

CATOGATATA CCOCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 37 HO 

CCAGAAAATA AACATAGAAA CAT TGT TACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 

ACTGTTTCTC TGAAAACTGA GGGCOCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900 

AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960 

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCQCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 

GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 

GAQTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4S00 

ATQTCTTTGG GACAAAjCCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 

GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

OCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA AXTATCAAjCA 46BO 

CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 

TTTQGTAQTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTXCAT 4800 

GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 

CXACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATT3CA CATGTCCAAA 5040 

CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 

GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAOTCCA 5160 

AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCXTTA CCAACAAGAC TCTTTCTTTT S220 

CCAC3U3TTGG GAGTCACCGG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 

GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG 5340 

GACTTTGGCC CTCCGGCACC TCGSTTGTTQ CACACTCCGC AGACCACGQG ATCACCCTCA 5400 

ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC GTTTATAACA 5460 

TCTTCTGTCC AQTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAQTTCTT TGCAGGAGGA 5520 

CCTCCTGCAT CTCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTOOCCA 5580 

CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGG AACAGGAAAA 5640 

CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG GXCTTRTOAC TCOSAATACC 5700 

AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGTTCAAGTA S760 

CAAGATCOAG GCCAGTATAT GTGCACCGCC AGCAAOCTGC ACOQCCTGGA CAGGATGGTG 5 820 

GTCTTQCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG GCTGCCACTA CCAGGACGTC 5BB0 

ACTGTCTACC HGGGAGACAC CATTOCAATG QAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940 

CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 

CGCATCACCC TGCAOQAAAA CCGQACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 

GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCCGGGGCGG ACAGCCTGGC CATCCGCCTG 6120 

CACGTGGCGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 

GCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCOTGCOC 6240 

TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTGGCAGT TCCTCCACGG GAACTTGTTT 6300 

GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 63 60 

' GAGTGCGTGG CCGCCAAGCT GGTAGGCTCC GCGCGCAGGA OGGTGCAGCT GAACGTGCAG 6420 

CGTGCAGCAG CCAACGCGCG CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC 64 SO 

GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 

AGGCTGOOGT CCAAGAGGAT QATGGAOGOG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 

TTTGCCAATO GGACCGTGGT GQTGAAATCA GTGACGGACA AAGATGCCGG AGATTACCTG 6660 

TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 

AAACCGGCCA AGATTGAACA CAAQOAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 67 bo 

CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 

QACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGAGG CACCAAGCGC 6900 

TA1GTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 

GACTACACCT GCTTTQCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 

GTGGTGACAG CGCCCGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 

GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACXTGG 7140 

TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAOAT 7200 

GGCACTCTCC TTATTCAOAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 

AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACQT CCAGCCACCC 732 0 

AAGATCAACG GTAACCCCAA CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT 7380 

CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC OGABGGTGTT ATGGGCTTTT 7440 

cocoAGaara tggttctgcc agctccatac tatggaaacc ggatcactgt ccatggcaac 7500 
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GGTTCCCTGG ACAT CAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560 

CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTSGA GCCCATGGAG 7620 

AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCCGC GGGGAOCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 

GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 

CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGDGT GGCCOGCfiAT 7860 

GCOGCTGGCC ACACGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

GCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TOGAOGCTCC CCAATGGCAT GCATCTGGAG 904 0 

GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 

GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG AGACOGAGTA CGGCCCXTCG 8160 

GTCACCAGCA TCCCCGTGAT TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 8220 

CCGQTCATCT ACACC3CGGCC CGGGAACACC QTGAAACTGA ACTGCATGGC TATGGGGATT QZ80 

CCCAAAGCTG ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340 

GCTGQTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGAC CAT CCAGCATGCC 8400 

ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACT CC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT QTGGATTCCA GAATGATTGC TTAGGAACTG B520 

ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT B580 

GTCACAGTGC ATGGTQGCCT CTGGTGGGTT TCAAGTTGAG GTTQATCTTG ATCTACAATT 8640 

GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTGTGTT TACATCATOC CAOaOGCTTC ATTCAGGGTG TCTGTGCTCT QACTGCAATT 8760 

TTTCTTCTTT TGCAAATGOC ACTCGACTGC CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACOCCAT TGAAGACQCA 8880 

TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940 

TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 90 OO 

CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCCTTTCAA ATCAGACQAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

AT ATAATT T T AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAOAAA ACT3CATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTGC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 94B0 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 

Seq ID HO i 58 Protein sequence 
Protein Accession #t NP_056234.1 

1 11 21 31 41 51 

I 1 I 1 1 I 

MPKRAHWGAL SWLILLWGH PRVAIACPHP CACYVPSEVH CTFRfiLASVP AGIARHVERI 60 

ttLGENSIQAL SETSFAQIiTK IBLmiHBMB IPSIPPGALR OTiSSUQVFKF SYWKLRVITG 120 

QTLQGLSNLM RLHIDHUeaE FIRPQAFNSL TSLRLIiHliEQ KHjIiHQIiHPST FSTFTFIiDYF 180 

HIiSTXRHLYL AJENMVRTIiPA SMDRNMPLLE NIxYLQGNPWT CDCEMRWFLE KDAKSRGIIiK 240 

CKKX7KAYEGG QLCAMCFSPK. KLYKH E IHKLi KDMTCLTCFSI ESPLRQNRSR SlEEEQEOHS 300 

DGGSQIiIEjEK PQLPQWSISL NMTDEHGNMV NMrCDIKKPM DVYKIHLNQT DPPDIDXNAT 360 

VALDFECPHT RENYEKLWKL IAYYBEVPVK LHREIML&KD PRVaYOYRCD ADEEAIjYYTG 420 

VRAQIIiABFB WVMQPSID1Q JjNRRQSTAXK VLIOTJfTQYS QTISTKDTRQ ARGRSVJVMTE 480 

P9GAVQHDQT VUEGGPCQLS QJVKASESPS IFHVLPDG6I LKAFMDDFDS KFSILSSGWL 540 

RIKSMEPGDB GtfYQCXAQVR DEMDRMVYRV LVQSPSTQPA FKDTVTIGKN PGE9VTLPCK 600 

AIAI PEAHLg WI37PNRRIIN DLANTSHVYM LSHGTLBIPX VQVSDSGYYR CVAVHQQGAD 660 

HFTVGITVTK. KGSGLFSKRG BEFGAXAL3R VREDrVEDEG GSGHGDBENT SRRLLHPKDQ 720 

EVFLKTKimA ZHGDXKAKKG KRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKOXNP 7 BO 

ERWADILAKV RGKNLPKGTE VPPItlJCTTSP PELfiLEVTPP FPAVHPPSAS PVQTVTSAEE 840 

QSADVPIAGE EEHVLGTISS ASMGLEHNHN GVXLVEPEVT STPLEEWDD LBEKTEEITE 900 

TBGDLKGTAA PTLISBPYEP BPTLHTLDTV YEKPTHEETA TEGWSAADVG BSPBP TSSEY 960 

EPPUlAVgiiA ESEPMQYFDP DIiETKSQPDE DKMKEDTFAH LTPTPTIHVK DSSTSQLPED 1020 

3TIGEPGVPG QSHLQGLTJDN IHLVKSSLST QDTLIiXKKGM KEMSQTLQGG NMLEGDPTHS 1080 

RSSESEOQES KSITLPDSTL QIMSSMSFVK KPABTTVGTIi liDKDTTTVTT TFRQKWAP33 1140 

TMSTHPSRKR PJR3PRRLRPN KFRHRHKGTP PTTFAPSETF STQPTQAPDI KieSQVESSL 1200 

VPTAKVDHTV NTPKQLEMBK NAEPTSK3TP RRKHGKRPNK HRYTPSTV3S RASGSKPSPS 1260 

PBHKHRNIVT PSSETltLPR TVSWCTEGPy DSLDYMTTTR KIYSBYPKVQ ETLPVTYKPT 1320 

SDGKEIKDDV ATNVDKEKfiD 1LVTGE6ITN AlPTSRSLVS TMGEFKEBSS PVGFPGTPTW 1380 

HPSRTAQPGE LQTDIPVTTS QENX.TDPPLL KBLBDVDFTB EFLSSLTVST PFHQEEAGSS 1440 

TTL8SIKVEV ASSQAETTTL DQDHLETTVA 1LL9ETHPQN HTPTAARHKE PASSSPSTIIj 1500 

MSLGQTTTTK PAIiPSPRISQ ASRDSKEWVF LNYVGNPETE ATFVKNEGTQ HMSGPKELBT 1560 

PSSDHDAFSHj &TKLEX»EKQV FGSRSLPRGP DSQRQDGRVH ASHQUTEVPA KPrLPTATVR 1620 

LPEM3TQgA5 RYPVTSQBPR HWTNKPE1TT YPSGALPEUK QFTTPRLSfiT TIPLPLHMSK 1680 

PSIP6KFTDR RTDQPHGYSK VFGNMNIPEA RNPVGKPPSP RJPHYSHGRL PFPTMKTIiSF 1740 

PQIX3VTRRPQ IPT8PAPVMR ERKVIPGSYK RIHSH8TFHL DFGPPAPPLL HTPQTTGSPS 1800 

TNLQNIPMVS STQSSZSPrT SSVQSSGSFH Q9SSKFFAGG PPA3KFWSLQ EKPQILTKSP I860 

QTVSVTAETD TVPPCRATGK PKPPVTHTKV STGALMTPKT RIQRPBVLKM GTLVIRKVQV 1920 

QDUGOYMCTA SNLHOLDRMV VI1I2SVTVQQP QIIASHYQDV TVY1/3DTIAM ECLAKGTPAP 19 BO 

QISWIFPDRR VWQTV8PVE8 RITLHENRTL SIKRASPSDR GW3QCVA6NA AGAD6LAIRL 2040 

HVAALPPVIH O.EKLrENlSU*P PGIiSIEIHCT AKAAPLPSVR KVLGDGTQIR PSQPUSGNLF 2100 

VFPUGTLYIR UTLAPKOSGRY ECVAANLVGS ARRTVQLHVQ RAAANARITG TSPRRTDVRY 2160 

GGTI1KLDC8A SGDPWPRlLW RLPSKRMIDA LFSFDSSIKV PAHOTIiWKS VTDKDAQDYL 2220 

CVARNXVGDD YWLRVDWM KPAKTEHKEE MDHKVFYGGD LKVDCVATGL PKPBISWSLP 2280 

EGSLVNSFMQ 8DDSGGRTSR YVVFNNGTLY FNEVGMREEG DYTCFAENQV GKDEMHVRVK 2340 

WTAPATIRN KTYIiAVQVPY GDWTVACKA KGEPMPKVTH I»SPTNKVIPT SSEKYQIYQD 2400 

GTZiIiIQKAGR 8DSGNYTCLV RNSAGEDRJCT VWIHVNVQPP KZMGNPHPIT TVREIAAGGS 2460 

PJd.IDCKAEG IPTPRVLWAF PE0WLPAPY YGNRlTVEGU GSLDIRSLRK SDSVQLVCMA 2520 
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5 



RNEGGBARLI 
GTDIjQSGQOL 
KQYHNLVSII 
ASVEDRGTYV 
PKADITWELP 
KTTYIHVF 



VQLTVLEPME KPIFHDPI8E KITAMAGHTI SLNCSAACTP TPSL.VWVLPN 2580 

QRFYHKADGM LHISGIiSSVD AGAYRCVARN AAGHTERLV3 LKVGUCPEAN 2640 

NGETIjKLPCT PPGAGQGRF8 WTLPNGMHUE GPQTLGRVSL LDNGTLTVRE 2700 

CRMETEYGPS VTSIPVIVIA YPPRITSKPT PVIYTHPGHT VKLNtMAMGI 2760 

DKSHLKAGVQ AM.YGNRFLH PQGSLTIQHA TQRDAGFYKC KAKNILGBDS 2820 



Seq ID NOt 59 DNA sequence 
NVicleic Acid Accession #j Eos sequence 
10 Coding sequence i 1..5001 

X 11 21 31 41 SI 

I I I I \ I 

. ATGCCAGGCA CAAAACTAAC CCQAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60 

15 ACCTCTCAAG AGGACGAATT ggatgtacct gacgacatca gggtccgggt tatgtcatct 120 

CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 160 

TCAAGACAGT ACACCGTGCG CTATCGAGAG AAQGGGOAAT TGGCCAGGTG GGATTATAAG 240 

CAGATCGCTA ACAGQCGTGT GCTGATTGAG AACCTGATTC CAGACACIGT GTATGAATTT 300 

GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 360 

20 ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AOTCAATGGC 420 

AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGCTACCAQ AGACTGAGGG GAAAGTGAAA 480 

GTCTGTCTGC TGGACACAJGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGOCAAATCA 540 

TTTCAGAATA CATTCTTTCA TACGCCCCGO CTCTCAAACC AXTTGGAGCA AAGTCCCTCA 60 0 

_ _ CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT aCAQCCTGOG GAACGCTATC 660 

25 TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCIGGGACC TCACTCCAAA GCCTTCATTG 72 0 

TGGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGXTGTCT AGCTTACCTG 780 

ATCGATATCC AAACCAAACA AOTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGOA B40 

OCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGOATA TXGGCGGCTT TTCCTTCATT 900 

ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 

30 acCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 1020 

CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTOT GCCTGCTTCT 1080 

CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT QGATCTTCAG 1200 

TCQACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 12 GO 

35 TCAGACACCC AAGACCAOAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 1320 

OTTGCTCCOG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTTGGCCACG CAGCCCOGCC CAGGGQOQCC CCCCTCGGCT 1440 

TOGGCCTCTC CTGCCCACCA CGCGTCCACC CAQQGCACCT CTCATCGTCC TTCCCTGCCT 15 00 

GCCAGCTTGA ATGACAACOA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

40 CTGCACCCCA AGGGCGCCTT CGCCCAGCCC CCGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGOG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GCGCAAAGCC AGCCTCGCCG 1680 

GCaOGGAGGA CCCCOCATTC AGGGGCCGCA GAGGAAGATT CCAGTGGCTC AGCCCCACCC 1740 

rCAAGACTTT CTGCACCCCA TOGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGOACG CCGCAGCCAC CAACTCCAAT I860 

45 GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1520 

GTCTCTGAGG GAGCGGAGGC TTCXGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GOTCTCCAAG CAGGTTCAGC 2100 

A ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCC5CQA 2160 

50 GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 

GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGQACTTAA GGAGAAGCCC GCAGAGAGGG 2340 

GCCAGCCTGC ATCGGAAGOA ACCCATCCCA GAGAACCCCA AATCCACAGG GG (ZAGAT ACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 

55 OACGCaOACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTOCACAGA CGGCCRCGCG 2520 

TCCCCTGCTC GTCCTCCCGC AGCACOGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 25HO 

ACACCCGGCC GGQCCCCAGA ACAGCAGCCC CCTCCTCCCG TCTCCACGTC CCAGCACCAC 2640 

O0GGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTOGC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC OGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 

60 ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTGCAGA ACCAGGACGA GGATGCCCAG 2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGQCCCAGG ATGTGCGGGC CCOCGCGCAC 28 B0 

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCG CTTCCCAAGC ACCAGCAGGT GQAGTCTCCC 2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCOCCGCC 3000 

AGGCCCAGCX: GACCOGGGGG CCCCCAGTOC CGCGCCCGGG TCCCCAOCAG GGCAGCGCGG 3060 

65 GGGAAGTGGG AGCCTCCTTC CAAGOGOCXX: CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GOGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTOCCA GGGGCGGCAA AGACGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGOCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 

CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCRGGCA GCTCCCCCAG GGCCTCCCAC 3360 

70 GTCCCTTCCC GACCGCCGCC TCGCAGGGCT GCCACOGTGA GCCCCGTCGC GGGCACCCAC 3420 

CGCTGGCCBC GGTACACCAC GCGCGCCCCV OCTGGCCACT TCTCCACCAC COCGATGCTG 3480 

TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCX3TAACC CTCTCTCCCG ACAGCCTGCC 3540 

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATO TAGAAGGGAA AGTCCTTCCT 3600 

_ _ GGTAGTAATG GAAAACCGAA TGOACAGAGA ATTATCAATG GCCCTCAAGG AACAAAQTGG 3660 

75 GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAQ GAAGGTACCT CCAAGATTCA 3720 

CATGGAAATC CTCTTCQGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780 

GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG GGCAGGGGCG ACATGGGACA 3840 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGQAAAGCC GCTGGTGGGC 3900 

_ _ TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 

80 ACGACGDCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTTGCCACCAC CKK3CAGCCC 4020 

ACCACTACTA C3ACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 40 B0 

CaCACGACCA CCAGGCGTCC AACAACCACA GTCOGAACCA CTAC3GCGGAC AAOCACCACC 4140 

ACCACCCCCA AACCCACCAC TTCCATCCCC AJCCTGTCCCC CTGGGACCTT GGAACGGCAC 42 OO 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 

972 



WO 03/042661 



PCT/US02/36810 



10 
15 
20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GATGAGTTCT 
TATGATGAAG 
ACTGCTACCA 
GAATTTGATC 
GACCCATCAG 
GATGAAATCA 
AACATCACCG 
GCCACCCCAG 
ATCAGGAACA 
AAGCCCAACA 
ATCAGCCCTT 
CX3U3GOGGTG 
GCCATGGACG 
GTAATTCACT 
TTQGAGACAG 
GAAGAACAGG 
AGTATCGTCA 
TGGGCTGGTA 
CTGCAAGCTT 
CAGCCAGCAT 
CATTCTGGTC 
TTCAGTTTTG 
OCAGAGACAT 



CAGGCTTGGA 
ATTATGAATT 
CACCGAGGGT 
TGGCTGGAAG 
CCCCGTGCTC 
TCCGCAATGA 
TGGTGGCCGT 
G AGAT T TGGI 
AGTTTTCCAC 
CGAGGTATTA 
CGGTCTCATT 
AGCTATCTGG 
GCAATATGTG 
GAGGTATAAA 
CTGGGGAAGA 
GCCTCAGTCC 
GGAGCCTGTC 



TGCTACATGC 

tcaggtccct 

ATATCCTACT 
TTATTTGTAA 
TGTCTGCAAG 
AAAAAAA 



GCCCTGCCCA 
GCTCAGCCCC 
ATCTCAGTCT 
CTGTTAACTT 
CAGAAACCAG 
CTTCAGTATT 
TTTCTGTTTT 
TTGTATGCAG 
TGAAATTTAC 
ATTCTCAATT 
AATATTGATT 



GACTGACACT 
TGAGACGTCA 
GATCCCAGAG 
GAAACGATTT 
TCTGACTGAT 
CCTGAAGAAG 
GGAAGGTTGC 
CACAGGTTAT 
TCAAGCTTCA 
TTTTAAAGTG 
TGTCACCGAA 
ATCCCATTOG 
AAGCGCACGT 
ATCTACCTCA 
GGTGAAGACC 
TATGTAGAAG 
AGGTTTGGGA 
GTCTCCATCC 
GCCCCACCAA 
GCTGCCCTAG 
GGAACTCAGT 
TGCTTCTCTA 
CAACTGATTC 
TCCAGGAATA 
TCTCATTTTG 
TAGAAAGGAA 
TCTATGGACT 
TTGATATATA 
AAAATTGCTA 



GCAGTACCTA 
AGGCCACCAA 



GTTGCTCCTT 
GCACTGGATC 
AGTGATCTGC 
CACTCATTTG 
TTGGTTTACA 

tcagtaactc 
caagcacaaa 
tcagataatc 
ctttcaaaca 
ggtatcgaaa 

GTGACAACCT 
ATTGCCAATT 
CCCTCCCTAC 
ACATCGGCTT 
CTGGAAAGTG 



GTGCCAGGAA 
CCCACTTCTT 
CTTTTTTTTG 
ADTGTQATTT 
GCATATGCAC 
GATTTCTCCA 
TTATTAAAAA 
TAOCCACTGC 
TATQTATATA 
AATTTGTACT 



cggaagaggc 
ccaccactga 

TCAGTTCCTT 
ACGTGACGTA 
ACTTCCAAGT 
CTCCCCAGCA 
TCATTGTGGA 
GTGCATCCTA 
ACTTGGCCAT 
ATCCTCATGG 
CTCTGCTTGT 
TGATCCCAGC 
GTTCGTGGGA 
GAAAGATACA 
TGTGGATTCA 
TATTCAAGGC 
CGGAAGCCCC 
GTAATCACAG 
CTAGGGGCTG 
GGTCACAGAT 
GGCCTGGACA 
TTTGTTTGTA 
CCCAGACTTT 

AAACTAACTG 
CACCACCAAA 
TAGAATAAAT 
TGCATATACA 
TGTTCACCAA 



CTACGTTATA 
GCCTTCGACC 
TCCTGAAGAA 
CCTAAATAAA 
GGACAGCCTG 
TGCTCCCCGC 
TTGGGACAAA 
TGAAGATTTC 
TGAGAACCTA 
CTACGGACCT 
TGTGAGGCCC 
TAGAGGGACT 
GTTGTTCTTT 
TTCTACAGCA 
CACCTTGATG 
TACTATCGDC 
TACTACTA0G 
GACCGTCATG 
TGAGCAAAGA 
GGACACTGGC 
ATGAACAGGA 
ATAGCACATC 
TTAGGCATGA 
CTTCATGGAA 
AATTTAAGCT 
GAAAATAAAT 
GTATCAAATC 
TATCCACACT 
AAAAAAAAAA 



4320 
4.360 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4 920 
49Q0 
5040 
5100 
5160 
522 O 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5B2 0 
5880 
5940 
6000 



Seq ID NO: 60 Protein sequence 
Protein Accession #r Eos sequence 



1 
I 

MTGTKLTRTG 
SRQYTVRYRE 
TPESAPTTAP 
FQNTFFHTPR 
SLCQOBC9CT 
MCYKDPVSSL 
QGRNAKDIilZL 
DTQDQKRTLR 
ASPAHHASTQ 
SSVLfcDRSSV 



11 
I 

APADYRVIIiK 
KGELARWDYK 
ENIJ5JVHFVNG 
I/SHHLEQ8P3 
QKDFSCIiAYL 
TGNSUCSVAA 



GRQAEATAQT 



GTSHRPSLPA 
HPGAKPASPA 
EDAPATNSNA 
LRARPASGHF 



21 
I 

TSQEDELDVP 
QIANRRVLXB 
KPTWAASWD 
PILETIiLPW 
IDIQTKOVNK 
SKAT5VQQNTE 
GAPRKPOLRA 
APGRTAVHAR 
SLKDNDLVDS 
RRTPH6GAAE 
FSRSTMSSSV 
HLLRHKPFAA 



31 
1 

DDISVRVMSS 
khjIpdtvyep 
ALPETBGKVK 
HMVCSLGNAI 
DPQLEGSVFG 
DNGKPBKPEP 
KKAEELDLQ8 
KPALPRREGV 
DEDSRAVGSL 
EDS9ASAPPS 
SSHIiSSRTQV 



41 
1 

QSVItVSHVPP 
AVHISQGERD 
VCLLDTGLFS 
FSKSGPQTGE 

PCPLFYPLTF 
SSFSPRAPAS 
TEITGBEELG 
DKPGFfiliAXQ 
HPKGAFAQPR 



51 
I 

VJjEKQ'KKWA 
GKWSTBVFQR 
VSSFQPSAKS 
AWDIjTPKPSL 
KLDIGGFSPJ 



PRFGAPPSAS 
PAL8PSRQSP 
RIiIjPTQPHT*S 



ADTEGHSPXA 
GPQBRDAGR3 
SYDDDSTEVS 



TWMDHVPSB 

SLHRKEPIPE NPKSTGADTH PQGKYSSIiAS KAQDVQQ9TD 
PARPPAAR5Q QHPSVFRRMT PGRAPSQQPP PPVATSQHHP 
AGRPRPTSGG RBHSSSDPYT ASSRGKLPTA LQNQDEDAQQ 
AEAKEAAASL PXEQQVESPT GAGaGGDERS QRGHAASPAR PSRPGGPQSR 

SSVPKHPSSS 
PSRPPPXSAA 
PEYRQGXNGR 
GNPLRIKLGG 
EVIXKTTHPP 
TTTRRPTTTV 
EF8GI1ETDTA 
FDIiAGRKRFV 
ITVVAVEGCH 
PHTRYYPKVQ 



fiLAKEEREPA 1ALAPRGGSC 
KPRYTTRAPP GHFSTTPMLS 
SNGKPHGQRI EK3PQGTKHV 



T P L PTTTTPR PTTATTMQPT 
TPKPTTPIPT CPPGTliERHD 
DEDYEFETSR PPTTTEP8TT 
P3APCSI>TJ3A XJXHFQVDSliD 
TPGDLVTGYIi VYSASYEDPI 
SPfiVfiFVTES DNPLLWRPP 



APVKRPLPPP 
ItRQRHMHARF 
VDLDRGLVLN 
IiANAQDKPIL 
TTTTFLPTTT 
DDGMT*IM£SN 
ATTPKVIFEE 
BIIPMDUOCS 



PGS8PRASEV 
RNPLSROPAR 
AJBGRYLGDBH 



PRPTTATTRR 
GIPECYAEKD 
GAISSFPEEE 
DiPPQHAPRK 



GGELBGSHSL SNMIPATRTA MDGHH 



SPQSTVPSRA 
DLRRSPORGA 
QPGfiTDRHAS 
PSQPRLSI^Q 
AQDVRAPAHA 
ARVPSRAAPG 
TPRGGKDADG 
TV3PVAGTHP 
PHVEGKVLPG 
BGRTrVDIiEG 
TT1MQPTTTT 

VPTEEAYVIY 
APYVTYLNKD 
SFVIVDWDKA 
AQNPHGYGPI 



Seq ID NO; 61 DNA sequence 

Nucleic Acid Accession #: NH_022743 

Coill Tig sequence: 12 8.. 123 7 



1 
1 

GTGGATTTTA 
TCTCTGCTGC 
AAAQCTGATG 
AAAAGCTTGG 
TCGTCCAGAC 
TTCAGAATCA 
TGAAGATAAG 
AGAAATACAG 
AGTGATCTGC 
ATATCCCAGT 
TGGGCCCCAC 
CTGCTACCTG 
CTGCTTTGAA 



11 
I 

GAGATACCTC 
AACACCCCTA 
CGATGCTCTC 
CCAGACCACA 
TCCGTTCGAC 
GAGAAGCTTT 
AAAGAGGGCC 
GATGCCTCTC 
AACTCTTTCA 
ATCTCTTTGC 
CTCTTACTGC 
GATATGCTGA 
TGTGACTGTT 



21 
I 

CCCTCCTTCT 
CTGTTCTCCG 
AGTGOCGCGT 
AGCGGGAATG 
TTCXTGGCAG 
ACTCATTTTA 
TCAGGCAACT 
AGCTGCCACC 
CCATCTGTAA 
TCAATCACAG 
GAGCAGTCCG 
TOACCAGTGA 



31 
I 

GCTCAGCTGC 
TGTATTGGCT 
CGCCAAATAC 
CAAATGCCTT 
AGTTGTCTTC 
TGATCTGGAG 
CGTAATGACA 
TGCCTTTGAC 
TGCGGAGATG 
CTGTGACCCC 
AGAGATCGAG 
GGAGCGCCGG 
AACGCAGGAC 



CTTGCAGTAA 

TGTAGTGCTA 
AAAAGCTGCA 
AAACTTATGG 
TCAAATATTA 
TTTCAACATT 
CTTTTTGAAG 
CAGGAAGTTG 
AACTGTTCGA 
GTGGGAGAGG 
AAGCAGCXGA 
AAGGATGCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
102O 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



51 
1 

TTAAACTCTT 
GCAGGAAGGA 
AGTGTCAGAA 
AACCCAGATA 
ATGGAGCACC 
ACAAACTGAC 
TCATGAGAGA 
CCTTTGCAAA 
GTGTTGGCCT 
TTGTGTTCAA 
AQCTCACCAT 
GGGACCAGTA 
ATATGCTAAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



973 
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TGGTGATGAG 
GGCACRCTGG 
TGAACGGCXT 
CTGCATCAAC 
ATACAGGATT 
CAAACTGCAG 
TGATATTATG 
TTTAGAAGAA 



GTGAACCTCT 
GGTXTGCAAA 
ATTTGGTTGA 



CAAGTATGGA 
AAGTGGGAGC 
CCCGATATCA 
CTOGGCCTGT 
XTXTTCCCAG 
CTACATCAAG 
AGAGTGACAC 
TGCGACGCCA 
CTTTGTTGAA 
CTTATTGGAA 
CCACAAGAAT 
GGATGCCAAA 



AGGAAGTTCA 
AGGTTCTGGC 
ACATCTACCA 
TGGAGGAAGC 
GAAGCCATCC 
GCATGTTTCC 
ATGGCAGAGA 
ACATCAGAGC 
TGCCTTATTG 
ATTCTGTXCC 
CATTAGTTGT 
AAAAAAAAAA 



Seq ID HQs 62 Protein sequence 
Protein Accession #t NP 073 580 



1 
I 

MRCSQCRVAK 
SEEnbYSFVDIi 
CNSETICUAE 
LDMLMTSSER 
WKWEQVLAMC 
J. FFPG9HPVR 
ECDAKTRAS 



11 
I 

YCSAKCQKKA 
ESNINKLTED 
MOEVGVGLYP 
RKQIiRDQYCF 
QAIISSNSER 
GVGVMKVGKLj 



21 
I 

WPDHKRECKC 
XKEGLRQLVM 
SISLIJJHSCD 
ECDCFRCQXQ 
LPD I N I YQI*K 
QLHQGMFPQA 



AGAATCCCT3 
CATGTGOCAG 
GCTGAAGGTG 
CTTGTTCTAT 
CGTCRGAGGG 
CCAAGCAATG 
ACACAGCCTG 
ATCCTAAGGG 
AGGTCACACA 
GTGTTTGTGT 
AGAGAAGCAC 
AAAAAAA 



31 

I 

LKSCKFRYPP 
TPQKFMREEI 
FNCSIVFNGP 
DKDADMLTGD 
VIiDCAHDACI 
MKNLRXAFJDI 



AAAAAAATTG 
GCGATCATAA 
CTCGACTGCG 
GGTACTCGGA 
GTTCAAGTGA 
AAGAATCTGA 
ATTGAAGATT 
AACGCAGTCA 
CXCTATGCTT 
AGGTAAATAA 
GATTATAATA 



AAGAACIGAA 
GCAGCAATTC 
CCATGGATGC 
CCATGGAGCC 
TGAAAGTTGG 
GACTGGCTTT 
TGATTCTACT 
GAGGGAAATA 
TGTTAGCTGT 
AGGCAGACAT 
AATTCAAAAC 



41 51 

1 i 
DSVRLLGRW PKLHDGAP3E 
QDA8QXPPAF HLFEAFAKVI 
HLI/LRAVRDI EVGSSLTICY 
EQVWXEVQES I^KKIEELKAH 
NLGLLEEALF YGTRTMBPYS 
MHVTHGREHS LIEDLIKLLE 



340 
&oo 
960 
1020 
10BD 
1140 
1200 
1260 
1320 
13B0 
1440 



Sea. ID NOi 63 PSA sequenCfe 

Mucleic Acid Accession #s NM_0D3Q14.2 

Coding sequence: 23 8.. 648 

1 11 21 31 41 51 

I I I I I I 

GGCGGGTTCG DGCOOGGAAG GCTGAGAGCX GGCGCTGCTC GTGCOCTGTG TGCCAGACGG 
CGGAGCTCCG OGGCCGGACC CCGCGGCCCC GCTTTGCTGC GGACTGGAGX TTGGGGGAAG 
AAACTCTCCT GCGCCCCAGA AGATTXCTTC CTCGGCGAAG GGACAGCGAA AOATGAGGQT 
GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG GGOGAGAGGG CAGTGCCATG 

ttcctctcca tcctagtggc ggtgtgcctg tggctgcacc tggcgctggg CGTOCGCGGC 

GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGOGG 

atgcccaacc ACCTGCACCA cagcacgcag gagaacgcca tcctggccat CGAGCAGTAC 

GAGGAGCXGG TGGACGTGAA CTGCAGCGCC OTGCTOCGCT TCTTCTTCTG TGCCATGTAC 
GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 
CAAOGCGOGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 
AGCCTGGCCT GCGACGABCT GCCTGTCTAX GACCGTGGCG TGTGCATTTC GCCTGAAQCC 
ATCGXCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 
CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 
AAGGTGAAGC CAACXTXGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 
AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 
GAGATCXTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 
TCTTGCCAGT GTCCACACAT CCTGCCCCAX CAAGATGTTC TCATCATGTG TTACGA3TGG 
CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 
AAAAGATCCA TACAGTGGGA AGAGAGGGTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 
AAGAAAACAG CCGGGCGCAC CAGTCGTAQT AATCCQCCCA AACCAAAGGG AAAGCCTCCT 
GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 
AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGOGGA GACTTCCGAC TTCCTTACAG 
GATBAGGCTG GGCATTGCCT GGGACAGOCT ATGTAAGGCC ATGTGCCCCT TGOGCTAACA 
ACTCACTGCA GTGCTCTTCA TAGACACATC XTGOAGCAXT TTTCTTAAGG CTATGCTTCA 

GTTTTTCTTT GXAAGCCAXC acaagccata gtggtaggxt TGOCCTTTGG tacagaagot 

GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGGATACT 
CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC ATTGTAAAAX 
AAATGCCATA TTTCAAACAA AACAOGTAAX XXXXXXACAG TATGTTTTAT TACCTTTTGA 
TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 

aaggaacagt agtggaatga atgttaaaag atctttatgt gtttatggtc TQCAOAAGGA 

TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA AT TAGAG AAG TAGCATATGG AAAATTATAA 
TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTXTXTAGCT AGAAACTTAA AAACAAAAAT 
AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTGTGGATTC CTGTTTTTTG 
GTTACCXGAT TTCCATGATC AXGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 
ACAGTGAGTT TGTCIGXACC ATTAGGAGTT AGGTACXAAT TAGTTGGCTA ATGCTCAAGT 
ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT OCACCCTGAG 
AATAATTTGA CAAGCTTAAA AATGGCX3TC ATGTGAGTGC CRAATTTTGT TTTTCTTCAX 
TTAAATATTT *rCTTTGCCTA AATACATGTG AGAGGAGTTA aatataaatg tacagagagg 
AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGAXACX TTAATCAGAA 
AAAAAGAACX XATITGCAQC ATTTTAXCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 
ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTTTATA 
AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATAtTTA TTCTCCAGGT GTTTGCTTAT 
GCACTTATAA AATGATTTGA ACAAATAAAA CIAGGAACCT GXATACATGT GTTTCATAAC 
CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 
TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 
TATTGGATAC ttaggtggtt tcttcactga CAATACTGAA taaacatctc AOCGGAATTC 

Seq XD NO: 64 Protein sequence 
Protein Accession $• NP 003005.1 
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1 I I 1 I I 

MFLSILVALC LWLKLcALGVR GAPCEAVRIP MCRHMPWNTT RWPNHLHHST QENAIIAIEQ 60 

YEEIjVDVNCS AVT..RFFFCAM YAPICTLEFk HDPIKPCKSV CQRAKDDCEF LMKMYNHSWp 120 

ESLACDKLPV YDRGVCZSPE AIVTDIiPKDV KWIDITPDMM VQERPIiDVDC KRLSPDRCKC 180 

KKVKFTLATY L9KNYSWIH AKXKAVQRSG CNEVXXWDV KEIFKBS8PI PRXQVPLIXN 240 

SSCQCPHILP HQDVLIMCYS WRSRMMLLEN CLVEKWRDQIi SKHSIQWEKR LQBQRRTVQD 3 GO 
KKXTAGRTSR SNPPKPKGKP PAFKPASPKK K I KTR SAQKR TNPKRV 

Seq ID NO = 65 DMA sequence 

Nucleic Acid Accession #t BCQ10423 

Coding sequence: 246. .17B0 

1 11 21 31 41 51 

1 I I 1 I I 

CACAGCGXGG GAAGCAGCTC TGGGGGAGCI CGGAGCTCOC GATCACGGCT TCTTGGOGGT 60 

AGCTACQGCT GGGTGTOTAQ AACGGGGCCG GGGCTGGGGC TGG6TCCCCT AGTGGAGACC 120 

CAAGTGCGAG AGGCAAGAAC TCTGCAGCTT CCXGCCXXCT GGGTCAGTTC CTTATTCAAG 1B0 

TCTGCAGCCa GCTDCCAGGG AGArCTCGGX GGAACTTCAG AAACGCTGGG CAGTCTGCCT 240 

TTCAACCAXG CCCCTGTCCC XGGGAGCCGA GATGTGGGGG CCTGAGGCCT GQCTGCTGCT 300 

GCTGCTACTG CTGGCATCAT TTACAGGCCG GTGCCCCGCG GGTGAGCTGG AGACCTCAGA 360 

CGTGGTAACT GTGGTGCTGG GCCAGGACGC AAAACTGCCC TGCTTCTACC GAGGGGACTC 420 

CGGCGAGCAA GTGGOGCAAa TGQCATGGGC TCGGGTGGAC GCGGGCGAAG GCGCOCAGGA 480 

ACTAGCGCTA CTGCACTCCA AATACGGGCT TCATGTGAGC CCGQCTTACG AQGGCCGCOT 540 

GGAOCAGCOG CCGCCCCCAC GCAACCCCCT GGACGGCTCA GTGCTCCIGC GCAACGCAGT 600 

GCAGGCGGAT GAGGGCGAGT ACGAGTGCCO GGTCAGCACC TTCCCCGCCQ QCAGCTTCCA 660 

GGCGCGGCTG CGGCTCCGAG TGCTGGTGCC TCCCCTGCCC TCACTGAATC CXGGTCCAGC 72 0 

ACTAGAAGAG GGCCAGGGCC TOACCCTGGC AGCCTCCTGC ACAGCTGAGG GCAGCCCAGC 780 

CCCCAGCGTG ACCTGGGACA CGGAGGTCAA AGGCACAACG TCCAGCOGTT CCXXCAAGCA 840 

CTCOCaCTCT GCTGCCGTCA CCTCAGAGTT CCACTXGGTG CCXAGOCGCA GCAXGAATGG 90C 

GCAGCCACTG ACTTGTGTGG TGTCCCATCC TG3CCTGCTC CAGGACCAAA OGATCACCCA 960 

CATCCTCCAC GTGTCCTTCC TTGCrGAGGC CTCTGTGAGG GGCCXTGAAG ACCAAAATCT 1020 

GTGGCACATT GGCAGAGAAG GAGCTATGCT CAAGTGCCTG AGTGAAGGGC AGCCCCCTCC 10 BO 

CTCATACAAC TGGRCACGGC TGOATGGGCC TCTGCCCAGT GGGGTACGAG TGGAXGGGGA 1140 

CACTTTGGGC TTTCCCCCAC TGACCACTGA GCACAGCGGC AXCTAOSTCT GOCATGTCAQ 1200 

CAATOAGTTC TCCTCAAGGQ ATTCTCAGGT CACTGTGGAT GIXCXXGACC CCCAGGAAGA 1260 

CTCTGGGAAG CAGGTGGACC TAGTGTCAGC CTCGGTGGTG GTGGTGGGTG TGATCQCOGC 1320 

ACTCTTOTTC TOCCTTCTGQ TGGTGGTGGT GGTGCXCAXG XCCCGATACC ATCGGCGCAA 1380 

GGCCCAGCAG ATGACCCAGA AATA7GAGGA GGAGCTGACC CTGACCAGGG AGAACTCCAT 1440 

CCGOAQGCTS CATTCCCATC ACACGGACCC CAGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500 

GAGAGCCGAG GGCCACCCTG ATAGTCTCAA GGACAACAGT AGCTGCTCTG TGATGROTQA 1560 

AGAGCCOGAG GGCCGCAGTT ACTCCACGCT GACCACGGTG AGGGAGATAG AAACACAGAC 1620 

TGAACTGCTG TCTCCAGGCT CTGGGCGGGC CGAGGAGGAG GAAGATCAQG ATGAAGGCAT 1680 

CAAACAGGCC ATGAACCATT TTGTTGAGGA GAATGGGACC CTAGGGGCCA AGCCCACGGG 1740 

CAATGGCATC TACATCAATG GGCGGGGACA CCTGGTCTGA CCCAGGOCTG CCTCCCTTCC IS 00 

CTAGGCCTGG CTCCTTCTGT TGACATGGGA GATTTTAGCT CATCTTGQGG OCCTCCTTAA IB 60 

ACACCCCCAT XTCTTGCGGA AGAXGCTCCC CATCCCACTG ACTGCTTGAC CTTTACCTCC 1920 

AACCCTTCTQ TTCATCGGGA GGGCTCCACC AATTGAOTCT CTCOCACCAT GCATGCAGGT 1980 

CACTGTGTGT GTGCATGTGX GCCTGTGTGA GTGTTGACTG ACTGTGTGTG TGTGGAGGGG 2040 

TGACTGTCCG TCK3AGGGGT B ACTGTGTCCG TGGTGTGTAT TATGCTGTCA TATCAGAGTC 2100 

AAGTGAACTG TGGTGTATGT GCCACGGGAT TTGAGTGGTT GCGTGGGCAA CACTGTCAGG 2160 

GTTTGGCBTQ TGTGTCATGT GGCTGXGTGT GACCTCTGCC TGAAAAAGCA GGTATTTTCT 2220 

CAGACCOCAG AGCAGXATTA ATGATGCAGA GGTTGGAGGA GAGAGGTGGA GACTGTGGCT 22 BO 

CAGACCCAGG TGTGOSGQCA TAOCTGGAGC TGGAAXCIGC CXCCGGTGTG AGGGAACCTG 2340 

TCTCCTACCA CTTCGGAGCC ATGGGGGCAA GTOTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400 

GAGGCXTGAA CTGTTACAGA AGCCCTCTGC CCTCTGGTGG GCTCTGGGCC TGCTGCATGX 2460 

ACATATTTTC TGTAAATATA CATGCGCDGG GAGCTTCTTG CAGGAATACT GCTCOGAATC 2520 

ACTTTTAA1T TTTTTCTTTT TTTTTXCXXG CCCTTXCCAT TAGTTGTAXI TTTTATTTAT 2580 

TTTTATTTTT ATTXXIXTTT AQAGTTTGAG TCCAGCCTG3 ACQATATAGC CAGACCCTGT 2640 
CTGTAAAAAA ACCAAAACCG AAAAAAAAAA AAAAAAAAAA 

Seq ID NO* 66 Protein sequence 
Protein Accession. #: AAH10423 



^_ 1 11 21 31 41 SI 

65 | | | | | I 

MPLSLGAEMW GPSAWLLLLL LLASFTGRCP AGELET8DW TWLGQDAKL PCFYRGD8GK 60 

QVGQVAWARV DAGEGAQELA LLHSKYOLHV S P A YEGRVEQ PPPPRNPLDG SVU^KAVQA 120 

DEGEYECRVfi TFPAGSFQAR LRLRVLVPPIi PSLNPGPALE BGQGLTLAAS CTAEGSPAPS 180 

VTWDTEVHGT TSSRSFKH9R SAAVTSEFKb "VPSRSMMGQP LTCWHHPGL I/QJDQRITHIL 240 

70 BVSFLABASV HBLEDQNIjWH IGREOAMLKC LSBGQPPPSY NHTRIiDGPLP SGVHVDGDTL 300 

GFPPLTTEHS GIYVCHVSNE F9SRDBQVTV DVLDPQBDSG KQVDLV8ASV WVGVlAALIi 360 

FCLLVWWL HSRYHRRKAO QMTOKYEEEL TLTREKSIRR LHSHHTDPRS QPEESVGLRA 420 

EGEPDSLKDK SSCSVM9EEP EGRSYSTLTT VREIETOTEL LSPGSGRAEE EEDQDEQIKQ 480 
AMHHFVQENB TLRAKFTGNG I Y UN GRQHLTV 



Seq ID NO: 67 DMA sequence 

Nucleic Acid Accession ft: NM_001203 

Coding sequence i 274..17B2 

1 11 21 31 41 51 

I I I I 1 1 

CGCGGGGOGC GGAGXOGGCG GGGCCTCGCG GGACGCGGGC AGXGCGGAGA CCGCGGCGCX 60 
GAGGACGCGG GAGCCX3GGAG CGCACGCGCG GGGTGGAGXX CAGCCTACTC TTTCTTAGAT 120 
GTGAAAGGAA AGGAAGAXCA TTTCATGCCT TGTTGATAAA GQTTCAGACX XCXGCXGATT 180 

975 
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CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA 
TGCCATAAGT GAGAAGCAAA CTTCCTTOAT AACATGCTTT 
AATGTOGGCA. CCAAGAAAGA GOATGGTGAG AGTACAGCCC 
TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG 
GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG 
GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG 
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA 
CTGCCTCCAT TGAAAAACAG AOATTTTGTT GATGGACCTA 
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA 
TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT 
ATTCOTCCTQ GAGAATCCCT GAGAGACTTA ATTGAGCAGT 
rCAGGOCTCC CTCTGCTQGT CCAAAGQACT ATAGCTAAGC 
ATTGGAAAAG GTCGCTATOG GGAAGTTTGG ATGGGAAAGT 
GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG 
GGGTOCTGGA CCCAGTTGTA CCTAATCACA GACMATCATG 
TATCTQAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA 
AGTGGCTTAT GTCATTTACA CACAQAAATC TTTAGTACTC 
CATOTAOATC TGAAAAGTAA AAACATTCTG GTGAAOAAAA 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT 
AATCACTTCC AGTCTTACAT CATGQCTGAC ATGTATAGTT 
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT 
GTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA 

CGceccTCAO: tcccaaaccg gtggagcagt QATGAGTGTC 

ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA 
ACACTTGCCA AAATGTCAGA GTCCCAGOAC ATTAAACTCT 
CATCTCTSCA GAAAGCGAAC AGGTACTCTT CTGTTTGTGG 
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA QAAGCTCCCm 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA 

Seq ID NO: 68 Protein sequence 
Protein Accession. # - HP 001194 



AAAAGTTAAA 
TGCGAAGTGC 
CCACCCCOCG 
TCAACAATAT 
GGTTOCCTGT 
ACACTCCCAT 
ATAAAGAOCT 
TAGACCACAG 
T ATT AT T TT G 
TAGAACAGGA 
CTCAGAGCTC 
AGATTCAGAT 
GGCGTGGCGA 
GAGAGACAGA 
CTGCAGATAT 
AAAATGGTTC 
AGTTAGCCTA 
AAGGCAAACC 
ATGGAACTTG 
AAGTTGACAT 
TGGACGAGAG 
TTGGCCTCAT 
ACCAGCTTCC 
TTGTGTGOVT 
TAAGGCAGAT 
CAGCCCTGCG 
GATAGGAGAG 
GCAGAGCAAA 
CCAGTGGGTT 
GAAGGAGAGA 
GATATGATGC 



CTTACAAGCC 
AGGAAAATTA 
TCCAAAGGTC 
TTGCAGCACA 
GGTCACTTCT 
TCCTCATCAA 
ACACCCTACA 
GGCTTTACTT 
TTACTTCCGG 
TGAAACTTAC 
AGGAAGTGGA 
GGTGAAACAG 
AAAGGTAGCT 
AATATATCAG 
CAAAGGGACA 
CCTTTATGAT 
CTCTTCTGTC 
AGCAATTGCC 
CTGTATTGCT 
ACCACCTAAC 
CTTGAACAGA 
CCTTTGGGAG 
TTATCATGAC 
CAAGAAGTTA 
GGGAAAACTC 
GGTTAAGAAA 
GAAAAGTAAG 
AGACATCAAA 



TTGATCCGTG 
AT 



GPTHHEAZiIjI 



WFRETBXYQT 
MLKLAYSSVS 
TNEVDIPPNT 
EEYOJLPYHDL 

rXjTAIiRvkkt 



11 21 

t 1 

VGTKKKDGES TAPTPRPKVfc 
CLGLFGSDPQ CZRDTPIPHQH 
SVTVCSLXiV LIILFCYFRY 
GLPUjVQRTI AKQIQMVKQI 
VLHRHBHXLG FIAADIRGTG 
GLCHLHTEIF STQGKPAIAH 
RVGTKHYHPP BVLDESLNRM* 
VPSDPSYEDM REIVCTKKLiR 
IAKMSESQDI KL. 



31 
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4.1 
I 



51 
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RSIECCTERW 
KRQETRFRYS 
GKGRYGEVWM 
SWTQIiYDTTD 
RDLKSKNILV 
HFQSYIKADM 



ECNKDLHPTL 
XGLBQDETYI 
GKWRGBKVAV 
YHENGSLYDY 



YSFGItlLWEV 
ECLRQMGKLM 



240 
300 
360 
420 
460 
540 
600 
660 
720 
760 
B40 
900 
960 
1020 
1080 
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1620 
1660 
1740 
1600 
1660 
1920 
I960 



PPLKNKDPVD 
PPGESXiRDIil 
KVFFTTHEAS 
LKSTTLDAKS 
U3LAVKFISD 
ARRCVSGGIV 
TBCWAHNPAS 



60 
120 
180 
240 
300 
360 
420 
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Seq ID NOs 69 DMA sequence 

nucleic Acid Accession # = Bos sequence 

Coding sequence: 166.. 1737 

1 11 21 31 41 51 

I 1 I 1 1 I 

TTGGGQGTTT ATTCTCTTOC CTTCTAACTT QACAGGGTCT TX3CTCTGTCA TTCAGGCAAG 
AGTGCAGTAG TGTQATCACT TCTTACTGCC GCCIGAAGCT TCCAGCCTCA ACTCAAGCAA 
TOCTCCCACC TCAGCCACCC AAGTGGCTGQ OACTACAGAT TAAGAATGAC CCAAAATAAA 
TTAAAGCTTT GTTCCAAAGC CAATGTGTAT ACTGAAGTGC CTGATGGAGG ATGGGGCTGG 
GCGGTAQCTG TTTCATTTTT CTTCGTTOAA GTCTTCACCT ACGGCATCAT CAAGACATTT 
GGTGTCTTCT TTAATGACTT AATGGACAGT TTTAATGAAT CCAATAGCAG GATCTCATGG 

ataatctcaa tctgtgtgtt tgtcttaaca ttttcagctc ccctcggcac agtcctqagc 
AATCGTTTCQ gacaccgtct ggtagtgatg ttgggggggc tacttgtcag caccgggatg 
GTGGCCGCCT CCTTCTCACA aoagqtttct catatgtaog TOGCCATOGG CATCATCTCr 
QGTCTGGGAT ACTGCTTTAG TTTTCTCCCA ACTOTAACCA TCCTATCACA ATATTTTOGC 
AAAAGACGTT CCATAGTCAC TGCAGTTGCT TCCACAGGAG AATGTTTCGC TGTGTTTGCT 
TTCGCACCAG CAATCATGGC TCTGAAGGAG CBCATTGGCT GGAGATACAG CCTCCTCTTC 
GTGGGCCTAC TACAGTTAAA CATTGTCATC TTCGGAGCAC TGCTCAOACC CATCTTTATC 
AGAGGAOCAG CGTCACCGAA AATAGTCATC CAGGAAAATC GGAAAGAAGC GCAGTATATG 
CTTOAAAATG AGAAAACAOG AACCTCAATA GACTCCATTG ACTCAGQAGT AOAACTAACT 
ACCTCACCTA AAAATGTGCC TACTCACACT AACCTOGAAC IGGAGCCGAA GGCCGACATG 
CAGCAGGTCC TGGTGAAGAC CAGCCCCAGG CCAAGCGAAA AGAAAGCCCC GCTATTAGAC 
TTCTCCATTT TGAAAGAGAA AAGTTTTATT TGTTATGCAT TATTTGGTCT CTTTGCAACA 
CTGGGATTCT TTGCACCTTC CTTGTACATC ATTCCTCTGG GCATTAGTCT GGGCATTGAC 
CAGGACCGCG CTGCTTTTTT ATTATCTACG ATGOOCATTG CAGAAGTTTT CGGAAGGATC 
GGAGCTGGTT TTGTCCTCAA CAGGQAGCCC ATTCGTAAGA TTTACATTGA GCTCATCTGC 
GTCATCTTAT TGACTGTGTC TCTGTTTGCC TTTACTTTTG CTACTQAATT CTGGGGTCTA 
ATGTCATGCA GCATATTTTT TGGGTTTATG GTTGGAACAA TAGGAGGGAC TCACATTCCA 
CTGCTTQCTG AGGATGATGT GGTGGGCATT GAQAAGATGT CTTCTGCAGC TGGGGT CTAC 
ATCTTCATTC AGAGCATAGC AGGACTGQCT GGACCGGCCC TTGCAGGTTT GTTGGTGGAC 
CAAAGTAAGA TCTACAGCAG GGCCTTCTAC TCCTGCGCAS CTGGCATGGC CCTGGCTGCT 
GTGTGCCTCG CCCTGGTOAG ACCGTGTAAG ATGGGACTGT GCCAGCATGA TCACTCAGGT 
GAAACAAAGG TAGTGAGCCA TCGTG9GGAAG ACTTTACAGG ACATACCTGA AGACTTTCTG 
GAAATGGATC TTGCAAAAAA tgagcacaga gttcacgtgc aaatggagcc GGTATGACAC 
ACTTrCTTAC AACAACAGCC ACTGTGTTGG CTGGAGAGGG ATGGGGTGGG CCCAAOGGGG 
ACACAAGGAG GCAGAGGAGC TAACOCCTCT ACTCCACTTT CAAAACTACA TTTTAAAGGQ 
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AATGTGTATG TGAAGAGCAC TACCAACATC GCTTTTGTTT TGTTTTGTTT TGTTTTAAGC 

TTTTTTTTTT tgcttgtttt taaagccaaa acaaaaaaca accaagcact cttccatata 

TAAATCTGGC TGT AT TCAGT AGCAATACAA GAGATATGTA GAAAGACTCT TTGGTTCACA 
TTCCGATATT AAAATAGTGA CAT6AACTGG CAAASTQGXT TTAAAAGCTT TCACGTGGGA 
TAAATGATTT tctttttttc TTTTCTTTCT TCCTATGGTC TTGTCTGAAT AAACTACTCT 
CCTGAATAAA ACAACAT C CA. ACCCAGGTCA TTGAAATGAA ATTGGCCA6T C 

Seq id NO* 70 Protein sequence 
Protein Accession tt: Eos sequence 



1920 
1980 
2040 
2 LOO 
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1 
I 

MTQNKLKLCS 
SRISWIISIC 
IGIISGLGYC 
YSLLFVGLLQ 
GVELTTSPKN 
GLFATLGFFA 
IBLICVILLT 
AAGVYIFIQS 
HHHSGETKW 



11 

I 

KAKVYTEVPD 
VFV1/FFSAPL 
FSFLPTVTIL 
LJJIVlPGAIiI* 
VPTHTNLELE 
FSLYIIFLGI 
VSLFAFTFAT 
IAGLiAQPPliA 



21 
I 

GGWGWAVAVS 



SQYFGKRRSI 
RPIFIRGPAS 
PKADMQQVLV 
SLGIDQDRAA 



31 
I 

FFFVEVE T YG 
RLWHLGGLL 
VTAVASTGBC 
PKIVTQENRK 



GLJjVDQSKIY 



FLLSTKAIAE 
FFGFMVGTIG 
SRAFYSCAAG 



41 
I 

IIKTFGVPP19 
VSTGMVAASF 
FAVPAFAPAI 
EACJYMLjEKEK 
APLLDFS ILK 
VFGRIGAGFV 
GTHIPLLAED 
MAIiAAVCIiAIi 
EPV 



SI 
I 

DLMDSFNESN 
SQEVSHMYVA 
MALKSRIGHR 
TRTSIDSIDS 
EKSFICYALF 
LNRBPIRKIY 
DWGIBKMSS 
VRPCKMGLCQ 



Seq ID NO: 71 DMA sequence 

nucleic Acid Accession #s NM_004694 

Coding sequences 166 -.1737 
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21 



31 



41 



ATTCTCTTCC cttctaactt gacagggtct tgctctqtca 

AGTGCAGTAG TGTGATCACT TCTTACTOCC GCCTCAAGCT TCCAGCCTCA 
TCCTCCCACC TC3W3CCACCC AAGTGGCTGG GACTACAGAT TAAGAATGAC 
TTAAAGCTTT gttccaaagc CAATGTGTAT ACTGAAGTGC CTGATGGAGG 
GCGGTAOCTG TTTCATTTTT CTTCGTTGAA GTCTTCACCT ACGGCATCAT 
GGTGTCTTCT TTAATGACTT AATGGACAQT TTTAATGAAT CCAATAGCAG 
ATAATCTCAA TCTGTGTGTT TGTCTTAACA TTTTCAGCTC CCCTCGCCAC 
AATCGTTTCG GACACCGTCT GGTAOTGATG TTGGGGGGGC TACXTGTCAG 
GTGGCCGCCT CCTTCTCACA AGAGGTTTCT CATATGTACG TCGCCATCGG 
GGTCTGGGAT ACTGCTTTAG TTTTCTCCCA ACTGTAACCA TCCTATCACA 
AAAAGACGTT CCATAGTCAC TGCAGTTGCT TCCACAGGAG AATGTTTCGC 
TTCGCACCAG CAATCATGGC TCTGAAGGAG CGCATTGGCT GGAGATACAG 
QTGGGCCTAC TACAGTTAAA CATTGTCATC TTCGGAGCAG TGCTCAGACC 
AGAGGACCAG CGTCACCGAA AATAGTCATC CAGGAAAATC GGAAAGAAGC 
CTTGAAAATG AGAAAACACG AACCTCAATA GACTCCATTG ACTCAGGAGT 
ACCTCACCTA AAAATGTGGC TACTCACACT AACCTGGAAC TGGAGCCGAA 
CAGCAGGTCC TGGTGAAGAC CAGOCCCAGG CCAAGCGAAA AGAAAGCCCC 
TTCTCCATTT TGAAAGAGAA AAGTTTTATT TGTTATQCAT TATTTGQTCT 
CTGGGATTCT TTGCACCTTC CTTGTACATC ATTCCTCTGG GCATTAGTCT 
CAGGACCGCG CTGCTTTTTT ATTATCTACQ ATQGCCATTG CRQAAQTTTT 
GGAGCTGQTT TTGTGCTCAA CAGGGAGCCC ATTOGTAAGA TTTACATTGA 
GTCATCTTAT TGACTGTGTC TCTGTXTGCC TTTACTTTTG CTACTGAATT 
ATGTCATGCA GGATATTTTT TGGGTTTATG GTTGGAACAA TAGGAGGACT 
TGCTTGCTGA AGATGATGTC GTGQGCATTG CAGAAGATGT CTTCTGCAGC 
ATCTTCATTC AGAGCATAGC AGGACTGGCT GGAOCGCCCC TTGCAGGTTT 
CAAAGTAAGA TCTACAGCAG GGQCTTCTAC TCCTGCGCAG CTGGCATGGC 
GTGTGCCTCG CX3CTGGTGAG ACCGXGTAAG ATGGGACTOT GCCAGCGTCA 
GAAACAAAGG TAGTGAGCCA TCGTGGGAAG ACTTTACAGG ACATACCTGA 
GAAAXGGATC TTGCAAAAAA TGAGCACAGA GTTCACGTGC AAATGGAGCC 
ACTTTCTTAC AACAACAGCC ACTOTGTTGG CTQGAGAGGG ATGGQGTGGG 
ACACAAGGAG GCAGAGGAGC TAACCCCTCT ACTDCACTTT CAAAACTACA 
AATGTGTATG TGAAGAGCAC TACCAACATC GCTTTTGTTT TGTTTTGTTT 
TTTTTTTTTT TGCTTGTTTT TAAAGCCAAA ACAAAAAACA ACCAAGCACT 
TAAATCTGGC TGTATTCAGT AGCAATACAA GAGATATGTA GAAAGACTCT 
TTCCGATATT AAAATAGTGA CATGAACTGG CAAAGTGGTT TTAAAAGCTT 
TAAATGATTT TCTTTTTTTC TTTTCTTTCT TCCTATGGTC TTGTCTGAAT 
CCTGAATAAA ACAACATCCA ACCCAGGTCA TTGAAATGAA ATTGGCCAGT 

Seq, ID BO: 72 Protein sequence 
Protein Accession # t NP_0046B5 
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I 



ACTCAAGCAA 
CCAAAATAAA 
ATGGGGCTGG 
CAAGACATTT 
GATCTCATGG 
AGTGCTGAGC 
CACCGGGATG 
CATCATCTCT 
ATATTTTGGC 



CCTCCTCTTC 
CATCATTATC 
GCAGTATATG 
AGAACTAACT 
GGCCGACATG 
GCTATTAGAC 
CTTTGCAACA 
GGGCATTGAC 
CGGAAGGATC 
GCTCATCTQC 
CTGGGGTCTA 
CACATTCCAC 
TGGGGTCTAC 
GTTGGTQGAC 
CCTGGCTGCT 
TCACTCAGGT 
AGACTTTCTG 
GGTATGACAC 
CCCAACGGGG 
TTTTAAAGGG 
TGTTTTAAGC 
CTTCCATATA 
TTGGTTCACA 
TCACGTGGGA 
AAACTACTCT 
C 



1 
i 

MTQNKLKLCS 
SRISWJISIC 
IGIISGLGYC 
YSLLFVGLLQ 
GVELTTfiPKH 
GLPATLGPFA 
IELICVILLT 
AAGVYIFIQS 
REM6GETKW 



11 
1 

KANVYTEVPD 
VFVLTFSAPL 
FSFIiPTVTlL 
LNIVIFGALL 
VPTHTNLELE 
PBLYIIPLGI 
VSLFAFTFAT 
XAGLAGPPLA 
SHRGKTLQDI 



21 
1 

GGKGttAVAVS 



SQYFGKRRSI 
RPIIIRGPAS 
PKADHQjQVLV 
SLGIDQDRAA 
EPWGLMSCai 
GLLVDQSKIY 
PBDPLEMDLA 



31 

\ 

FFFVEVFTYG 
RLWMLGGLL 
VTAVASTGBC 
PKIVIOJENRK 
KTSPRPSEKK 
FLLSTHAIAE 
FFGFMVGTIG 
SHAFYSCAAG 



Seq ID MO i 73 DMA sequence 

Nucleic Acid Accession #s NM_Q021B4,1 

Coding sequences 256. .3012 



60 
120 

lao 

240 
300 
360 
420 
48D 



60 
120 
160 
240 
300 
360 
420 
400 
540 
600 
660 
720 
700 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13BO 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2O40 
2100 
2160 



41 51 

I I 
IIKTFGVFFN DtLMDSFNESN 
VSTGMtfAASF SQEVSHMYVA 
FAVPAFAPAI MALKERIOWR 
EAQYMLEHEK TRT BIDS IDS 
APLLDFS ILK BKSFICYALF 
VFGRIGAGFV LNRBPIRKIY 



MALAAVCLAL VRPCKMGLCQ 
EPV 



60 
120 
180 
240 
300 
360 
420 
480 



977 



WO 03/042661 



PCT/US02/36810 



1 11 21 31 41 51 

1 1 I ! I 1 

GAGCAGCCAA AAGGCCODCG GAGTCGCGCT GGGCCGCCCC GSCGCAGCTG AACCGGGGGC 60 

OGCQCCTGCC AGGCCGAOGG GTC7GGCCCA GCCTGGCGCC AAGGGGTTCG TQCGCTOTQQ 120 

5 AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 

TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 

AAATATCOGC GCAAGATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 

CTCACCACTG AATCTACAGG TGAACTTCTA GATCGATGTG GTTATATCAG TCCTGAATCT 360 

CCAGTTGTAC AACITCATTC TAATTTCA.CT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 

10 GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 4 B0 

AAGGAGCAAT AT AC TAT CAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 54 Q 

TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 6D0 

TATGGAATCA CAATAATT TC AGGCTTGCCT CCAQAAAAAC CTAAAAAITT GAGTTGCATT 660 

GTGAACGAGG GGAAGAAAAT GAGGTGTGAG XGGGATGGTG GAAGGGAAAC ACACTTGGAX3 720 

15 ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 

CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 

GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 90 Q 

CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960 

CrGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 

20 AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 1080 

GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1140 

GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGOAGTGAA 1200 

GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 

. _ ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 

25 CCTTTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CrCTCACAAG ATGGAAATCA 1380 

CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 

TATCTAQCAA CCCTAACAGT AAGAAATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1500 

ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTOTAATGQ ATCTTAAAGC ATTCCCCAAA 1560 

GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 

30 GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1C8D 

ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 

GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 

CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT I860 

GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATQ GATTTATCAG AAATTATACT 1920 

35 ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATG7GGATTC TTCCCACACA I960 

GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040 

ACAGATGAAG GTGGOAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 

GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATXGAC AACTCTTCTG 2160 

Ar . GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG QCCTAATGTT 2220 

40 CCAGATCCTT CAAAGAGTCA TATTGOCCAG TGGTCACCTC ACACTCCTCG AAGGCACAAT 2280 

TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 

ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTOA AATCATTGGA CJCTGTTCAAA 2400 

AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 

TCXTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTOGAGC 2520 

45 ACTGTCCAGT ATTCTAOCGT GGTACACAGT GGCTACAGAC AOCAAGTTCC GTCAGTCCAA 25BO 

GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAQCG GCCAGAAGAT 2640 

CTACAATTAG TAGATCATGT AGAT0GCGGT GATGGTAIIT TCCCCAGGCA ACAGTACTTC 2700 

AAACAGAACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACRTTTTGA AAGGTCAAAG 2760 

CAASTTTCAT CAGTCAATGA GGAAOATTTT GTTAGACTTA AACAGCAGAT TTCAGATCAT 2 820 

50 ATTTCACAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 

GCTTTTGGTC CAGGTACTGA □OGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 

GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 

ATGCCTCAGT GAAOGACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAOTAAAG 3060 
_ CTAAAATGAT TTTATCTGTG AATTC 

55 

SeQ ID BO: 74 Protein oeguenoe 
Protein Acccbsioh fft NP_0Q2 175,1 



60 
65 
70 
75 
80 



1 
1 

MLTtiQTHWQ 
HANYIVWKTK 
ISGLPPRKPK 
SCTVDYSTVY 
KLTWTNPSIK 
CKKEDGKGYW 
GKILDYEVTL 
FQATHPVMDIf 
YLRGNLAESK 
CJLFVDVQKGF 
KDGPSFTETT 
SHIAQWSPHT 
TE13H8SGIGG 
E3TQPI>XJDSE 
NEEDPVRLKQ 
MPKSYIiPQTV 



11 
I 

ALFIFLTTES 
HFTIPKEQYT 
NLSCIVNEGK 
FVNIBVWVEA 

svirxsorNiQ 

SDWSEEASGI 
TRWJCSHLCjHY 
EAFPKDNMLW 
CYlilTVTPVY 

PKFAQGEIEA 



21 
1 

TGETJjDPCOT 
IlKaTASSVT 



ENALGKVTSD 
YKTKDASTW3 
TYBDRPSKAP 
TVKATKKTVN 
VEWTTPRESV 



IIGNETAVNV 
IWPVCXAPI. 
QMYSDGWPTD 



ERPEDLQIjVD 
QISDHISQSC 
RQGGYMPO 



HVDGGDGIIiP 



31 
1 

ISPESPWQIi 
FTDIASUaQ 
BTHIiBTftFTL 
HINFDPVYKV 
QIPPEDTAST 
SFSfYKtDPSH 
I>TNDRYLATL 
KKYTLEHCVT. 
KAYIiXQAPPS 
DSSETBYTX9 
EiTTUiOVLFC 
VSWEIERKD 
QNTSSTVQYS 
aQQYFKQNCS 



41 
I 

HSNFTAVCVL 
LTCITIIjTPGQ 
KSEHATHKFA 
KPNPPHNIjSV 
RSSPTVQDLX 
TQOYRTVQLV 
TVRHLVGKSD 
SDKAPCITDM 
KQPTVRTKKV 
fiLTSDTIi'XMV 
FNKRDIiIKKH 
KKPFPEDUCg 
TWKSGYRHQ 
QEBS8BDX9H 
THGQVERFET 



51 
t 

KEKCMHYFHV 
liEQNVYGm 
DCKAKRDTPT 
1NSEELSSIL 
PFTEYVFRIR 
HKTLPPFEAW 
AAVLTIPACD 

qqedgtvhrt 



HMAAYTDBGG 
IWPWVPDPSK 
liDLFKKEKXN 
VPSVQVPSRS 
PKRSKQVSSV 



Soq ID NQ s 75 DMA sequence 

Nucleic Ax3id Accession nm_022131 

Coding sequences 11, .2878 

1 11 21 31 41 51 

1111(1 

tgctgcgagg atgctgcctg QGCGGCTGTG ctgqgtgcog ctcctgctgg 

978 



60 
120 

lao 
240 
3 00 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 



WO 03/042661 



PCT/US02/36810 



GGGGAGOGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCQ CGGCTAAAGT 120 
CAATAAGCAC AAGCCATGGA TCOAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA XQO 
CACAOTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGQ TTCCTTTTGC 240 
_ AGGGGAAATC TGTGCGTTCA AGATOCATGG CCAGGAGCTG CCCTTTGAGG CTGTGQTGCT 300 
5 CAACAADACA TCAQGAGAGG GOCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
GAAGGAGTAC ACATTCATCA TCCAQGCCTA TGACTGTGGT GCTGGOCCCC ACGAGACAGC 420 
CTGOAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TOCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG GAGGGCAAGA TCTATGACAG 540 
CATTCT3CAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
10 CTATGAAATC GTCACCACAG ATGTGOCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 

CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAQTTTG 780 
CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 
GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTOCAGAT 900 
15 CGTCACAGAG CTGCAGACTA ATTACATIGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 

GAAATCCCTT GAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATXGACCTCT TGCCATCCCC L020 
TAGCGCTGCC ACCAACTGGA CTGCAOGACT GCTGGTGGAC AGCAGTGAGA TGATCTTCAA 1080 
QTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1140 
TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 12 OO 
20 GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 
TGTGCACAAC TGCOGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAG3 CTGACACCTT 1320 
TCGCCCCGOG GAGTTCCACT GGAAGCTGGA TCAQATTTGT QACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGGAQTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 
ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT IS 00 
25 CGGOGCTTGT TOGCAAGGAQ GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560 
AAGCCTGGCC AGTCTCACCA TOCGCCCTGG CAAAATGGAA AGCCAGAAGG TGATCTCCTG 1620 
OCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG QAAAGCCTTG GCCAAGGAAT 1€8D 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 
CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCBGG 18 00 
30 TGTGCGGCGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1860 

TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCOOC GGATCACCCT 1320 
CCGGGGCACA GACCACTTCT GGAGACCXGC TGCCCAGTTT GAAAGTOCCA GGGGAGTGAC 1980 
CCTCTTCCCT GATATCAAGA TTGTGAGCAC CTTCBCCAAA ACCGAAGCCC COGGGGAOGT 204D 
GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 
35 TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAAGGA 2160 
CAGTGAGCTC CACCAACGAC ACCTGGATGC CACXAATTCT ACTGCAGGCT ACl'CCATCTA 2220 
CGGTQTQGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT AOCGCAACTG 2280 
GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 
CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAOA 2400 
40 TAAGGAGCAT GTCAATCATC TQATTGTGCA GCCTCCCTTC CTCCAGTCTG TOCATCATCC 2460 
TGAGTCCCQG AGTAGCATCC AGCACAGTTC AGTGGTOGCA AGCATTGCCA CAGTGGTCAT 2520 
CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC QGGTCDGGAT 2580 
OGCCCACCAG CACTTCATCC AGGAGACTGA GGCTGCCAAG GAATCTGAGA TGGACTGGGA 2640 
CGATTCTGCG CTGACTATCA CAQTCAACCC CATGGAGAAA CATGAAOGAC CAGGGCATGG 2700 
45 GGAAGATGAG ACTBAGGGAG AAGAGGAGGA AGAAGCOGAG GAAGAAATGA GCTCCAGCAG 2760 
TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAQACATGG 2820 
GCAGAATGGA GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCGACCCTCC CCTACTAGTG 2B80 
CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTAXGCC 2940 
CATDTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 300O 
50 TCCTGGAGCC CACCCTTTAA GCCTTGOGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 
AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTXSTOCT GTAGCCTCCA 3120 
CTTCTGCCCT AACTTOOCCA GCATCCTGAG TACCTGTCTG CAGAGTTTGC CTTTGIITTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACGTO OCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
55 CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360 
TGCTTAACAG TTTAAAGGAA AGTGCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCXAGXT CCCCAGCTTG GAGAGCCTTT 3480 
CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
^ TCCTTGGCCA CAAGCAGGGT CTGATCCCCC ATCAGAGCTA TCTGAGCCTG OCTGTCTGGG 3600 

60 CAGCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AOCAAACAGA ACCACAGGGC 3660 
CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GGAlCAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGDCAGA 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
65 TAIQOTOCCT TATCTOCTAT CTTCOCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG 3960 
AAAGGCACGA GCTCCACCAA GTCTACAATG AAAGT1TGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATXT GCAATCATTG CAGCTTCTTC TTTCTTCroC TCATAAAAGG AGGAACACTT 4080 
TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 
XTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAGAA TTQTTTGTAT AGTAACTGGT 4200 
70 CTGTGATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260 
ACGAAGCTCT QAGAACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAMGTGTA 4320 

tgctctgtcc ccatccttca ctcctcctca agctcacacc aattggtttg gcacaggcac 43 bo 
agagctggtc cctagttaag tggcatttat gttaaaaaaa a 

75 Seq ID NO: 76 Protein sequence 
Protein Accession #s KP_071414 

1 11 21 31 41 51 

1 I I I I I 

MLPGHLCWVP LiDLALGVGSG SGGGGDSRQR RLIAAKVNKH KPWIETSYKG V1TENNDTV1 
LDPPLVALDK DAPVPPAGEI CAFKZHGQEI* PFEA.WXWKT SGEGRLKAKS PIDCKLQKEY 
TFIXQAYDCG AGPHBXAWKK SBKAVVHIGV KUVMEFAPTF KKPAYSCAWT EC3KIYDSILQ 
VEAIDEDCBP QYSQICKTYEr VTTDVPFAID RNOilKNTKK LSYDKQHQYE ILVTAYPCGQ 
KPAAQDTIjVQ VDVKPVCKPG WQDWTKRIBY QPGSGSMPIiP PSIHLETCDG AVSSLQIVTE 

979 



80 
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120 
180 
240 
300 



WO 03/042661 
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LQTNYIGKGC 
RQGAK1PDGI 
CRLVPLBRKD 
TNDWPIHPSH 
CKEGLDXNSL 
LXVSSKVQCF 
DIXIVSTFAK 
HQRHI»DATNS 
NEFMIaEVSIIi 
VCMLVFWAM 



DRETYSEKSL 
VPKNI/TDQFT 
FDQADTFRPA 
IAMQL.TVGAC 
ESLGQGIKYH 
GEDVCI9IPE 
TKAPGDVKTT 
TAGYSIYGVG 
HEDQVSDKEH 
GVYRVRIAHQ 



QKLCGASSGI 
ITNWMJOIGP8 
EFHWKLDQIC 
WQGGEVTKPQ 
FNPSQ3 ILVH 
VDAYVMVLQA 
DPKSEVIiES* 
SMSRYBQVTjH 
VNHLTVQPPP 
HFIQETEAAK 



IDLI*FSPSAA 
POVRAEKETI 
DKEWHYYVIN 
FAQFFHG3LA 
EGDDIQJINR 
IEPRITL.RGT 
IiHMIJDFCDIIj 
HIRYRNWRPA 
LQSVHHPESR 



TNWTAGLL.VD 
LCNSDKTEKK 
VEPPWTLYM 
SLTIRPGKKE 
ALQKVSYINS 
DKFWRPAAQF 
VIGGDLDPRQ 
SfcEARRFRIX 
SSIQHSSWP 
LTITVNPMEK 
ARQAQLEWDD 



SSEMIFKFDQ 
RHHYALYVHN 
DGATYEPYXiV 
SQKVISCLQA 
RQFPTA3VRR 
ESARGVTLFP 
ECLELNHSEI, 
CSEUJGRYrS 

siAiwuie 



360 
4.2 0 
4.B0 
540 
600 
660 
720 
7B0 
840 
900 



STIiPY 



Seq id NOs 77 dna sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 482 _ .3007 

1 IX 2a 31 41 51 

I I I I I I 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 
TACAACCTAA GAAAGCAAGA GTGATCAAXC TCAGCTCTGT TAAACATCTT GTTTACTTAC 
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTQAAQT 
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 
CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTSACCA GAGGTTTTAA 
CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 
ATTACAGATA AOGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GOCTGTATAT CTITACACTC 
CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 
AGATGGCACA A1GCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 
CACAAATGAC TTtTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 
TGCAOATATT GABATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATGAA 
TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 
CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AQTGCCTTTA GCAAGCTCAA 
CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 
CCGATTTGTT CCTTTAACCC ATCTAQATCT TCSJTGGAAAT CAATTACAAA CATTOCCTTA 
TGT3X3GTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 
GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATQC CTCCACABTC 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGPAG 
ACTAAAGAAG GAATC TATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 
CACGTOCATT CTAAAACTAC CCACCAAAQC ACCAGGTTTG ATACCTTATA TTACAAAGCC 
ATCCACTCAA OTTCCAGGAC ClTAOTGCCC TAl'TCCTTGT AACTGCAAAG TCCTATCCOC 
ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 
TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGOAAAT ATTATTCRCA GTTTAATGAA 
GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 
AGTTCTTGAA GAABGATGGT TTATOAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 
TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 
ATATCTTGAA TACAATGGCA TTAAGGAAAT ACTGCCAGGA ACCXTTAATC CAATGCCTAA 
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCO^AGTT TTACCACCAC ATATTTTTTC 
AGGGGTTCCI CTAACTAAGG TAAATCITAA AACAAACCAG TTTACCCATG TACCTGTAAG 
TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTQAC CTTGAGQATA ACCCCTGGGA 
CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 
GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TQAAAGCCCT 
AAATAGTGAA ATTCTCT GTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 
TCTTACGGAC GCTGTGCCAG TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 
TATTGTTTTC TGTGCTGCAQ GGATAOTOGT TCTTGTTCTT CACGGCAGGA GAAGATACAA 
AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 
GTATGGCCAT aaaaccactc ATCACACTAC TGAAAG ACCC TCTGCCTCAC TC£ atgaaca 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 
GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 
TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATQAAAT ACAAAACCAC 
GAAOCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACAXTTT 
AGAAAAAGAA AGGGAACTTC AGCAACTGGQ AATCACAOAA TACCTAAGGA AAAACATTGC 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 
GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 
TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 
AACATAGATG GAOAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAjSTCCAT 
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATG AGAACCTAAG CACAGCAGTA 
AACXATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 

Seq ID WO; 7B Protein sequence 
Protein Accession #$ Boa sequence 

1 11 21 31 41 51 

I 1 1 I I I 

MKLWIHLFYS SLIiACISLHS QTPVL3SRGS CDSLQICEEK DGTMLMCEA KGIKMVSEIS 60 
VPPSRPFQLS IiLNNGLTMLH TNDFSGIxTMA ISIHLGFNNI ADIEIGAFNG 120 
HNSLEILKED TFHGLBEJLEF LQADHKFITV IEP8AFSKLN HUCVLIIiNDN AIE6LPPNIF 180 
RFVPr.THIJ3L RGMOXO/TLFY VGFLEHIGRI LDLQLEONKW ACNCDLLQLK TWliENMPPQS 240 
IIGDWCNSP PFFKGSlIi3H IJCKBSICPTP PVYEEHEDP3 GSliHLAAT&S INDSRMSTKT 300 
TSrUOiPTKA PGLIPYITKP STOtPOPYCP IPCNCKVLSP SGLLIHOQER NIESLSDU5P 360 



60 
120 
180 
240 
BOO 
360 
42 0 
4B0 
540 
600 
660 
720 
780 
B4D 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



980 



WO 03/042661 



PCTAJS02/36810 



PPQNPRKLIb AGNIIRSLMK SDLVEYPTLE MLHDGNNHIS VLEEGSFMNL TMiQKLYLNG 420 

NKLTKLSKGrt FLGLHNLEYL, YLEYKAIKEI LPGTFNPMPK LKVLYLWNNI, LQVLPPHIFS 4 BO 

GVPLTKYNLK TMQFTHLPVS NILDDLJDI.LT QIDBEDNPWD CSCDLVGLQQ V? 1 QKLSKNTV 54 0 

TDDILCTSPG HLDKKELKM. NSEILCPGLV NNPSMPTQTS YliMVTTPATT THTADT1LRS 6D0 

5 LTDAVPLSVL I LGLLIMFIT LVFCAAGIW I>VLHRRRRYK KKQVDEQMRD HSPVHE^QYSM 660 

YGHKTTHHTT ERPSASLYEQ HMVSPKVHVY RSPSFGPXHL EfiEEERNEKE GSDAKEJDQHS 720 

LiEQENHSPL TGSNMKYKTT NQSTEPLSFQ DASSLYRNIL EKERELQQUG ITEYLRKNIA 760 

OJjQPDMEAKV PGAHSELKLM ETLMYSRPRK VIiVEQTKKEY FELKRNliHAE POYIiEVLEQQ 840 

10 T 

Seq ID NOi 79 DPTA sequence 

Nucleic Acid Accession NMJ316640-2 

Coding sequence; 39, . 1359 

15 1 11 21 31 41 51 

1 1 1 I 1 I 

GCTTAAGTTQ ACCTCTGGGT CCGGAATCGC GGGCAAAGAT GGCGGCGGCC AGGTGTTGGA 6D 

GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 

CTACAGAAAC GACCTCCCAA GAOGTCGCGG CGACCCCGGT CGCGCGGTAC CCGCCGATTG 180 

20 TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGOG GCGGATCOAG CGCTGGCAGG 240 

OGAOGGTGCA CGCTGCGGAG TGGQTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 

TTATGAAGTA CATGGTTTAC CCGCAGACCT TCGOQCTGAA TGCCGACCGC TGGrACCAGT 360 

ACTTCACCAA GACCGTGTTC CTGTGGGGTC TGCCGCCGOC CCCAGCGGAG CCCGAGCCCG 420 

AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 

25 ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GOGCGTQCAC COTTACQAGG S40 

AGAGCGAGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GlCAACCCTC GTGGGCCTCC €00 

TCAGGCCACA CAACCCGGCC CTGGCCGCTG COGCCCTCGA TTATAOATGC CCAGTTCATT 660 

TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTQATG 720 

ACrrGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCQAATATCC AAGCAACTOG 780 

30 CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCOCACTATA AAATGTAAAC 840 

CAGACAAACT TCCATTATTC AAAOGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 500 

CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960 

GGGAAAGGCr TTTGAGACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1020 

n CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTQGAGTG 10 BO 

35 AAGCAGATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTTTT 1140 

CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 

CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260 

ATGATOTGAA AGOTTTTAAT GATGAIGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 

GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 13 80 

40 TGTGQOAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAAAAA AAAAAAAAAA AA 



45 



50 



55 



Seq ID MO; 80 Protein sequence 
Protein Accesaicm WF_057724.1 

1 11 21 31 41 51 

I I 1 I I 1 

MAAARCHRPL LRGPRLSLBT AANAAATATE TTSQDVAATP VARY PP IV AS MTADSKAARL 60 
RRIERWQATV HAAE8VDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 120 
PPAEPEPEPE PEPEPAUJLA AIiRAVACDCL LQEHFYLRRR RRVHRYEESB VISIxPFI^QL 180 
VSTLrVGLLSP HNPALAAAAL DYRCPVHFY* VHGKEIIPRG HRRGRIDDLR YQIDDKPNNQ 240 
IRlSKQIiAEF VPU>YSVP1B IPTlKCKPDK LPLFKRQYEN HIPVGSKTAD PCCYGHTQFH 300 
l»LPDKXiRItEH MiRQNCADQI EWFRKHAIA SLFAHTGAQA MYGGFHSEAD VTRPFVSGAV 360 
ITDGKYFSFF CYQLNTLAI.T TQADQNNPRK NICWGTQSKP 1.YETIEDWDV XGFHDDVL-IjQ 420 
IVHFLLNRPK EEKSQliLEN 



60 



65 



70 



75 



80 



Seq ID jags ex EMft sequence 
Kucleic Acid Accession #: 
Coding sequence: 1..207D 



1 11 21 

! I I 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG 
CGGCGCCGGC GCCAGCGCCC CXCTOTGGGC 
CTCAGGCAGA GCGACCCGCA GAAACGGAAC 
CAGCAGCAGC ACTCGGAGAT GCFGGCCAAG 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT 
ACACTGCCGC TCCCGCAGCA OASAAACACA 
GGGGGAACAC AQOACGGGGA GCXXTCTCCAG 
CCTGTATGCC AACCCAGTGG GTACAGGTTC 
AGCCQTGQCT GGACGATGTT ATGCAGCCAA 
GGGCCTGAGG TCATTGCAOG GCGGCAGGTG 
CCAAGTAGAG CTGAAATGGG AAGGAACCCC 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG 
ATGCTGGGQQ CCCAGGGGAT ATGGAjGACAC 
GCAGCAACCA TGGGGACAAA GGGAGGAAGC 
GCACTTCCXX: ATCCTQACAG CGQCCCCCAC 
GCTCACTTCC CATTATCTTT GGGGCTGGGG 
1X3GAGCC3U3C CTGGGAACAT CGCAGCXGGG 
GACATGGAGA AGGGGGTTGA GGGAGGGCCC 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG 
GACAGGACAC GGGAAGAGGC CATGCTTTGC 
CCCTCCTGCT TTCCAGAK3G CCCCTCAOGA 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG 



predicted 



31 
I 

CGGCCCCCCA 
QTOCAQTCCT 
CTGGACCTGG 
CTCCATGAGG 
AGGCCGGCCC 
GCCATCAACT 
ACTffTOCTTG 
TGGGGGACCT 
GCACAGCACG 
GCCACAGGGT 



ATTTCCAGCC 
TGCATCCAGG 
AGAGTCCTQT 
CCAGCCCAGG 
CTOACATCAG 
GCAGTGCCTA 
TTCCCTAGCC 
CAGCCCCAGC 
CTCGGGACCT 
AACCACCTTT 
TGGGTAGAGC 



41 
1 

GCTCGCCGAC 
TGAGGCOGCA 
AGAAAAGCCT 
AGATCGAGCA 
TGCCTCCCCA 
CCAGCACAOG 
CCCACCTGGC 
GGACAGATGC 
TGCTGCTCTC 
GCTCGCCAGA 
CCTGCCCTGC 
CTATGGCTCT 
GATCCCTTCC 
TTCCTTGCCA 
ATCCTGGGCT 
GAGGACATCT 
GGGCTCTCCC 
GCTGTGGCAA 
CCTGCA.GTGC 
GCTGTTCCAT 
CCAGGGCCTC 
CGGGAGGACC 




GCAGTTCCTG 
TCTGAAGCGG 
GGCACACTCA 
CCTGGGCTCA 



CGCTACCTCT 
GGGAAGCCCA 
CCTCCCTCCT 
TAGATCTTTG 
GAGTCCTCAC 
TGCCATCTGG 
CTTGTCCAAG 
GTGGTCTCAA 
GACTGGTGGA 
TTCCCAOGGA 
CTCCAGTGAG 
TGQGQACGCr 
GTGTCCCAAG 
TGCTCCCTTG 
CAGCCCTGCC 



60 
120 
180 
240 
3D0 
360 
420 
480 
540 
600 
66O 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



981 



WO 03/042661 



PCT/US02/36810 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



AGGCTGAAGG 

TCTGTCAAGT 
TCCTTCAACA 
CCCCTACTTC 
GAGAAAGCAG 
AGGCAGATGG 
ACCACACTTA 
ACCCAAGAGC 
CCGGAGGAAG 
AAGAGCCTCT 
CTGAAGCAGA 
AAACGGCGCC 



AGGGCTCCTC 
CCGACACTGT 
CCATCTCTAA 
AGCAAGATTC 
ACRACAGCRA 
AGGCCTCTAA 
GGGCGGGGGC 
GGCAGTGCGA 
TGCGGCACCT 
CTAGCTTTCC 
CCAAGAAATG 
CCCCGAAGAA 
TGCATCGCTC 



ACGGACACAC 
GOGCTCTCCT 
TTCAGCCAAC 
AAAAGCTGAC 
GCTGGACAAA 
TGCAGGAGCT 
ACACCCCCCA 
AGTGCTCATC 
CAAGTCCCTC 
CAGGGACCAA 
CCTGAGCCCA 
CAACTTTGCC 
AGTGCTTTGA 



AGGCCAGGAG 
GCAGACAGCC 
TCTCAAGGCA 
GTCTCCCAGA 
GTTCCTQGGG 
GCCTGTATGG 
ATGATCCTGC 
CGCGAGCTGT 
CTGGAAGGGA 
GAAGCCACGC 
CCTGTGGCGG 
GAGAGGCAGA 



GCAAGCGTGG 
TCTCCATGTC 
AGGCCAGGCC 
AGGGGGACCT 
TACAAGGGCA 
GGAACAGCCA 
CCCTTCCCCT 
GGAATACCAA 
GCCAGAGGCC 
ATTTCCCCAA 
AGCGTGCCAT 
AGAGGCTGCA 



GCGTCTTGCG 
AAGCTTCCAQ 
CCAGCCCGGC 
GGAAGAGGAG 
GGCCAGAAAG 
GCACCAGGGC 
GCGAAAGCCC 
CCTCCTGCAG 
CCAGGCAGCC 
GGTCTCCACC 
CCTGCCCGCA 
GGCAATGCAG 



Seq ID NO: 82 Protein sequence 
Protein Accession #i FGENESH predicted 



1 
I 

MSGAGVAAGT 
QQQBSEHLAK 
GGTQDGBPLQ 
GPBVIAGRQV 
MLGAQGrWTH 
AHFP&SUGLG 
LFWAKCGPSR 
GARWVCINGV 
SVKSISNSAN 
HKAEASNAGA. 
TOELRHLKSIi 
XiKQTPKNNPA 



11 

I 

RPPSSPTPGS 
UiEETEHLKR 
TVltAHLAAIiA 
ATGCSHJLPP 
SIQGSLPAIW 
L»TSQGatLiTGG 
QPQPCSAGDA 
WVEFOGPSPA 
8QGKARPQPG 
ACMGKSQHQG 



21 
1 

EHRRQRPSVG 
EKTKG E PARGP 
PVCQPSGYBF 



31 
I 

VQSLRPQSPQ 
RPALPPQAH8 
WGTHTDAATS 
HD8PCPAR8L 



AATMGTKGGS 
WSQPGNIAAG 
DRTREEAMLS 
RDKEGSSRTH 



ERQKRLQAMO 



RQMGAGAHPP 
FEEASFPRDQ 
KRRLHRSVli 



AVPRALPSQG 
IiGTCCSMCPK 
RPGGKRGRIA 
VfiQKADLKKS 
MTLPLPLRKP 
EATHFPKVST 



41 
I 

LRQSDPQKRN 
TLPLPQHRNT 
SRGWTMLCSQ 
PQIAAVARPR 
ALPHPDSGPK 
DMEKGVBGGP 
PSCFPDGPSG 
GG3ADTVRSP 
PLLHNSKLDK 
TTLRQCEVLI 
KSL8KXCI»SP 



Seq id NO: 83 DMA sequence 

Nucleic Acid Accession NM_005264.1 

Coding sequence : 557 * , 1954 



1360 
1440 

1500 
1560 
1620 
16B0 
1740 
1800 
I860 
1920 
1980 
2040 



51 
I 

EDLEKSXjQFL 
AINSSTRLGS 
AQHVLLSGSP 
J3SPMAX.8PH 
PAQDPGLWSQ 
PPSRCGMSSE 
NHLBRASAPL 
ADS1.SMSSFQ 
VPGVQGQARK 
KELKNTHfLLQ 
FVAERAIIiPA 



60 
120 
180 
240 
300 
360 
42Q 
480 
540 
600 
660 



1 11 21 31 41 51 

I I I 1 I I 

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCX3CGCC GCTTCCAATA 
ACCACTAACA TCCCTAACGA GCATCCGAGC GGAGGGCTCT GCTCGGAAAT CGTCCTGGCC 
CAACTCGGCC CTTC5GAGCTC TCGAAGATTA COGCRTCTAT TTTTTTTTTC TTTTTTTTCT 
TTTCCTAGCG CAQATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 
CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 
TTGAGTCCAG GTTGGGTCGG ACCTQAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 
ATCCCGGAGC TGAGTCGCCG GCGGCGGTGG CTGCTGOCAG ACCCGGAGTT TCCTCTTTCA 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA □CAGCACAGC TGTCGGGGGA TCGCTGCAOG 
CTGAGCTCCC TCGGCAAGAC ccagcggggg ctcgggattt TTTTGGGGGG GGQGGGACCA 
GCCCCGOGCC GGCACCATGT TCCTQGCBAC CCTGTACTTC GCGCTGOCGC TCTTGGACTT 
GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAQ CCAGTGATCA 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGOGGG 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGQATGAGT GCCGCAGCGC 
CATGGAGGCC CTGAAGCAGA AGTQGCTCTA CAACTGCOGC TGCAAGCQGG GTATGAAGAA 
GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCOGGGTGGT 
CCCATTCATA TCAGATGTTT TTCAGCAAQT OGAGCACATT CCCAAAGGGA ACAACTGCCT 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CnOCOTACAT 
CACCCOGTGC ACCACCAGCG TGTCCAAOGA TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 
CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAGACC ATCGTGCCTG TGTGCTCCTA 
TGAAGAGAGG GAGAAGCCCA ACUGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 
CTGCAGATCT CGCCTTGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 
AGTCATOACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGOCCCAT GGTGTGACTG 
CAGCAACAGT GGGAACGAOC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCOTGTGGCA 
GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 
CAAGCCCCTG GGGCC3WSCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCQTG 
TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATOTOTCG GGCAA1ACAC ACCTCTGTAT 
TTCCAATGGT AATTATGAAA. AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC 
AATGaCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTG9TGG TAACCGCTCT 
GTCCACCCTA TTATCTTTAA CAOAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 
ATBTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC TGAAATTCCA 
GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 
AAGAAAOCTT CTTGTGATCC TTOGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 
AAACTCAGAA GGCTTTGGGA TATG CXGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTQT 
AAAGCAAACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATQATCATGA. TGATTTTAAC 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 
TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 
CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCIT TGTGGAAACA GCTGGTACTG 
ATGITCACCT TXATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 
TTCTGCACTC TTOTACAAAA GAAAAAACCA CCCGGAATTC 

Seq U3 NO: 84 Protein gggnencg 
Protein Accession #i bm_00S264.1 

982 



60 
120 

ieo 

240 
300 
360 
420 
480 
540 
600 
660 
j 720 
780 
640 
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1020 
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WO 03/042661 



i 
1 

M FlATTi Y FALi 

YEPVMSRLSD 
SV3NTWCWRR 
PNCLWLQDSC 
YIDSSSLSVA 
Q TTTAT TTTA 



11 21 31 41 51 

I t 1 I I 

PLUDUjIjSAE VSGGDRJjDCV KASDQCIiKEQ SCSTKYRTUfc QCVAGKETW SO 

ECRSAMEA3uK QKSLYNCRCK RGMKKEKKFCL RIYWSMYQSL QGNDLLEDSP 120 

IFRWPFISD VFOQVEHIPK GNNCLDAAKA CWDDDICKKY KSAYITPCTT 180 

KCHJCALRQFF DKVPAKHSYG MLFCSCRDXA CTERRRQTTV PVCSYEEREK 240 

KTNYICRSRI* ADFFTNCQPE SRSVSSCLKB NYADCLLAYS GLIGXVMTPN 30 0 

PWCDCSNSGN DLEECLKFLM FFKDMTCLKK AIQAPGNGSD VTVWQPAFPV 360 

LKVKNKPLGP AGSENEIPTK VLPPCAKTLQA QKJoXSNVSGN TffbCISNQNY 420 
ITTKSMAAPP SCCLSPIiLVL WTRLSTU^S IjTETS 



Seq ID NO? 85 DMA aequence 

Nucleic Acid Accession «i XMJ)27172-1 

Coding sequence: 143- .1405 

1 11 21 31 41 51 

1 I i I 1 I 

GGTGGAGACA OCGCCTCAGG GCTCQGTGCA CAGTGGACAT TTGGGGAQCG TTGTGGGTGA 60 

CCCGCACACA GGCACTGOQA ATGCAGGGGA GAGGGGGCCA AGGOQQAAAG GGGCCAGAGT 120 

GTTGG C TTTG GATTCAGGAG GGATGGATTC CAGTCCTAGC TTGCCACTTA TTAQGACXCC 180 

TGAGAGCAGC CTCCATGAGG CCCTGGACCA GTGCATGACC GCCCTGGACC TCTTCCTCAC 240 

CAACCAGTTC TCAGAAGCAC TCAGCTACCT CAAGCCCAGA ACCAAGGAAA GCATGTACCA 300 

CTCACTGACA TATGCCACCA TCCTGGAGAT GCAGGCCATG ATGACCtTTG ACCCTCAGGA 3 60 

CATCCTGCTT GCCGGCAACA TGATGAAGGA GGCACAGATG CTGTGTCAGA GGCACCGGAG 420 

GAAGTCTTCX GTAACAGATT CCTTCAGCAG CCTGGTGAAC CGCCCCACGC TGGGCCAATT 480 

CACTGAAGAG GAAATCCACQ CTGAGGTCTG CTATGCAGAG XGCCTGCTGC AGCGAGCAGC 540 

CCTBACCTTC CTGCAGGGTT OCTCACADQG AGGGGCAGTC AGGCCCAGAG CCTTGCATGA 600 

TCCCTCTCAC GCCTGCAGCT GCCCACCTGG GCCAGGCCGT CAGCATCTCT TCCTCCTGCA 660 

GGACGAGAAC ATGGTGAGCT TCATCAAAGG CGGCATCAAA GTTCGAAACA GCTAOCAGAC 720 

CTACAAGGAG CTGGACAGCC TTGTTCAGTC CTCACAATAC TGCAAGGGTG AGAAOCACCC 780 

GCACTTTGAA GGAGGAGTGA AGCXTGGTGT AGGGGGCTTC AACCTGACAC TGTCCATGCT 840 

TCCTACTAGG ATCCTGAGQC TGTTGGAGTT TGTGGGGTTT TCAGGAAACA AGGACTATGG 900 

GCTGCTGCAG CTGGAGGAGG GAGCGTCAGG GCACAGCTTC OGCTCTGTGC TCTGTGTCAT 960 

GCTCCTGCTG TOCTACCACA CCfTCCTCAC CTTOGTGCTC GGTACTGGGA ACGTCAACAT 1020 

CGAGGAGGCC GAGAAGCTCT TGAAGCCCTA CCTGAACCGG TACCCTAAGG GTGOCAXCTX 1080 

CCTGTTCT TT GCAGGGAGGA TTGAAGTCAT TAAAGGCAAC ATXGATGCAG TGAGTGATGG 1140 

GGGTCCGGGC CGGGGCTGGG GATCCCTCGG GGXCTCCCAG ACCAGCAGGA AQTCRGGCAC 1200 

ATGTGACATA CTCAGGGACA GGATAGACTG GGGGCGGOGG GGGGGCCAAG AGAGAACCAA 1260 

CCAGAGAGCA GGGGCAGGAG AGGCCCTTCT GGCAGAGCAG CCTGGGAAGA CAAGGGAGGA 1320 

GGAGGCATTT OTGGTGCCTG GGATTTTGAC TGGGAGATAT AGGACTGCAG CATTGCAGTG 1380 

GAGGGAGGTG GAGGGAGGTG CTTGAGGGAG GCAGAGGTTA GGAAAGCCCA TCTGTTTAGG 1440 

GCATGAC5GAT TAGGCTQGAG TClGGTAOCT CCCCTCCATT ATAGCICXCX CCTGCTCTTT 1500 

CATTTIGTTA CTAAAAACCA GAGTCCTAGG CGGGGGCTGT ATTTGAGCCC AAGBTCATGX 1560 

AAGACTTAGG AGGTAAAACC AGGACTGGAG GCCAGATCTC CTGGCTCCTG GGGCCCCACC 1620 

TGAQCCTAGG ACAGGGCTGG ACCACTATGC CCTGGAGGAG TCCCGGTCTG CTGTGGTGTT 1680 

GGGAGGTTOG GAGGATGCAG AGGGGTTGGG GCTGGGTGGG CAOCCGTCAG GCTGACCAGA 1740 

AGGTGCCTGC AGGCCATCOG GCGTTTCSAG GAOTGCTGTG AGGCCCAGCA GCACTGGAAG 1800 

CAGTTCCAOC ACATGTGCTA CTGGGAGCTG ATGTGGTGCT TCACCTACAA GGGOCAGTGG 1860 

AAGAXGXCCT ACTTCTACGC CGACCTGCTC AGCAAGGAGA ACTGCTGGTC CAAGGTOGGC 1920 

TGATGCCAOG TGTTAGGGGC ATTGGGTGAC CAGGGCTGAC TGTGTGCCTC CAGACCAOGG 1980 

GCCAAATCCC TAACTGAACA CAGATGTCTC AGCTGGAATC TAAACATAAC CTTAAATTCT 2040 

AATAGOACTC AGGCTTGGAA GGAACTAAAG ACCACAAGAG AAACTTCTGA AOCACAATGT 2100 

GTCACAAAGA GATTTCTAGT CACAAAGGAC AGAAACATGG CTCCCTCTGT CCAGTAGAAC 2160 

TGTTTCTGTG GTAGAAATGT TCTACTTATG CACTGGCGAA ATATGGTAGT CACCAGCCAC 2220 

ATGTGCACAA TOAGCACATG AAATGTGCCT AGTGCAACTG GGAAACTGAT TGTTTTCAGT 2280 

TTTATTTAAT TTTAATTAAC TAAATGTTAA ATTTAAATAG CCATGTAGGG CITGTGGCCA 2340 

CTATATTGGA CTATGCAGGT CCAAAACACA AAAGGCTCAT ATAACTGAAC ATTCTTGGCA 2400 

CATCCGACtT CAGGTAGGGC TGGATOCA3G AATTCAAA3TG ATGTCGTCTG GCTTGGTCXT 2460 

TCCATTTXJTG GCTGTGCTCT CTCCTATGAC ATCTTTGTTT CTGCTGCATC TTXGCAGGAA 2520 

GGTTCTCTCC ATGTGACAGG CAAGGTGGCC ACGGGCTGCT TCTACTCATA TCCTOCCCTT 2S80 

GGTTTCAACC .AGAGXCCCAC AGTTTGAGCC TCACTGGTCT GACTTGTCAC CTGCCAATGC 2640 

CTGGAACAGA GGGGTGGGAA QATTCCTCAT GGGCTGAGAG TAGGAGAGGG GTGGTTCGAC 2700 

AGAAGAAAAT GATGCACCCA GAAATAGTGG AGGQATTAAC AAGATGCCAT ACAGGCAAAA 2760 

CAAAGCCAAA CAGATGCCCQ GCTACCAAAG ATGAAATTTA TCATGGTAAG TATTGAAATA 2820 

AGTGTTAGCT TGXACCATCA TAGTAATGAT AGTGCAGAAA TTGGAACCAA GAGTCTTACA 2880 

ACCACXTTAGC TCAGCAAACG TCTAATCIGT TATTTGTAAA TACACAGGAC ATGTGTCTXC 2940 

ATGGCTTCAT OGCCTGCCCA TAGCAGACAT TOCTAATCAA TCCTCTGCCA TGAGCCTAGC 3000 

TGTGACCTTA GATACCTTCC CTGCAAAGCT CCAGGCAGCC GTTAGAACTG ACCCACATTG 3060 

GCACCTGTTA GOAGACCTGT CATCTGCATC TGGTCCCTCC CCCrCCTCTG CATGGAGGTC 3120 

TCFTCCGTAG CCCTGOCTGG GGACCAAGCT GAAGGGGAAT TCTCTGGGTC TGGGGCAGGT 3180 

GGGGCTGGGG AAGGGAGCAG GCTTCTACTG AGCTCTCAAC ATGTCTGGCA CCTGTCATGT 3240 

CCTCCACACC ACACCCAGTG AGGAGAAGTT ATTATCTCCA 1TGGGCACAT GAGGGCCACT 3300 

GATGCTAAGC GAGGACTGTG ACTTTCCCAC CCACATGGCT GTTSGTGACA GACACOGGAT 3360 

TTGAGGACAG TCCTGTGTGA CTCAOAGGCC TCTGCTTCXC TCTTGCTCCT AGAGTCCICC 3420 

TGAGGAGTCG GGGCTTGCCC TGAGCCCACC CTGCTGTTGA AGGTGCTTCC TCAGGCCCAG 3480 

CTC50CATGGC COCCACACCC CCTCCTCATC ACCTCCTACT CCCAAAAAGG ACAAAGCCTC 3540 

AGGGAAGCTT TTTTCTTTTT TTAGAGACAG GGTCTTGCTA TGTTGGTCAG GCTAGTCTTG 3600 

AACTCCTGGG CCCAAGCAAT CTTCCCAOCT CTGCCTCCTA AAGTGCTGGG ATTCTGGGCC 3660 

TTAGGGAACC TXTCXTGAACT GAAAGTGACT CTCGAAGCCT TCCTGTAGAG TGAGGTGGGT 3720 

GGCTGGGGCT AACCATATGG GGAAGGAGAG ACCTGGTGGG GGCACACAGC TGCTATATAG 3780 

AGGAACAGAA GGTGGGCCAG GCCXOCCAGC TGTCAGGAAA TCTGGCTCAG TCCGCAGACC 3840 

GCTGTGTGGC TTTGGGGTGG OCAGTCCCTC TCTCTCTGCA ACACTCTCCT CATGAGGACT 3900 

TCTGTGAAAA ATQGGGGTTG TAACCCTCTG AGTTCTGGGG CCCCAOCCAT TXCTAATCTC 3960 
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CAGAACTTGG CCACCACAGA CTCCACCAAC TTCTAGTCCT GGGGCCTGQG CCTCTGGCCA 4020 

TTGCCATAGG CACCACCTGC TCTGTGCAGG CAGCGCCCCC CTCTGCCAGG ATCCTCCGAG 4080 

GTCAGCTGCT GGGTCTGACC CGCAGACCCT GGCTGAGCGA CGGATGAACG GAGTATGCAG 4140 

ACACAGGCTT GCCTGTCAGC AGATGGGGGA CCCCTGCCAG AGTCAGCAGC GGCCCCCATA 4200 

AGGCTGCCAC GCTTGCATTT ATTTAGTACA GATGTAATGA CAAAGGCCTA AAGCAAACTC 4260 

CATTTGTGGG TAATTAACAT TGTCGCCCCC CCAGAAAGAG CAGTOCTCCG CATGATGATT 4320 

AAAGGCCAGG TTCCGAGGCC TAAGTAAACC AACTTATCTA GATCAATTCC CTTACTTCTT 43 BQ 

GTXATCXACT CTGAGAGAAT TCAGCTGCCT TCAGCCAAAT CCTTTCCCGA AGCTTTTGCA 4440 

AAACCTCOGA GCOTTOCAAG GTTTGCTTCT TTCTGTAATT TTTCTCACCA CCCTGACCTA 4500 

TCTCCTGCAG TCAGCCCTGT GGAGGCCTTT GTGTTTCCCC CAaTGCTGGC AGCCTAGAOG 4 560 

CTGAGATGGC CAGAAACAAG GTGGTGACAG TGGCGTGCTC AGGGCTTGGG AAACCCAAGG 4620 

AGCTAAAGGC ATGCCCAGGC AACCAAAOAG GACAGGAAGG CTTCTGAGGA GAGACCTCTG 46B0 

AGGTGGGTCT TGGAGAGGAA GGACTTAGGG AGGCAGAGTG GAGGAAGTGA GAGGACACCC 4740 

CAAGCCAAGA GGGCGGCAGG ACCAAAGGCT CAGAAGCCAG GGCGCTGCAG AGGGGCTGTG 4800 

TGCCACAGGG TGAAOAGTTT GTGTGGCAGA AGGGCAGGGG GCTTGCATCA GGGGTGACAG 4860 

CTGCTCTTTT GTCCCAGCAT AGCCCCTQTA CATCCCTGGA GAGCTGGGGC GTCCACAACT 4920 

CTAAGTCACA GCCCOCATCC TAACCCTGGT GGTGCAGTGA GGGTGAGCTG TCTGTGGGCA 4960 

GQAGGGAAGA CTCTTGGAGA TGAGCCTGGT GAAGGGATAA TGGCATCCCG GGCCGAGGAG 5040 

CAGCACAGGC AGAGQCCTGG GGAGAGTTTA AGGAGTQTAG GGGAGGAAAT GGCAGAAGAT 5100 

GAGCCAGAAA AAGAAAGGTT AGGGCAGGTC CTGGAGGACA TGAGTGGCTG TTTGGGCTTT 5160 

ATCCAGCAGT GGGGGAGCCT TGGCAGGCTT GTGGCTTAQA TAGGTGCTTT AGAAAGCCCA S220 

CCAGCAGTTG CTGGGCCACC CCGCTGGCTG GGTCCTGTTC TAAGGCAGGA AATAC&AGCA 52 BQ 

TGAGCAGGAA AAGACCCCCT CAAGGCTCAC GTCCTAGTGG GGAGACAAGA AACACABATG 5340 

GGCAATATAA CACGATGTCT GGTTCCAGTA AGTGCAGTGA AGAACAAGCG AGGCTGGATG 5400 

CAGGGAGTGA TGGGAGGGGC TTTGTAAGGG GAGGTCGGGG GAAGCCTGTC TCAOAGGACA 5460 

CCAGAATGGA GCGCAGGAGC AflCACGTGGC AGTCACATGG CAGGCCGTTA GGGCAGAGGG 5520 

AGCTGGGCAG GGCACAGCAG GGCAGGAGTG TGTTTGATGT GTCCTGGGAA COGCCCTGAG 5580 

GCCGTCGTGT GGCTGGAGTG CTGCAGGTGT CAAGGAAATT GTAGGAGATG TCTCCTQAGT S640 

GTGATGGAAT ATAACCAGAT TTCCAGAAGG AACTGACATG ATCTGACTTA AAAAGGTCAG 5700 

TGTGCGAAAT GGCTTOCAGG GGACASGAGT GGGAGCAGGG AGATAGGAGA CAATGTGTAC 5760 

CAGGACAGCA GAAAGACATC CCGGGTAGCC TGGAACAGGG AGACOGTGTG GAGATGGTGG 5820 

GAGTCCGATA ATGAOAG CCG TAGGGCAAGG CCAGCAGGAT CCTAGAGTGA □ACOGGAGGT 5880 

AAAGTCACDG GGAjCTTGGTG TCTCCACGTC AGGGGCAGGG GAAAGGGAGA GGACAAGGGT 5940 

GACCCGGGAG GTTAAAGATG GGACCGGGGC CAQACGCAGT GGCTCATGCC TGTAATCCTA 6000 

GCACTTTGGG AGGCTGAGGC GGGCOOATAG CTTGAGGTCA GGAGTTTGAA ACCAGCCTGG 6060 

CCAACATGGT GAAACCCCGT CTCTACTAAA ATATACAAAA ATTAGCCTGG CGTGGTGGTG 6120 

CATGCCTGCA GTCCCAGCTA TTCAOGAGGC TGAGGCAACA AGAATCGCTT GAACCTGGGA 6180 

GGCGGAGGTT GCAGTGAGOC GAGATCGCGC CATAGCACTC CAGCCTTAGC CTGGGOGACA 6240 
GAGCGAOACC ACATC 

3eq ID NO: 86 Protein sequence 
Protein Accession #x XP 027172.1 



1 
1 

MDSSPSLPtil 
IJ5MQAMHTFD 
EVCYASCLLQ 
IKGGIKVENS 
LEFVGFSGNK 
KPYLJJRYPKG 
IDWGRGGGQE 



11 
I 

RTPESSLtHKA 
PQDILLAGNM 
RAAIiTFLQGS 
YQTYKELDSL 
PYGKLQLBEG 
AIPLFFAGRI 
RTNQHAGAGE 



21 
I 

IiDQCMTALDL 
MKEAQMLCQR 
SHQGAVRPRA 
VQS3QYCKGE 
ASGHSFRfiVL 
EVTKCNTDAV 
ALLAEQPGKT 



31 
I 

PLTNQFSEAL. 
HRRKSSVTDB 
LHDPSHACSC 
NHPHFKGGVK 
CVMbtiXiCYHT 
SDGGPGRGWG 
RKBEAFWPG 



41 

I 

3YLKPRTKES 
FSSLVNUPTL 
PPGpGhQHLF 
LGVGAPJTLTIj 
FIiTFVLGTGN 
8LGVSQTSRK 
ILTGRYSTAA 



51 
I 

MYESLTYATI 
GQFTEEEXEA 
LLQDENMVSF 
SMLPTRIIiRL 
VWIEKAEKLIi 
fiGTCDILRDR 
LQHREVEGGA 



60 
120 
180 
240 
300 
360 



Ssq ID NO: 87 dha geguence 
55 Nucleic Acid Accession #t ABQ07921 
Coding sequence; 143.. 1363 



60 
65 
70 
75 
80 



GGTGGAGACA 
CCCCCACACA 



CRACCAGTTC 
CTCACTGACA 
CATCCXGCTT 
GAAGTCTTCT 
CACTGAAGAG 
CCTGACCTTC 
TCCCTCTCAC 
GGACGAGAAC 
CXACAAGGAG 
GCACTTTGAA 



GCTGCTGCAG 
GCTCCTGCTQ 
CGAGGAGGCC 
OCTGTTCTTT 

ATGTGACATA 
ACCAGAGAGC 
AGGA5GCATT 
GGAGGGAGGT 



11 21 31 

\ I I 

CCGCCTCAGG GCTCGGTGCA CAGTGGACAT 
GGCACTGGGA ATGCAGGGGA GAGGGGGCCA 
GAITCAGGAG GGATGQATTC CAGTCCTAGC 
CTCCATGAGG OCCTGGACCA. GTGCATGAGC 
TCAGAAGCAC TCAGCTACCT CAAGCCCAGA 
TATGCCACCA TCCTGQAGAT GCAGGCCATG 
GCCGGCAACA TGATGAAGGA GGCACAGATG 
GTAACAGATT CCTTGAGCAG CCTGGTGAAC 
GAAATCCACG CTGAGGTCTG 
CTGCAGGGTT CCTCACACGG 
GCCTGCAGCT GCCCACCTGG GCCAGGCCGT 
ATGGTGAGCT TCATCAAAGG CSGGCATCAAA 
CTGGACAGCC TTGTTCAGTC CTCACAATAC 
GGAGGAGTGA AGCTTGGTGr AGGGGCCTTC 
ATCCTGAGGC TGT TGGAGT V TGTGGGGTTT 
CTGGAGGAGG GAGOGTCAGG GCACAGCTTC 
TGCTACCACA CCTTCXTTCAC CTTCGTGCTC 
GAGAAGCTCT TGAAGCOCTA CCTGAACCGG 
GCAGGGAGGA TXGAAGTCAT TAAAGGCAAC 
□QGGGCTGGG GAT CGCTCGG GGTCTCCCAG 
CTCAGGGACA GGATAGACTG GGGGCGGGGG 
AGGGGCAGGA GAGGCCCTTC TGGCAGAGCA 
TGTGGTGCCT GGGATTTTGA CTGGGAGATA 
GGAGGGAGGT GCTXGAGGGA GGCAGAGGTT 
TTAGGCTGGA GTCTGGTACC TCCGCTCCAT 



41 
I 

TTGGGGAGCG 
AGGGGGAAAG 
TTGCCACTTA 
GCCCTGGACC 
ACCAAGGAAA 
ATGACCTTTG 
CTGTGTCAGA 
CGCCCCACGC 

AGGCCCAGAG 
CAGCATCTTT 
GTTCGAAACA 
TGCAAGGGTG 



TCAGGAAACA 
CSCTCT6TGC 
GGTACTGGGA 
TACCCTAAGG 
ATTGATGCAG 
ACCAGCAGGA 
QGGGGGCCAA 
GCCTGGGAAG 
TAGGACXGCA 
AGGAAAGCCC 
TATAGCTCTC 



51 
\ 

TTGTGGGTGA 
GGGCCAGAGT 
TTAGGACTCC 
TCTTCCTCAC 
GCATGTACCA 
ACCGTCAGGA 
GGCACCGGAG 
TGGGCCAATT 



TCCTCCTGCA 
GCTACCAGAC 
AGAACCAOCC 
TGTCCATGCT 
AGGACTATGG 
TCTGTGTCAT 
ACGTCAACAT 
GTGCCATCTT 
TGAGTGATGG 
AGTCAGGCAC 
GAGAGAACCA 
ACAAGGGAGG 
GCATTGCAGT 
ATCTGTTTAG 



60 
120 
18D 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 B0 
1440 
1500 
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TCMTFTGTT ACTAAAAACC AGAGTCCTAG GCGGGGGCTG TATTTGAGCC CAAOGTCATG 1560 
TAAGACTTAG GAGGTAAAAC CAGGACTGGA GGCCAGATCT CCTGGCTCCT GGGGCCCCAC 1620 
CTGAGCCTAG CACAGGGCTG GACCACTATG CCCTGGAGGA GTCCCGGTCT GCTGTGGTGT 3-680 
TGGGAGGTTC GGAGGATGCA QAGGGGTTGG GGCTGGGTGG GCACCCGTCA GGCTGACCAG 1740 
5 AAGGTGCCTG CAGGCCATCC GGCGTTTCGA GGAGTGCTGT GAGGCCCAGC AGCACTGGAA IB 0 0 
GCAGTTCCAC CACATGTGCT ACTGGGAGCT GATGTQGTGC TTCACCTACA AGGGCCAGTG 1B60 
GAAGATGTCC TACTTCTACG CCGACCTGCT CAGCAAGGAG AACTGCTGGT CCAAGGTGGG 1920 
CTGATGCCAC GTGTTAGGGG CATTGGGTGA CCAGGGCTGA CTGTGTGCCT CCAGACCAOG 1980 
GGCCAAATCC CTAACTGAAC ACAGATGTCT CAGCTGQAAT CTAAACATAA CCTTAAATTC 2040 
10 TAATAGGACT CAGGCTTGGA AGGAACTAAA GACCACAAGA GAAACTTCTG AACCACAATG 2X00 
TGTCACAAAG AGATXTCTAG TCACAAAGGA CAOAAACATG GCTCCCTCTG TOCAGTAGAA 2160 
CTGTTTCTGT GGTAGAAATG TTCTACTTAT GCACTGCCCA AATATGGTAG TCACCAGCCA 2220 
CATOTGCACA CTGAGCACAT GAAATGTGCC TAOTGCAACT GGGAAACTGA TTGTTTTCAG 2260 
TTTTATTTAA TTTTAATTAA CTAAATOTTA AATTTAAATA GCCATGTAGG GCTTGTGGCC 2340 
15 ACTATATTGG ACTATGCAGG TCCAAAACAC AAAAGGCTCA TATAACTGAA CATTCTTGGC 2400 
ACATCCGACT TCAGGTAGGG CTGGATCCAG GAATTCAAAT GATGTCGTCT GGCTTGGTCT 2460 
TTCCATTTGT GGCTGTGCTC TCTCCTATGA CATCTTTGTT TCTGCTGCAT C'l'iTGCAGGA 2520 
AGGTTCTCTC CATGTGACAG GCAAGGTGGC CACGGGCTGC TTCTACTCAT ATCCTCCCCT 2580 
TGGTTTCAAC CAGAGTCCCA CAGTTTGAGC CTCACTGGTC TGACTTGTCA CCTGCCAATG 2640 
20 CCTGGAACAG AGGGGTGGGA AGATTCCTCA TGGGCTGAGA GTAGGAGAGG GGTGGTTCCA 2700 

CAGAAGAAAA TGATGCACCC AGAAATAGTG GAGGGATTAA CAAGATGCCA TACAGGCAAA 2760 
ACAAAGCCAA ACAGATGCCC GCCTACCAAA GATGAAATTT ATCATGGTAA GTATTGAAAT 2B20 
AAGTGTTAGC TTGTACCATC ATAGtTAATGA TAGTGCAGAA ATTGGAACCA AGAGTCTTAC 2880 
AACCACCTAG CTCAGCAAAC GTCTAATCTG TTATTTGTAA ATACACAGGA CATGTGTCTT 2940 
25 CATOaCTTCA TCCCCTGCCC ATGGCAGACA TTGCTAATCA ATCCTCTQCC ATGAGCCTAG 3000 

CTGTGACCTT AGATACCTTC CCTGCAAAGC TCCAGGCAGC CGTTAGAACT BACCCACATT 3060 
GGCACCTGTT AGGAGACCTG TCATCTGCAT CTGGTCCCTC CCCCTCCTCT GCATGGAGGT 3120 
CTCTTCCGTA GCCCTGCCTG GGGACCAAGC TGAAGGGGAA TTCTCTG3GT CTGGGGCAGG 3180 
TGGGGCTGGG GAAGGGAGCA GGCTTCTACT GAGCTCTCAA CATGXCTGGC ACCTQTCATG 3240 
30 TCCTCCACAC CACACCCAGT GAGGAGAAGT TATTATCTCC ATXGGGCACA TGAGGGCCAC 3 300 
TGATGCTAAG CQAGGACTGT GACTTTCCCA CCCACATGGC TGTTGGTGAC AGACACCGGA 3360 
TTTGAGGACA GTCCT3TGTG ACTCAGAGGC CTCTGCTTCT CTCTTGCTCC TAGAGTCCTC 3420 
CTGAGGAGTC GGGGCTTGCC CTGAGCCCAC CCTGCTGTTG AAGGTGCTTC CTCAGGCCCA 3480 
GCTCCCATGG CCCCCACACC CCCTCCTCAT CACCTCCTAC TCCCAAAAAG GACAAAGOCT 3540 
35 CAGGGAACCT TTTTTCTTTT TTTAGAGACG GGGTCTTGCT ATGTTGGTCA GGCTGGTCTT 3600 

GAACTCCTGG GGCCAGGCAA TCTTCCCGCC TCTGCCTCCT AAAGTQCTGG GATTCTGGGC 3660 
CrrAGGGAAC CTTTTTGAAC TGAAAOTGAC GCTCGAAGCC TTCCTGTAGA GTGAGGTGGG 3720 
TGGCTGGGGC TAACCATATG GGGAAGGAGA GACCTGGTGG GGGCACACAG CTGCTATATA 3780 
GAGGAACAGA AGGTGGGCCA GGCCTCCCAG CTGTCAGGAA ATCTGGCTCA GTCCCCAGAC 3840 
40 CGCTGTGTGG CTTTGGGGTG GCCAGTCCCT CTCTCTCTGC AACACTCTCC TCATGAGGAC 3900 
TTTCTGTGAAA AATGOGGGTT GTAACCCTCT GAGTTCTGGG GCCCCACCCA TTTCTAATCT 3960 
CCAGRACTTG GCCACCACAQ ACTCCACCAA CTTCTAGTCC TGGGGCCTQG GCCXCTGGCC 4020 
ATTGCCATAG GCACCACCTG CTCTGTQCAG GCAGCGCCCC CCTCTGCCAG GATCCTCCGA 4080 
GGTCAGCTGC TGGGTCTGAC CCGCAGACCC TGGCTGAGCG ACGGATGAAC GGAGXATGCA 4140 
45 GACACAGGCT TGCTTGTCAG CAGATGGGGG ACCCCXGCCA GAGTCAGCAG CGGCCCCCAT 4200 

AAGCCTGCCA CGCTTGCATT TATTTAGTAC AGATOTAATG ACAAAGGCCT AAAGCAAACT 4260 
CCATTTGTGG GTAATTAACA TTGTCGCCCC CCGAGAAAGA GCAGTCCTCC GCATGATGAT 4320 
TAAAGGCCAG GTTCCGAGGC CTAAGTAAAC CAACTTATCT AGATCAATTC CCTTACXTCT 4380 
TGTTATCTAC TCTGAGAGAA TTCAGCTQCC TTCAGCCAAA TCCTTTCCCG AAGCTTTTGC 4440 
50 AAAAOCTCCG AGCCTTCCAA GGTTTGCTTC TTTCTGTAAT TTTTCTCACC ACCCTGACCT 4500 
ATCTCCTGCA GTCAGCCCTG TGGAGGCCTT TGTGTTTCCC CCAGTGCTGG CAGCCTAOAG 4560 
GCTGAGATGG CCAGAAACAA GGTGGTGACA GTGGCGTGCT CAGGGCTTGG GAAACCCAAG 462 O 
GAGCTAAAGG CATGCCCAGG CAACCAAAGA GGACAGGAAG GCTTCTGAGG AGAGAOCTCT 4680 
GAGGTGGGTC TTGGAGAQGA AGGACTTAGG GAOGCAGAGT GGAGGAAGTG AGAGGACACC 47*0 
55 CCAAGCCAAG AGGGCGGCAG GACCAAAGGC TCAGAAGCCA GGGOGCTGCA GAGGGGCTGX 4800 

GTGCCACAGG GTGAAGAGTT TGTGTGGCAG AAGGGCAGGG GQCTTGCATG AGGGGTGACA 4860 
GCTGCTCTTT TGTCCCAGCA TAGCCCCTGT ACATCCCTGG AGAGCTGGGG CGTCCACAAG 4920 
TCTAAGTCAC AGCCCCCATC CXAACCCTGG TGGXGCAGTG AGGGTGAGCT GTCTGTGGGC 4380 
AGGAGGGAAG ACTCTTGGAG ATGAGCCTGG TGAAGGQATA ATGGCATCCC GGGCCGAGGA 5040 
60 GCAGCACAGG CAGAGGCCTG GGGAGAGTTT AAGGAGTGTA GGGGAGGAAA TGGCAGAAGA 5100 
TGAGCCAGAA AAAGAAAG3T TAGGGCAGGT CCTGGAGGAC ATGAGTGGCT GTTTGGGCTT 5 ISO 
TATCCAGCAG TGGGGGAGCC TTGGCAGGCT TGTGGCTTAG ATAGGTGCTT TAGAAAGCCC 5220 
ACCAGCAGTT GCTOGGCCAC CCCGCTGGCT GGGTCCTGTT CTAAGGC3VGG AAATACAAGA 52 BO 
ATGAGCAGGA AAAGACCCCC TCAAGGCTCA CGTCCTAGTG GGGAGACAAG AAACACAGAT 5340 
65 GGGCAATATA ACACGATGTC TGGTTCXMT AAGTGCAGTG AAGAACAAGC GAGGCTGGAT 5400 
GCAGGGAGTG ATGGGAGGGG CTTTGTAAGG GGAGGTCGGG GGAAGCCTGT CTCAGAGGAC 5460 
ACCAGAATGG AGCGCAQGAS CAGCACGTGG CAGTCACATG GCAGGCCOTT AGGGCAGAGG S520 
GAGCTQGGCA GGGCACAGCA GGGCAGGAGT GTGTTTGATG TGTCCTGGGA ACCGCCCTGA 5580 
GGCCGTCGTG TQGCTGGAGT GCTGCAGGTG TCAAGGAAAT TGTAGGAGAT GTCTCCTGAG 5640 
70 TGTGATGGAA TATAACCAGA TTTCCAQAAG GAACTGACAT GATCTGACTT AAAAAGGTCA 5700 
GTGTGCGAAA TGGCTTGGAG GGOACAGGAG TGGGAGCAGG GAGATAGGAG ACAATGTGTA 5760 
CCAGGACAGC AGAAAGACAT CCCGGGTAGC CTGGAACAGG GAGAOGGTGT GGAGATGGTG 5820 
GCAGTCCGAT AATGAGAGOC GTAGGGCAAG GGCAQCAGGA TCCTAGAGTG AGACGGGAGG 58 BO 
. TAAAGTCACC GGGACTTGGT GTCTCCACGT CAGGGGCAGG GGAAAGGGAG AGGACAAGGG 5940 
75 TGACCCGGGA GGTTAAAGAT GGGACCGGGG CCAGATOCAG XGGCTCATGC CTGTAATCCT 6000 
AGCACTTTGG GAGGCTGAGG CGGGCGGATG GCTTGAGGTC AGGAGTTTGA AACOGGC3CTa 6060 
GOCAACATGG TGAAACCCCG TCTCTACTAA AATATACAAA AATTAGCCTG GCGTGGTGGT 6120 
GCAXGCCTGC AGTCCCAGCT ATTCAGGAQG CTGAGGCGAC AAGAATCGCT TGAACCTGGG 61B0 
_ . AGGCGGAGGT TOCAGTGAGC CGAGATCGCG CCATAGCACT CCAGCCTTAG CCTGGGCGAC 6240 

80 AGAGCGAGAC CACATC 
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70 

75 
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1 
1 

VETPPQGSVH 
ESSLKEALDQ 
XIjIiAGMMHKB 
LTFLQGSSHG 
YKELDSIiVQS 
I^QLEEGASG 
I*FPAGRIEVI 



11 
1 

SC3KLGSWGD 
CMTALDltPLrT 
AQWIiCQRHRR 
GAVRPRAI/HD 
9QYCKQENHP 
HSFRSVLCVM 
KGNIDAVSDG 
GRAAWEDKGG 



21 
1 

PHTGTGNAGE 
KQFSEAL8YL 
KSSVTDSPSS 
PSHACSCPPG 
HPEGGVKLGV 
LLLCYHTFLT 
GPGRGWGSLG 
GGICGATOFD 



31 
1 

RGPRGKGARV 
KPRTKESMYH 
IiVNRPTIiGQF 
PGRQHLFLLO 
GAFNlfTLSMIj 
FVLGTGKVNI 
VSQTSRKSGT 
WEI 



41 

I 

LrAIJDSGGMDS 
SLTYATILEM 
TEEEIHAEVC 
DENMVSFIKG 
PTRIIJUJr.EF 
EEAEKLLKPY 
CDILRDRIDW 



51 

i 

SPSLPLIRTP 
QAMMTTDPQD 
YAECLLQRAA 
GIKVHNSYQT 
VGFSGNKDYG 
LNRYPKGAIF 
GRGGGPRENQ 



Seq id NO i 89 DNA sequence 
Nucleic Acid AcceBaion »t AF00717O 
Coding sequence: 73.-1725 

1 11 21 31 41 51 

I 1 1 I I 1 

AAGGAGGCGG CCTCCGGGAA AAGCGACDGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 
AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCXTGC CGGCAACATG 
ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 
TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACQCT 
GAGGTCTGCT atgcagagtg CCTGCTGCAG CGAGCAGCCC tgaccttcct GCAGGACGAG 
AACATGGTGA GCTTCAICAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 
GAGCXGGACA GOCTTGTTCA GTCCTCRCAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 
GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT QCTTCCTACT 
AGGATCCTGA GGCTGTTGGA GTTTGXGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 
CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAQOAG 
GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTOCCAT CTTCCTGTTC 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 
GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATQTCCT ACTTCTACGC CGACCTGCTC 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACQ AAGTGGAATT ATTTCGAGCT 
STGCCAGGCC TGAAGCTCAA GATTGCraaG AAATCTCTAC CCACAGAGAA GTTTGCCATC 
CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC 1GCCAGTGCC TGCTCTGGAA 
ATGATGTACA TCTGGAACGG CTACGCCGTQ ATTGGGAAGC AGCCGAAACT CACGGATGGG 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 
TCAGTGGATG ACGAGTGCTT GGTGAAATTQ TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 
CGTGTCCAGG AG3CCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 
TAT qaccact acttgatccc aaacgccctq CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 
QACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 
TCCATGGAGT C3UU3GACACA CTTTCGAATC CAGQCAGCCA CACTCCAAGC CAAGTCTTCC 
CTAGAGAACA GCAGCAGATC CATQGTCTCA TCAGTGTCCT TGTAQCTTTG TGCAGCAGTT 
CCGQaCTOGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCOCCT 
OCCCCTGOOC TGCCCTCCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 
TGTATCCGTG CAGAAGCCGA GGTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 
GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT AQGGGTACTT GCTTTTCACT 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CXTITCAAAT 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAOA GGACCTGAGG AATGCCTQQG 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 

ccactacctt ACTACTCACA cxtcatttca ctccttttgt aaatttccaa tttaaaaatc 

AAGCACGTCT TTTTAGTGAG ATAAAATCTO AGCTCTTCTG TAGAAAAATC AATCTCTACC 
AGTAGAAAAT GCCAGGGCTT GATGGARGAG CTGTGTAOCC CTTTCTATGC CAAAGCGAGG 
AAAYTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 
ctgaaaccac tgggaataat ttatgaaaca TAAAAATCTT CTGTACTTCA CTCCAAGGTA 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGT T ATTC TTGAAAAAAA AAAAAAAAAA 
AA 

fieq ID MO: 90 Protein sequence 
Protein Accession ft: AAC39582-1 



KSAA3GK3DR 
LEMQAMMTFD 
"EVCYAECLTiQ 
EGGVKLGVGA 
IKZYHTFliTFV 
ECCEAQQHWK 
MFGKHDHKPF 
MMYIWNGYAV 
RVQEABENFR 



11 
I 

RTPESSLHSA 
PQDIKLAGNM 
RAALTFLCDB 
FITLTL6MLPT 
LGTGNVNIEE 
OFHHMCYWRL 
GDDEVEEiFRA 
IGKQPKLTDG 



21 
I 

LDQCMTAH>I> 
MKEAQML.CQR 
NMVSFIKGGI 
RILRLLEFVG 
AEKbLKPYLN 
MWCFTYK0QH 
VPGLXLKIAG 
ILEHTKAEE 
YDHYLlPNAIi 



SME3RTHFRI QAATLQAKS3 IiEMSSRSKVfi 



31 
I 

FLTNQFSEAB 
KRRK£ S VXD S 
KVRKSyQTYK 
FSGNKDYGLL 
RYPKGAIFlkF 
KMfiYFYADliL 
KSIrPTEKFAl 
MTiKKGPSNEY 
IiEIiALIiLMEQ 
SVSli 



41 
I 

SYLXPHTKES 
FSSLWRPTL 
ELDSLVQSSQ 



FAGRIEVIKG 
SKENCW8KAT 
RKSSRYFSSN 
SVPUECliVKI* 
DRNEEAIKLli 



60 
120 
ISO 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1H60 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2560 
2640 



51 
I 

MYHSLTYATI 
GQFTEEEIHA 

FRSVLfCVMLL 
N1DAAIRRFE 
YIYMKAAYLS 
PI8LPVPALB 
IiXGIiOjKYIjG 
ESAKQNYXNY 



60 
120 
ISO 
240 
300 
360 
420 
480 
540 



Seq 33) NO; 91 DNA sequence 
Nucleic Acid Accession ft: 
Coding sequence: 31.. 906 



Eos sequence 



986 



WO 03/042661 PCT/US02/36810 



1 11 21 31 41 51 

I I 1 I I I 

CGGGTCGACC CACGCGTCCO GGGAGAAAGG ATGGCCGGCC TGGCGGCGCG GTTGGTCCTG 60 

CTAGCTGGGG CAGCGGCGCT GGCOAGCGGC TCCCAGGGCG ACOGTGAGCC GGTGTACCGC 120 

5 GACTGCGTAC TGCAGTGCGA AGAGCAGAAC TGCTCTGGGG GCGCTCTGAA TCACTTCCGC 1B0 

TCCCGCCAGC CAATCTACAT GAGTCTAGCA GGCTGGACCT GTCGGGACGA CTGTAAGTAT 240 

GAGTGTATGT GGGTCACCGT TGGGCTCTAC CTCCAGGAAG GTCACAAAGT GCCTCAGTTC 300 

CATGGCAAGT GQCCCTTCTC CCGGTTCCTG TTCTTTCAAG AGCCGGCATC GGOCGTGGCC 360 

TCGTTTCTCA ATGGCCTQGC CAGCCTGGTG ATGCTCTGCC GCTACCGCAC CTTCGTGCCA 420 

10 GCCTCCTCCC CCATGTACCA CACCTGTGTG GCCTTCGCCT GGGTGTOCCT CAATGCATGG 480 

T X CTGGTCCA CAGTYTTCCA CACCAGGGAC ACTGACCTCA CAGAGAAAAT GGACTACTTC 340 

TGTGCCTCCA CTQTCATCCT ACACTCAATC TAOCTGTGCT GCGTCAGCCT CATCCGCTTC 600 

GACTATGGCT ACAACCTGGT GQCCAAOGTG GCTATTGGCC TGGTCAAOGT GGTGTGGTGG 660 

CTGGCCTGGT GCCTGTGGAA CCAGCGGCGG CTGDCTCACG TGCGCAAGTG CGTGGTGGTG 720 

15 GTCTTGCTGC TGCAGGGGCT GTCCCTGCTC GAGCTGCTTG ACTTCCCACC GCTCTTCTGG 780 

GTCCTGGATG CCCATGCCAT CTGGCACATC AGCACCATCC CTGTCCACGT CCTCTTTTTC 840 

AGCTTTCTGG AAGATGACAG CCTGTACCTG CTGAAGGAAT CAGAGGACAA GTTCAAGCTG SOD 

GACrOAAGAC CTTGGAGCGA GTCTGCCCCA GTGGGGATCC TGCCCCCGCC CTGCTGGCCT 960 

CCCTTCTCCC CTCAACCCTT GAGATGATTT TCTCTTTTCA ACTTCTTGAA CTTGGACATG 1020 

20 AAGGATGTGG QCCCAGAATC ATGTGGCCRG CCCACCCCCT GTTGGCCCTC ACCAGCCTTG 10 B0 

GAGTCTGTTC TAGGGAAGGC CTCCCAGCAT CTGGGACTCG AGAGTGGGCA GCCCCTCTAC 1140 

CTCCTGGAGC TGAACTGGGG TGGAACTGAG TGTGCTCTTA GCTCTACCGG GAGOACAGCT 1200 

gcctgtttcc tccccatcag ccicctccoc acatccccag ctgcctggct gggtcctgaa 1260 

gccctctgtc tacctgogag accaggoacc acaggcctta gggatacaq3 gggtcccctt 1320 

25 CTGTTACCAC COOCCACCCX CCTCCAGGAG ACCACTAOQT GGTGCTGGAT GCTTGTTCTT 1360 

TGGCCAGCCA AGGTTCACGG OGATTCTCCC CATQGGATCT TGAGGGACCA AGCTGCTGGG 1440 

ATTGGGAAGG AGTTTCACCC TGACCRTTGC CCTAGCCAGG XICCCAGGAG. GCCTCAJCCAT IS 00 

ACTCCCTTTC AGGGCCAGGQ CTCCAGCAAG CCCAGGGCAA GGATCCTGXG CTGCTGTCTG 15 GO 

GTTGAOAGCC TGCCACCGTG TGTCGGGAGT OTQGGCCAGG CTGAGTGCAT AGGTGACAGG 1620 

30 GCCGTGAGCA TGGGCCTGGG TGTGTGTGAG CTCAGGC3VCT AGGTGCGCAG TGTGGAGACG 16B0 

GGTGTTGTCG GGGAAGAGGT GTGGCTTCAA AGTGTTGTGT GTGCAGGOGa TKGGTGTGTT 1740 

AAGCGTGGGT TAGQGGAACG TGTGTGCGCG TGCTGGTQGG CATGTGAGAT GAGTGACTQC 1800 

CGGTGAATGT gtccacagtt gagaggttgg AGCAGGATGA gggaatcctg TCACCATCAA I860 

TAATCACTTG TGGAGCGCCA. CTTGGCCCAA GACGCCACCT GGQCGGACAG CAGGAGCTCr 1920 

35 CCATGGCCAG GCTGCCTGTG TGCATGTTCC CTGTCTGGTG CCCCTTTGCC CGCCTOCTGC I960 

AAACCTCACA GGGTCCCCAC ACAACAGTGC CCTCCAGAAG CRGCOCCTOG GAGGCAGAGG 2040 

AAGGAAAATG GGGATGGCTG GGGCTCTCTC CATCCTCCTT TTGTCCTTCJC CTTCGCATGG 2100 

CTGGCCTTCC CCTCCAAAAC ClCCATTCCC CTGCTGCCAG CCCCTTTGOC ATAGCCTGAT 2160 

TTTGGGGAGG AGGAAGGGGC GATTTGAGGG AGAAGGGGAG AAAGCTTATG GCTGGGTCTG 2220 

40 GTTTCTTCCC TTCCCAGAGG GTCTTACTGT TCCAGGGTGG CCCCAGGGCA GGCAQGGGCC 2280 

ACACTATGCC TGCGCCCTGG TAAAGGTGAC CCCTGCCATT TACCAGCAGC CCTGOCATGI 2340 

TCCTGCCCCA CAGGAATA3A ATGGAGGGAG CTCCAGAAAC TTTCCATCCC AAAGGCAGTC 2400 

TCCGTGGTTG AAGCAGACTG GATTTTTGCT CTGCCCCTGA CCCCTTGTCC CTCTTTGAGG 2460 

GAGGGGAGCT ATGCTAGGAC TCCAACCTCA GGGACTCGGG TGGCCTGCGC TAGCTTCTTT 2520 

45 TGATACTGAA AACTTTTAAG GTGGGAGGGT GGCAAGGGAT GTGCTTAATA AATCAATTCC 2580 

AAGCCTCAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 



50 



60 



Seq ID HO: 92 Protein aeguence 
Protein Accession ft: Bos sequence 



1 11 21 31 41 51 

1 I I I I I 

MAGLAARL.VL LAGAAALASG SQGDREPVYR DCVLQCEEQN CSGGA1HHFR SRQPIYMSLA 60 
GWTCRDDCKT ECMWVTVGLY LQEGHKVPQF HGKWPFSRFL FFQSPASAVA SFLNGI»ASIiV 120 
55 MLCRYRTFVP ABSPMYHTCV AFAWSLHAW FWfiTVFHTKD TDI/TEKMDYF CASTVILHSI 180 
Yl^CCVRTVGL QHPAWfiAPR ALLLLMLTVH VSTLSIiIBFD YGYNLVANVA IGLVNWKWIi 240 
AKCLUNQKRIi PHVRKCVWV ^LDQGDSUIrE I1LDFPPI1FKV LDAHAIWHXS TIPVHVI»FFS 300 
FLSDOSLYIJj KESEDKPKLD 



Sea; TD NO: ?3 DWA secfUBoce 

Wucleic Acid Accession #s KM_033419.1 

Coding sequence ; . 18 - - 980 



1 11 21 31 41 51 

65 i ] i i I I 

CGAGCCAGGG AGAAAGGATG GCCGGCCTGS CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 

CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGOT GTAOCGOGAC TGCGTACTQC 120 

AGTGCGAAGA GCAGAACTGC TCTGGGGGCG CTCTGAATCA CTTCCGCTCC CGCCAGCCAA 180 

TCTACATGAG TCTAGCAQGC TOGACCTGrTC GGGAOGACTG TAAQTATGAG TGTATGTGGG 240 

70 TCACCGTTGQ GCTCTACCTC CAGGAAGGTC AGAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 

CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGQCCTCG TTTCTCAATG 360 

GCCTGGCCAa CCTGGTGATG CTCTGGCGCT ACOGCACCTT CGTGCCAGCC TCCTGCCCCA 420 

TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 

TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATQGA CTACTTCTGT GCCTCCACTG 540 

75 TCATCCTACA CTCAATCTAC CTQTGCTGCX3 TCAGGAOOGT GGGGCTGCAG CACCCAGCTG 600 

TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 

GCCTCATCCG CTTCGACTAT GGCTACAACC TGGTGGCCAA DGTOGCTATT GGCCTGGTCA 720 

ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GOGGCTGCCT CACGTGGGCA 780 

orfc AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 

80 CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAQC2ACC ATCCCTGTCC 900 

ACGTCCTCTT TTTCAGCTTT CTGGAAGATG ACAGCCTGTA CCTGCTGAAG GAATCAGAGG 960 

ACAAGTTCAA GCTGGACTGA AGACCTTGGR GCGAGTCTGC OOCAGTGGGG ATOCTGCCCC 10 20 

CGOCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 10 BO 

TGAACTTGGA CATGAAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1140 

987 



WO 03/042661 



PCT/US02/3681© 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CCTCACCAGC 
GGCAGCCCCT 
CGGGOflGGAC 

CAGGGGGTCC 
GGATGCTTGT 
ACCAAGCTGC 
GGAGGCCTCA 

G CAT AGGTG A 
CAGTGTGGAC3 
QGTGQOTGTG 
TGAGTGACTG 
GTCACCATCA 
AGOCAGGAGC 
GCCOGCCTCC 
TCGGAGGCAQ 
TGCCTTCGCA 
GCCATAGCCT 
ATGGCTGGGT 
GCAGGCAGGG 
AGCCCTGGCA 
CCCAAAGGCA 
TCCCXCTTTG 
OGCTAGCTTC 
ATAAATCAAT 



CTTGGAGTCT 
CTACCTCCTG 
AGCTGCCTGT 
TGAAGCCCTC 

TCTTTGGCCA 
TGGGATTGGG 
CCATACTCCC 
TCTGGTTGAG 
CAGGGCCGTG 
ACGQGTGTTG 
TTAGCGTGGG 
CCGGTGAATG 
ATAATCACTT 
TCTCCATGGC 
TGCAAACCTC 
AGGARGGAAA 
TGGCTGGCCT 
GATTTTGGGG 
CTGGTTTCTT 
GCCACACTAT 
TGTTCCTGCC 
GTCTCGGTGG 
AGGGAGGGGA 
TTTTGATACT 
TCCAAGCCTC 



GTTCTAGGGA 
GAGCTGAACT 
TTCCXCCCCA 
TGTCTAOCTG 
CCACCCCCCA 
□CCAAGGTTC 
AAGGAGTTTC 
TTTCAGGGCC 
AGCCTGCCAC 
AGCATGGGCC 
TCGGGGAAGA 
TTAGGGGAAC 
TGTCCACAGT 
GTGGAGCGCC 
CAGGCTGOCT 
ACAGGGTCCC 
ATGGGGATGG 



AGGAGGAAGG 
CCCTTCCCAG 
GCCTGCGCCC 
CCACAGGAAT 
TTGAAGCAOA 
GCTATGCTAG 
GAAAACTTTT 
AAAAAAAAAA 



AGC3CCTCCCA 
GGGGTGGAAC 
TCAGCCTOCT 
GGAGACCAGG 
OCCTCCTCCA 
ACGOCGATTC 
ACCCTGACCA 
AGGGCTCCAG 
CGTGTGTCGG 
TGGGTGTGTG 
GGTGTGGCTT 
GTGTGTGCGC 
TGAGAGGTTG 
AGCTCTGCCC 
GTGTGCATGT 
CACACAACAG 
CTGGGGCTCT 
AACCTCCATT 
GGCGATTTGA 
AGGGTCTTAC 
TGGTAAAGGT 
AGAATGGAG3 
CTGGATTTTT 
GACTCCAACC 
AAGGTGGGAG 
AAAAAAAAAA 



GCATCTGGGA 
TGAGTGTGCT 
CCCCACATCC 
GACCRCAGGC 
GGACACCACT 
TOCCCATGGG 
TTGCCCTAGC 
CAAGCCCAGG 
GAGTGTGGGC 
TGAGCTCAGG 
CAAAGTGTGT 
GTGCTGGTGG 
GAGCAQ3ATG 
AAGGCGCCAC 
TCCCTGTCTG 
TGCCCTCCAG 
CTCCATCCTC 
CCCCTGCTGC 
GGGAGAAGGG 
TGTTCCAGGG 
GACCCCTGCC 
GAGCTCCAGA 




CTCGAGAGTG 
CTTAGCTCTA 
CCAGCIGCCT 
CTTAGGGATA 
AGGTGGTGCT 
ATCTTGAGGG 
CAGGTTCCCA 
GCAAGGATCC 
CAGGCTQAGT 
CGTAGGTGCG 
GTGTGCAGGG 
GCATGTGAGA 
AGGGAATCCT 
CTGGGCGGAC 
GTGCCCCTTT 
AAGCAGCCCC 
CTTTTCTCCT 
CAGCCCCTXT 
GAGAAAGCTT 
TGGCCCCAGG 
ATTTACCAGC 
AACTTTCCAT 
TGACCCCTTG 
GGGTGGCCTG 
GATGTGCTTA 



1200 
1260 
1320 

l3ao 
1440 
1500 
1S60 
1620 
1690 
1740 
1800 
I860 
1920 
19B0 
2040 
2100 
21GO 
2220 
22B0 
2340 
2400 
2460 
2520 
2560 
2640 



Seq ID KG: 94 Protein sequence 
Protein Accession #: NP_2194B7.1 

1 11 21 31 41 51 

I 1 I I I 1 

MKDVGPB9CG QPTPCWPSPA LESVLGKASQ HLGLBSGQPli YLLELNWGGT ECALSSTGRT 
AACPLPISIi PTSPAAWliGP EALCLPGHPG TTGLRDTGGP UJOPPPTLLQ DTTRWCWMLV 
DWPAKVHGUS PHGILBDOAA GIGKEFHPDH CPSOVFRRPH HTFFOGQGSS KPRARlIiCCC 
LVESLPPCVG SVGQAECIGD RAVBMGIXJVC BI*RPRCAVWR SVLSGKRCQF KVCVCRGWVC 

Seq ED NO i 95 DNA sequence 

Kfticleic Acid Accession #i xm_0 90469 

Coding sequence: 



1 
I 

ATGGGGTTTG 
TGCGGCGTGC 
AGCCCACACC 
CCACCTGAAG 
TCCCTGGCAC 
ATGTCTCTTC 
AACTCCGCAG 
GCTGCCCACC 
CAATCAGTTC 
AGGGCACCTT 
CTCAGCCGCA 
OGCGAGCTAG 
TGOCTGCCOG 
GGCAGCCCAC 
CTGCATCAAG 
TGTATCAAAG 
GAATGA 



11 
I 

GAGACCAGGG 
TGGACAGGAC 
CCTTQCATGG 
TGAAAATGTT 
AAGGCACAGG 
CTGGAGTGCT 
ATACTGACXG 
AGGAAGACAC 
AGCAGCAGAA 
GCGCGCCACC 
GCCCGGCGOC 
AOGCGCCAGG 
CGGCTCCAGA 
CACCTAAGGA 
AATCACCAGG 
GGGCTCTGGG 



21 
1 

AACGGTGGAA 
GCAGGTGAAC 
ACAGTCGGTC 
TGGACOCAGT 
CTGCTCAGTG 
CAGTQCTGAT 
GGGAACTGAG 
TTTCTTTTTG 
TTGCATTTAC 
AAGGCGAGAG 
GTCCCCAOGT 
COCGCCOCCC 
AGCCCAGAAA 
CATCTTGCAG 
TTCATTTCCA 
"TGTGGCTGCT 



31 
I 

GGOAGCCTAG 
AAGCCCTGCC 
AACAGCCACC 
CAAGGTTTGC 
TGCTGGGAGG 
GCTGGOCAGG 
GAGGGACCAG 
AAAGACATCA 
AGCOCCAGAG 
GCCCCGCTTG 
ACACCCATGG 
GGGCTGOQGA 
CTCTTCTTCC 
AOGCTCCAGC 
ATAGGTTTAT 
GAAGCAGCAT 



41 
1 

GAACGTCGAA 
TGCTGGAGGC 
CATCAGGTGC 
TGACAATGGA 
TGAAOGGCOC 
TGOAGCACAG 
GAAGTCTAGA 
AGCACACAAG 
AAAAACCCTG 
CCCTCAGCAG 
OGOCCTTCCC 
GCTCGGCAGC 
TGCTGCOCCT 
ACAAGGCACC 
GTAACAGAAC 
TTCACTTACA 



Seq ID NO: 96 Protein sequence 
Protein Accession XP_090469 

21 



60 
120 
180 



51 
1 

AGATAGCTCC 
TGCCAAGCAC 
TCATCASAAA 
AACAAACCAG 
AGGGTTGGAA 
AAGAGAAATG 
TTTCGCCGTG 
TACGTTCAGG 
TGGGAATGTC 

GACCTCTGAC 
AGOCCCTCAC 
TTATCCAGAT 
TGGGAAGAGA 
AGCTGAACCA 
GTTTTCCAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 

7ao 

840 
9Q0 
960 



1 11 21 31 41 51 

1 1 I I i I 

MGFGDQGTVS GSLGTSKDSS CKVLnRTQVN KPCLLEAAKH SPHP^HGQSV NSHPSGAHQK 
PPEVKMFGAS QGLLTMETNQ SLAOGTGCSV CWBVNGPGLE HSLPGVLBAD AGQVEHHRQM 
NSADTDHQTB EGPGSU3PAV AAHQEDTPFI* lODIZHTBTFR QSVQQQNCIY SPKEKPCGKV 
RAPCAPPRRB APLftLSHPWR I»SHSPAPSPR TPMAPFPTSD RRLDAFGPPP GLRSSAAAPH 
CLPAAPEAQK LFFLLFLYPD G6PPPJCDII1Q TLQHKAPGKH LHQESPGSFP IHLCHRTAEP 
CIKGALGVAA EAAFHLQFSS V 

Seii ID NO j 97 DNA sequence 

Nucleic Acid Accession #= KK_003474.2 

Ooding sequence: 37- -3036 

1 11 21 31 41 51 

I I I I I I 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TXGCTCATTT ATTGCAACGG 
TCAAGGCTGG CTTGTGCCAG AAOGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 
CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC gogggcogtg cgcgagggct 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CrCCGGTCGC GACGOCCGGC COCGCTCGGC 
GC3CCGOGTG3 GATGGTGCAG CGCTCGCGGC CGGGCCCGAG AGCTGCTGCA CTGAAGGOCG 

988 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



WO 03/042661 



PCT/US02/36810 



GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCtTT 6CTC3GCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC GGAGGGGTGA GCTTATGGAA DOAAGGAAGA 420 
GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 48 D 
TTCGACTCCA AQAATCATCC AGAAGTGCTG AATATTOGAC TACAAGGGGA AAQCAAAGAA S40 
5 CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 60 O 

TATCXGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 
TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 
10 TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGSA AAAAGTTAAG 1020 
CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1140 
15 CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 

TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTAXTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGXG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 132 0 
GACAATOCCC TTGGTGCAOC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 
AATCATGACA CACXGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
20 ATCAXGAACG CTTCCACCGG GTACCCATTT CCCAXGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTGOAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620 
GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 
AAG CCGG ACG CTGTGTGCGC ACATGGGCTG TGCTGXGAAG ACTGCCAGCT GAAGCCTGCA 1740 
25 GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTQACCTCC CAGAGTTCTG CACAGGGGCC IB 00 

AGCCCTCACT GCCCAGCCAA CGTGXACCTG CACGATGGGC ACTCATGXGA GGATGTGGAC 1860 
GGCTACTGCT ACAATGGCAT CTGCGAGACT CACOAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 
CCAGOTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040 
30 AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 

GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTQ CCGGGGGACC 2160 
CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTQT 2220 
GCAGATGGAA AAATCTGCCT QAATCX3TGAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
35 GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTXXGGCX TTGGAGGAAG CACAGACAGC 2400 

GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAQ GAATTCXGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT GTCAAAAGGA AGACCTTGAT ACGACTGCXG 2520 
TTTAGAAATA AGAAGACCAC CATTGAAAAA CTAAGGTOTG TGCGCCCTTC CCGGCCACCC 2580 
CGrGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGGCT GATGAGGAAG 2640 
40 CCGCCAGATT CCTACOCACC GAAGGACAAT CGCAGGAGAT TGCTGCAGTG TCAGAATGTT 27 0O 

GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGDGAGXG 2760 
CTTCCTCCCC TCCAOCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 
AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCOCCTCA GAAQCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
45 CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 

GTGCCCAGAT CCACCCACAC CGCCTATATT AAGXGAGAAG CCGACACCTT TTTTCAACAG 3060 
TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTXXTGTA CCAACTTTTA 3120 
GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGOCGTCAGT 3180 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT IQCACAGGXA CXXGTAAAXX ATTAATITAT 3240 
50 GCAaAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCAXGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGGAXG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 
CCAACTACCC CCAGCTGTGG TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAr 3480 
CTCAGTTGAT TTXCXGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
55 TGTGTTTGOC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAB GCTCCCAGGG 3600 

ACACCXGGGA GAAATCTGGC TTCXGGOCAG QAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAX AGAGACTGGA ACACTAGACA AGCCAGAACX 3720 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAQGGG TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTGAT AQAAATGCCA AGCAjCTTCTT TXTCTCGCTG TCCTTTCTAG AGCACXGCCA 3840 
60 CCAGXAGGTT ATTTAGCTTO GGAAAGQTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 

CTGCAAACCG CCACCTCXICT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTATTCAGAC AGATQAGGAC TTTCCATGGG ACCACAACTA TTTTtZAGATG 4020 
TGARCCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGG1XCA ACTTATTAAC 4080 
AATTAQGCAB ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 
65 TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTOGAC AAAGAAOCTT CTCTATGGGG 4200 
CATCCTCTTT TTCCAACTTG GCXGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCXATTTCT TAAACACITG CAACCTACCT GTTGAGCATC AC5VGAMK3TG ATAAGGAAAX 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCMCA TCCCCTTGAA 4380 
nfx CTOTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 
70 AATGGCATGA GAAATACAAA AATACTCAGA TAAQGTAAAA TGCCATGATG CCTCTGTCTT 4500 
CTGGACTOGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 
- TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

75 AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 

TTATTCTAXA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACXG TAACTGATTA CACTTGGTAA TTBTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCOCAA ATCCTTAGAT CXGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 
CO ACCAAAAAAA AAAAAAAAAA AA 
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MAARPIxFVSP ARALli&IAG ALiAPCEARG VSLHNEGHAD BWSASVR3G DLKIPVKSFD 60 

SKNHPBVLNI RLQRESKELI INLERNEQLI ASSFT ETHYL QDGTDVSIjAR NYTVILGHCY 12 0 

5 VHGHTOGYSD SAVSLSTCSG LRGLIVPEWE SYVLEFMKSA TNRYKLPPAK KLKSVRGSCG ISO 

SHHNTPM1AA KNVFPPPSQT WARRHKRKTL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 

LIEIANHVDK FYRPLNIRIV LVGVEVWNDW DKCSVSQDPF TSLHEFLDKR KMKIiPRKSH 300 

DHAQIaVSGVY PQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTIAH ELGHNFGMNH 360 

DTLDRGCSOQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

10 SFGGQKCGNH FVEEGEECDC GEPEECMNRC CNATTCXbKP DAVCRHGLCC KDCQLKPAGT 4S0 

ACRDSSNSCD LPEPCTGASP HCPANVYXHD GHfiCQDVDGY CYNGICQTHE QQCVTLWGPG 540 

AKPAPQXCFE KVNSAGDPYG AXCJBMRDAKC GKIQCQGGAS RPVIGTHAV3 600 

IETKIPLQQG GRILCRGTHV YIjGDDMPDPG LVLAGTKCAD GKICIMRQCQ NISVFGVHEC 660 

AMQCHGEOVC NNRKNCHCEA HWAPPFCDKP GFGGSTDSGP IRQADNQGLT IGI&VTIIjCL 720 

LAAGFWYLK R5Cn.TRLi.FT NKKTTIEKLR CVRPSRPPRG PQPCQAHUGH LGKGLMRKPP 780 

DSYPPKESSPR RLLQCQNVDI SRPUtfGfcNVP QPQSTQRVLP PliKRAPRAPS VPARPLPAKP 840 

AliRQAQGTCK PNPFQKFIiPA DPLART TRliT HALARTPGQW ETGUUAPLR PAPQYPHQVP 900 
R&THTAYIK 

20 a«q ID MO; 99 DMA sequence 
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GAGGAGGAGG GAAAAGGCGA GCAAAAAGGA AGAGTGGGAG GAGGAGGGGA AGCGGCGAAG 60 

GAGGAAGAGG AGGAGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGAGGT 120 

TAATACCAAG AACCATGTGT GCCGAGCGGC TGGGCCAGTT CATGACCCFG GCTTTGGTGT 180 

TGQCCACCTT TGACCCGGCG CGGGGGACCG ACGCCACCAA CCCACCCGAG GGTCCCCAAG 240 

AGAGGAGCTC CCAGCAGAAA GGCCGCCTGT CCCTGCAGAA TACAGCGGAG ATCCAGCAGT 300 

GTTTGGTCAA CGCTGGCGAT GTGGGGTGTQ GGGTGTTTGA ATGTXTCGAG AACAACTOTT 360 

GTGAGATTCG GGGCTTACAT GGGAXXTGCA TGACTTTTCT GCAC&ACGCT GGAAAATTTG 420 

ATGCCCAGGG GAAGTCATTC ATCAAAGACG CCTTGAAATG TAAGGCCCAC GCTCTGCGGC 4B0 

ACAGGTTCGG CTGCATAAGC CGGAAGTGCC CGGCCATCAG GGAAATGGTG TCCCAGTTGC 540 

35 AGCGGGAAXG CXACCXCAAG CACGACCTGr GCGCGGCTGC CCAGGAGAAC ACCOGGGTGA 600 

TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACGA ACCCTACGTG GACCTCGTGA 660 

ACTTGCTGC? GACCTGTGGG GAGGAGGTGA AGGAGGCCAT CACCCACAGC GTGCAGGTTC 720 

AGTGTGAGCA GAACTGGGGA AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCGGOCATCC 780 

AGAAGCCTCC CACGGCGCCC CCCGAGCGCC AGCCCCAGGT GGACAGAACC AAGCTCTCCA B40 

40 GGGCCCACCA CGGGGAAGCA GGACATCACC TCCCAGAGCC CAGCAGTAGG GAGAGTGGCC 900 

GAGGTGCCAA GGGTGAGCGA GGTAGCAAGA GCCACCCAAA CGCCCATGCC CGAGGCAGAG 960 

TCGGGGGCCT TGGGGCTCRO GGACCTTCCG GAAGGAGCGA GTGGGAAGAC GAACAGTGTG 1020 

AGTATTCTGA TATCCGGAGG TGAAATGAAA GGCCTGGCCA CGAAATCTTT CCTCCACGCC 108D 

GTCCATTTTC TTATCTATGG ACATTCGAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140 

ACGCAGGATT CTGTGGGGAC TGXGGACTTC ATCGAGGTGT GTGTTGGCQG AACGGACAGG 1200 

TGAGATGGAG ACCCCTGGGG COGTGGQGTC TCAGGGGTGC CTGGTGAATT CTGCAjCTTAG 1260 

ACOTACTCAA GGGAGCGCGC CCGCGTXATC CTCGTACCTT TGTCTTCTTT CCATCTGTGG 1320 

AGTCAGTGGG TGTOGGCCGC TCTGTTGXGG GGGAGGTGAA CCAG6GAGGG GCAGGGCAAG 1380 

GCAGGGCCCC CAGAGCIGGG CCACACAGTG GGTGCTGOGC CTCGCCCOGA AGCTTCTGGT 1440 

50 GCAGCAGCCT CTGGTGCTGT CTCCGOGGAA GTCAGGGCGG CTGGAXTCCA GGACAGGAGT 1500 

GAATGXAAAA AXAAATATCG CTTAGAATGC AGGAGAAGGG TGGAGAGGAG GCAGGGGCCG 1560 

AGGGG9TGCT TGGTGCCAAA CTGAAAXXCA GTTTCTTGTG TGGGGCCTTQ CGGTTCAGAG 1620 

CTCIXGGOSA GGGTGGAGGG AGGAGTGTCA TTTCTATGTG TAAXTTCTGA GCCAXXGTAC 1680 

TGTCXGGGCT GGQGGGGACA CXGXCCAAGG GAGTGGCCCC TATQAOTTTA TATTTXAACC 1740 

55 ACTGCTTCAA ATCTCGATTT CACTTTTTTT ATTTAXCCAG TTATATCTAC ATATCTOTCA 1800 

TCTAAATAAA TGGCTXTCAA ACAAAGCAAC TGGGXCATTA AAACCAGCXC AAAGGGGGTT I860 

TAAAAAAAAA AAAACCAG CC CATCCTTTC3A GGCXGAIXTT TCTTTTTTTT AAGTTCTATT 1920 

TTAAAAGCTA TCAAACAGCG ACAXAGCCAT ACATCTGACT GCCTGACATG GACTCCXGCC 1980 

CACTTGGGGG AAAOCT TATA CGCAOAGGAA AATACACACC TGGGGAGTAC ATTTGACAAA 2040 

60 TTTCCCTTAG GATTTCGXTA TCTCACCTTG ACCCTCAGCC AAGAXXGGIA AAGGTGCGTC 2100 

CTGGCGATTC CAGGAQACCC AGCTGGAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160 

GAAAGAAGAG AATGAAGACT ACT TAGTAAT TCXTCATCAGG AAAXGCXGAC CTTTTACATA 222 0 

AAATCAAGGA □ACTGCTOAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA 2280 

TTTAGCAATA AGGAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAGAGAGA 2340 
65 GGGAGAGGAA. GAAAAGAGAS AGAGAAAAGA GCCTCQTGCC 

seq ZD MO i 100 Protein sequence 
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70 1 11 21 31 41 SI 

11111} 

MCAEKLGQFM TIiALVIiAXPD PARGTDATNP PEGPQDRSSQ QKGHLSLQNT AEIQSCIiVMA 60 

GDVGCGVFBC FENNSCEIRG ZJJGXCNTFIiH HAGKFDAQGK 8FIKDAUKCK AHALRHRFGC 120 

- I3RXCPAIHE MVSQLQRECY LKHDLCAAAQ ENTEVIVBHI HFKDLLLHEP YVDLVNI^XiT 180 

75 CGEEVKEAIT HSVQVQCEQN KG31.CSILSF CTSAIQKPPT APPERQPOVD RTKDSHAHHG 240 

EAGHHIiPEPS SREXGRGAKG ERGGK8RPWA HARGKVGGLG AQGPSGSSSW EDEQSEYSDT 300 
RR 
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AAGC CCAG CA- GCCCCGGGGC GGATGGCTCC GGCCGCCTGG CTCCGCAGCG CGGCCGCGCG €0 

CC3CCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCGCTGC TGOCCCGGGC 120 

TCTGCCGCCQ OACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180 

AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TQCCACGCAG GAAGCCCCCC GGCCTGCCAG 240 

CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCOGCAA 300 

CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTXACT GAGGTGCACG AGGGCCGTQC 480 

TGACATCATG ATCOACTTCG CCAGGTACTG GCATGGGGAC GACCTGCOGT TTGATGGGCC 540 

TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTT 600 

OGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660 

AGCCCATGAA TTTGGCCACG TGCTQGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720 

GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGX 1BO 

TCAAGACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGQACCCCAG COCTGGGCCC 840 

CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCXGGAG CCAGACGCCC CGCCAGATGC 900 

C1GTGAGGCC TCCTTTGAOG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTOCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCOCTGT GGACGCTGCC TTCGAGGATG COCAGGGCCA 1080 

CATTTGGTTC TTCCAftGGTG CTCAGTACTG GGTGTACgAC GGTGAAAAGC CAGTCCTGGG 1140 

CCOCQCACCC CTCAOCGAGC TCGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 12 00 

GGGTCCCGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC QTTTCCACCC 1260 

CAGCACCCGG CGTGTAGACA GTCCCGTGOC CCGCAGGGCC ACTGACTGGA GAGGGGTOCC 1320 

CTCTGAGATC QACGCTGCCT TCCAGGATGC TGATGGCTAr GCCTACTTCC TGCGCGGCCG 1380 

CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG QAAGGCTTCC CCCGTCTCGT 1440 

GGGTCCTGAC TTCTTTGGCT GTGCCGASOC TGCCAACACT TTCCTCTQAC CATGGCTTGG 150 0 

ATGCCCTCAG GGGTGCTGAC CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560 

ATCTTTGTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 1620 

GGTGGGGTAC AACCACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 

AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 1740 

GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1S0Q 

GTAGCACCAT GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860 

TCCTTCCAGG GGCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTGGC 1920 

TQAGCAACTG GGCTGTAGGG CAGGGCCACT TCCTGAGGTC AGGTCTTGGT AGGTGCCTGC 1980 

ATCTGTCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTXC TCCAGAATCC AGGCCAAAAA 2040 

GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCG CTGGAGGCTG 2100 

CAACATACCT CAATCCTGTC CGAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160 

ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220 
TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCX 

Seq ID NO= 102 Protein sequence 
Protein Accession #i NP_00593l 

1 11 21 31 41 51 
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MAPAAWIiRSA AARAliLPPML DLLLQFPPLI* ARAIiPPDVHH LHAEHRGPQP WKAALPS8PA 60 

PAPATQBAPR PASSLUPPRC OVPDPSDGIiS ARNRQKRFVIi SGGRWEKTDI* TYRlIiRFFWQ 120 

LVQBQVRQIM AEALKVHSDV TPLTFXEVHE GRADIHlDFA RYWHGDDIjPP DGPGGILAHA 1B0 

FFPKTHREGO VHFDYDETWT IGDDOGTDLL QVAAHEPGHV IiGLQHTTAAK ALMSAFYTFR 240 

YPDS&SPDDC &GVQHLYGQP WPTVTSRTPA I/GPQAGIDTN EIAPLBPDAP PDACEA8FDA 300 

VSTIRGELFF PKAGFVWRLH GGQLQPGYPA IASRHWQQLF SPVDAAFEDA QGHIWFFQGA 360 

QYWVYDGEKP VLGPAPLTEL GLVRPPVHAA XATKGPEKNKI YFFBGHDYWR FHPSTKRVDS 420 

PVPRHATDWR GVPSEIDAAF QDADGYAYFI. RGELYMKFDP VXVKAIiEGFP MtVGPDFPGC 480 
AEPANTPL 

•* 

Seq ID NO* 103 DMA gggumagg 
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Coding sequence; 351.. 4499 
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ACTGGGATAA AGCAAGAAGA CTGATTTTAT GAGCAOGGGT TTGATACATC AAAGOAQATT 60 

GGCCAGGAtrC AAGGGTGCGG TGTTOGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 

ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 180 

TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TGCCTGCAGG 240 

GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 

CTTTTCATAT TCTCTGATTC TOGGAAACGA AGAATTGGCA GOAACTGAAA ATGACXAOOA 360 

AGAGGACAXA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 

GGGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 4 GO 

GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 

ATGATGCTGC CTTGAGAACC ATGATTCCCT TOCGTCCCAA GCOGAGGTTT CCTGCCCCCC 600 

AGOCCCTGOA CAATGCTGGC CTGTTCTCCT ACCTCACCQT GTCATGGCTC ACXXXGCTCA 660 

TGATCCAAAG CTTACGGAGT CGCTTABATG AGAACACCAT OCCTCCACTG TCAGTCCATG 720 

ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 

GAGGGATTGA AAAAGCTTCA GTGCTTCTGG 1GATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 

TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 

TTATACCAAA OATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 

GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTOTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 

GGATCAXCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC IGTTTCCTCC TTTGGCTTTG 1080 

AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCX 1140 

TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCIA TGGACCCCXA GTACTGATCA 1200 

CCTGCGCATC GCTGGTCATC 1GCAGCATTT CTTCCTACTT CATTATTGOA TACACTGCAT 1260 

TTATTGCCAT CITATGCTAT CTCCTGQTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 

CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGOGACCA GCGCATCOQT GTGACCAGTG 1380 

AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGIACACATG GGAGAAACCA TTTGCAAAAA 1440 
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TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTQTCCftGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA IS 60 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CTCACGAATT 1660 
5 CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 

TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAOACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 106D 
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAGAAGGG AACAGCCTGQ 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCXGCG 1980 
10 GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGOCTATGT COCCCAGCAG GCCTGGATOG 2100 
TCAGCGGGAA CA.TCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGAOCTGGA ACITCTGCCC TTTGGAGACA 2220 
TGACAGAGAT tggagagcgg ggcctcaacc tctctggggg GCAQAAACAG AGGATCAGCC 2 2 BO 
15 TGGCCCGCGC OGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTC7TGCTG 2340 
TGGACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT TAAGAAGACA CTCAGGGGGA 2400 
AGACGGTCGT CCTGGTGACC CACCflGCTGC AGTACTTAOA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAQT TAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CftGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 
20 ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 
AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAOGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAQGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGGAGCAOG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 28 BO 
25 ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCXGTGTGGG GOTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CAOGAGGAAG GCATCCACGG CCCTGCACAA CAAGCTCTTC AAGAAGGTTT 3060 
TGCGCTGCOC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 
CAGGGGACTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3180 
30 TGTCCTTAAT GGTGATCGCC GTCCTGTTGA TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATQATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGOC 3420 
AGTTrAAGAG GCTGACTGAT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 
35 GATGGATOGC ATTGAGGCTG GAGATCATGA CCAACCTXGT GACCTTGGCT GTTGCCCTGT 35 40 

TCGTX3GCTTT TGGCATTTCCZ TCCAOCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAQCT GGCGXCCAGC TTCCAGGCCA CIGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 
AGTTCACGGC TGTAGAQAGG ATACTGCAGT ACATGAAGAT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCftTATTTC 3780 
40 AGGATTAT CA CATGAAATAC AOAGACAACA CACCCACCGT nCTTCACGGC ATCAACCTOA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGQ AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 
TGGACATTTG CAGCATOGGC CTGGAGGACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4.020 
ATCCAGTGCT GCTCTCAGGA ACCATCAGAT TCAACCTAGA TCCCTTTGAC CGTCACACTG 4080 
45 ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 

CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGGCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 
CCACAGCCTC CATTGACATG GASACAGACA OCCTGATOCA GCGCACAATC CGTGAAGCCT 4320 
TCCAOGGCTG CAC03TGCTC GTCATTGCCC ACCGTOTCAC CACTGTGCTG AACTGTGACC 4380 
50 ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 
AGAAGOCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500 
GAGATGTGGA GACTXCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 
TCGAGGCOCA CAGICTGCGA CCTTCTTGTT TGGAGAXGAG AACTTCTCCT GGAAGCAQGG 4620 
GTAAATGTAG GGQGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
55 TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 

TCCTTTTAAC TTACA3X3CTG AATAATTTTA TAATAAQGTA AAAGCTTATA GTTTTCTGAff 4800 
GTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 48 60 
TC 
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1 11 21 31 41 51 

« $ I 1 I 1 I 

OJ MTRKRTYWVP HSSGGLVMRG HJlGDDMVSG LIYKTYTLQD GPWSQQERNP EAPGRAAVPP 60 

WGKYDAAIiRT MIPFRPKPRF PAPQPLDUAG LFSYLTVSWL TPIWIQ3LRS RlJOfiMTIPPL 120 

EVHDASDKNV QELHRiWEEE VSRHG1EKAS VLLVMLRFQR TRLIFDALLG ICFCJASVLG 180 

PILIIPKILE YSEEOLGNW HGVGLCFALP LSECVKSLSF SSSWJXNQRT AIRFHAAV88 240 

FAFBKL2QPK SVIHIT9GEA ISPPTGDVNY IjFEGVCYGPIi VBITCASLVI CSISSYFIIG 300 

70 YTAFIAXLOT IiLVFPLAVFM TRMAVJCAQHH TSEVaDORtR VTSEVLTCIK I»IHMYTWEKP 360 

FAKIIEDLRR KERiOJjEKCG LVQgiiTSITL FIIPTVATAV WVLLHTSLKL KLTASMAFSM 420 

LASLNLUUiS VFFVPIAVKG LTMEKSAVMR PKKFFLQESP VFYVQTLQDP 3KALVFSEAT 480 

LSWQQTCPGI VNGALELERN GHA3BGMTRP RDALGPEEEG NSZX3PELHK1 NLVVSKGMMLi 540 

GVCGNTGSGK SSU^SAILEE MHLLBGSVGV QGSIAYVPQQ AWIVSGNIRS NILMGGAYDK 600 

75 ARYl^VLHCC SLNRDLEKLP FGDMTEIGER GLNLSGGQKQ RISLARAVYS DRQIYLLDDP 660 

liBAVDAHVGK HIPEECIKKT LHGKTWLVT EQLQYliEFGG QIILLENGKI CEMOTHSELM 720 

QKKGKYAQLI QKMHKEATSD MLQDTAKIAE KPKVESQALA TgLEESLKGN AVPEHQLTQE 7B0 

EEMEEGSLSW RVYHHYIOAA GGYMVSCIIF FFWIiIVFLT IPSFWWBSYM UEQQSGTN6S 840 

0/x RESNGTHADI* GNIADNPQIiS FYQLV^GLNA LIJ-ICVOVCS SQIPTKVTRlC ASTAtHNKLP 900 

80 UKVPRCPMEP PDTIPIGKLL NCPAGDLEQL DQIiPIFSEQ PLVLSLMVIA VLUVSVDSP 960 

YILIiMGAUM VICFIYYMMF KKAIGVFKHL EHYSRSPLTS HILNSLQGLS SIKVYGKTED 1020 

F1SQFKKLTD AQBCWLLLFL SSTRWMALRL EIMTNIiVTLA VALPVAPGIS STPYSFKVMA 1080 

VNTVLQIiASS FQATARIGI*B TBAQFTAVER IUQYMKMCVE EAPLHMEGTC CPQGWPQHGE 1140 

riPQDYHMKT RDNTPTVLHG XHLTIRGHEV VGIVGRTGSG KSSLGMALFR LVSPMAGRIIi 1200 
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IDGVDICSIG LEDLR3KL.SV IPQDPVUjSG TIRFNLDPFD RHTDQQIWDA LERTFLTKAI 1260 
SKPPKKLHTD WENGGNFSV GKRQLLCIAR AVLRNSKIIL, IBEATASIDK ETDTLIQRTI 1320 
REAPQGCTVL VIAHRVTTVIj KCDHILVMGN GKWEFDRPE VbRKKPGSLF AAIMATATSS 13 60 
lift 

Seq ID NO: 105 DMA gequ**"^ 

Nucleic Acid Accession #2 Bos sequence 

Coding se<juence: 1-.4043 
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ATGACTAGGA AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT C3AATCGTGGC 60 

ATCGACATAG GCGATGAOVT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT 120 

GGCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG 180 

TGGGGGAAGT ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTOCCAA GGOGAGGTTT 240 

CCTGCCCCCC AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC 300 

ACCCXZGCTCA TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG 360 

TCAGTCCATG ATCCCTCAGA CAAAAATGTC CAAAGGCTTC AOCGCCTTTO GGAAGAAGAA 420 

GTCTCAAGGC gagggattga AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA 4 bo 

ACAAGGTTGA TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTOQGG 340 

CCAATATTGA TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC 600 

CATGGAGTGG GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC 660 

TCCTCCAGTT GGATCATCAA CCAACGCACA GCCATCAGGT TGCGAGCAGC TGTTTCCTCC 720 

TTTGCCTTTG AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC 7 B0 

ATCAGCTTCT TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGAOCCCTA 840 

GTACTGATCA CCTGCGCATC GCTGGTCATC XGCAGCATTT CTTCCTACTT CATTATTGGA 900 

TACACTGCAT TTATTGCCAT CTTATGCXAT CTCCTGGTTT TCCCACTGGC GGTATTCATG $60 

ACAAGAATGG CXGTGAAGGC TCAGCAtfCAC ACATCTGAGG TCAGCOACCA GCGCATCCGT 1020 

GTGACCAGTG AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA 1060 

TTTGCAAAAA TCATTGAAGG TATGGAAAGT CTGACTTTCT □CTCCAAACC TGGTGATGGC 1140 

ATGGCCTTCA GCATGCTGGC CTCCTTGAAT CTCCTTCGGC TGTCAGTGTT CTTTGTGCCT 1200 

ATTGCAGTCA AAGGTCTCAC GAATTCCAAG TCTGCAGTGA TGAOGTTCAA GAAGTTTTTC 1260 

CTOCAGGAGA GCCCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAGCAA AGCTCTOGTC 1320 

TTTGAGGAGG CCACCTTGTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 1360 

GAGCTGGAGA GGAAOGGGCA TGCTTCTGAG OGGATGACCA GGCCTAGAGA TGCCCTCGGG 1440 

CCAGAGOAAG AAGGGAACAG CCTGGGOCCA GftGTTGCACA AGATCAAOCT GGTGGTGTCC 1500 

AAGGGGATGA TGTTAGGGGT CTGOGGCAAC ACQGGGAGTG GTAAGAGCAG CCTGTTOTCA IS 60 

GCCATCCTGG AGGAGATGCA CTTGCTCGAG GGCTOGGTGG GGGTGCAGGG AAGCCTGGCC 1620 

TATGTCCCCC AGCAGGCCTG GATCGTCAGC OGGAACATCA GGGAGAACRT CCTCATGGOA 1680 

GGOGCATATQ ACAAGGCCCG ATACCTCCAG GTGCTCCACT GCTGCTCCCT GAATCGGGAC 1740 

CTGGAACTTC TGCCCTTTGG AGACATGACA GAGATTGGAG AGOGGGGCCT CAACCTCTCT 1800 

GGGGGGCAGA AACAGAGGAT CAGCCTGGCC GGCGCCGTCT ATTCOGACCG TCAGATCTAC 1860 

CTGCIGGACG ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAG 1920 

TGCATTAAGA AGACACTCAG GGGGAAGACG GTCGTCCTGG TGACCCACCA GCTGCAGTAC I960 

TTAGARTTTT GTGGCCAGAT CATTTTGTTG GAAAATGGGA AAATCTGTGA AAATGGAACT 2040 

CACAGTGAGT TAATGCAGAA AAAGGGGAAA TATGCCCAAC TTATCCAGAA GATGCACAAG 2100 

GAAGCCACTT CGGACATGTT GCAGGACACA GCAAAGATAQ CAGAGAAGCC AAAGGTAGAA 2160 

AGTCAGGCTC TOGCCACCTC CCTGGAAGAG TCrrCTCAACG GAAATGCTGT GOCGGAGCAT 2220 

CAGCTCACAC AGGAGGAGGA GAIGGAAGAA GGCTCCTTGA GTTGGAGGGT CTACCACCAC 2260 

TACATCCAGG CAGCTGGAGG TTACATGGTC TCTTGCATAA TTTTCTTCTT CGTGGTGCTG 2340 

ATCGTCTTCT TAACGATCTT CAGCTTCTGG TGGCTGAGCT ACXGGTTGGA GCAGGGCTCO 2400 

GGGACCAATA GCAGCOGAGA GAGCAATGGA ACCATGGCAG ACCTGGGCAA CATTGCAGAC 2460 

AATGCICAAC TGTCCTTCTA CCAGCTGGTG TAGGGGCTCA AGGCCCTGCT CCTCATCTGT 2520 

GTGGGGGTCT GCTCCTCAGG GATTTTCACC AAAGTCACGA GGAAGGCATC CACGGCCCTG 2580 

CACAACAAGC TCTTCAACAA GGTTTTCOQC TQCCCCATGA GTTTCTTTGA CAOCATCCCA 2640 

ATAGGCCGGC TTTTGAACTG CTTCGCAGGG GACTTGGAAC AGCTGOACCA GCTCTTGCCC 2700 

ATCTTTTCAG AGCAGTTCCT GGTCCTGTOC TTAATGGTGA TOGCCGTCCT GTTGATTGTC 2760 

AGTGTGCTGT GTCCATATAT CCTGTTAATG GGAGGCATAA TCATGGTIAT TTGCTTCATT 2820 

TAT TATATGA TGTTCAAGAA GGCCATCGGT GTGTTCAAGA GACTGGAGAA CTATAGCCGG 2680 

TCTCCTTTAT TCTCCCACAT CCTCAATTCT CTGCAAGGCC TGAOCTCCAT CCATGTCTAT 2940 

GGAAAAACTG AAQACTTCAT CAGCCAGTTT AAGAGGCTGA CTGATGCGCA OAATAACTAC 3000 

CXGCTGTTGT TTCTATCTTC CACACGATGG ATGQCATTGA GGCTGGAGAT CATGACCAAC 306O 

CTTOTGAGCT TGGCIX3TTGC CCTGTTaGTG GCTTTTGGCA TTXCCTCCAC CCC5CTACTCC 3120 

TTTAAAGTCA TGGCTGTCAA CATCGTGCTG CAGCTGGCGT CC3ySCTTCCA GGCCACTGCC 3180 

CGGATTGGCT TGGAGACAGA GGCACAGTTC ACGGCTGTAG AGAGGATACT GCAGTACATG 3240 

AAGATGTGTG TCTCGGAAGC TCCTTTACAC ATGGAAGGCA CAAGTTGTCC CCAGGGGTGG 3300 

CCACAGCATG GGGAAATCAT ATlTCAGGAT TATCACATGA AATACAGAGA CAACACACCC 3360 

ACCGTGCTTC ACGGCATCAA CCTGACCATC CGOBGOCAOG AAGTGGTGGG CATCGTGGGA 3420 

AGGAOGGGCT CTGGGAAGTC CTCCTTGGGC ATGGCTCTCT TCCGCCTGGT GGAGCCCATG 3480 

GCAGGCOGGA TTCTC3VXTGA OGGGGTGGAC ATTTGCAGCA TCGGCCTGGA GGACTTGCGG 3540 

TCCAAGCTCT CAGTGATCCC TCAAQATCCA GTGCTGCTCT CAGGAACCAT CAGATTCAAC 3600 

CTAGATCOCT TTGACCGTCA CACTGACCAG CAGATCTGOS ATGCCTTGGA GAGGAGATTC 3660 

CTGACCAAGG CCATCTCAAA GTTCCCCAAA AAGCXGCATA CAGATGTGGT GGAAAACGGT 3720 

GGAAACTTCX CTGTGGGGGA GAGGCAGCTG CTCTGCATTG CCAGGGCTGT GCTTCGCAAC 37SO 

TCCAAGATCA TCCTTATGBA TOAAGCCACA QCCTCCATTG ACATGGAGAC AGACACCCTG 3840 

ATCCRGCGCA CAATCCGTGA AGOCTTOCAG GGCTGCACCG TGCTCGTCAX TGCOCACOGT 390O 

GTCACCACTG TGCTGAACTG TGACCACATC CTGGITATGG GCAATGGGAA GGTGGTAGAA 3960 

TTTGATCGGC OGGAGGTACT GGGGAAGAAG OCTGQGTCAT TGTTCGCAGC CCTCATGGCC 4020 
ACAGCCACTT CTTCACTGAG ATAA 
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MTRKRTYWVP NSSGGIAOSRG IDIGDDHVSG LITKTYTLQD GPWSOQERNP EAPGRAAVPP 
T^GKYDAMjRT MIPFRPKPRP PAPQPL.DNAG USYliTVSHI, TPLMIQSI/RS PPI*' 
SVHDASDKNV QRLHRLWEBE VSRRGIKKAS VLIiVMIjRFQR THLIFDAIiG ICFCIASVTjG 
PILIIPKILE YSEBQLGNW HGVGLCFALP LSECtfKSLSF SSSWriHQRT AIRFRAAVS8 
FAFEKLIQFK SVIHITSGEA ISFPTGOVNY LFEGVCYGPI, VMTCASLVI CSISSYFIIG 
YTAFIAIIiCY LLVFPlAVFH TRMAVKRQHH TSEVSDQRIR VTSEVLTCIK IiIKMYTWEKP 
FAKIIBGMES I.TFCSKEGDG MAFSMLASLN liIiRLSVFFVP IAVKGLTNBK 8AVMRFKKFF 
LQE9PVFYVQ TLQDPSKALV FEBATIiSHQQ TCPGIVNGAL ELERNGHASB GMTRPRDALG 
PBBEGNSLGP EZJBK3NI.WS XGM*LGVCG» TGSGK6SLLS AILEEMHLLE GSVGVQGSIiA 
YVPQQAWIVS GWIREN1LMG GAYDKARYI.Q VLHCCStiNRD LELLPFGDHT EIGERGLNLS 
GGQKQRISIA RAVYSDRQIY LUDDPLSAVD AHVGKMIFEE CJKKTLRGKT WLVTHQIiOY 
LEPCGQIILL ENGKICENGT HSELMQKXGK YAOfclQKMHK EATSDMLQDT AKIAEKPKVE 
SQA1ATSLEE SkNGNAVPEH QLTQEEEMEE QSLSWHVYEH YIQAAGGYMV SCIIFFFWL 
IVFI/TIFSFW WLSYWLEQGS GTM3SRESHG TMADIX3AIAD HFQLSFYQCV YGLNALLliIC 
VGVCSSGIFT KVTRKA9TAI* HNKLFNKVFR CPMSFFDTIP IGRLLNCFAG DLEQLDQIiTiP 
IFSEQFIiVIiS l^MVIAVLLTV SVIiSPYILLK GAIIHVICFI YYMMFKKA1G VFKKLENYSR 
SPLFSHILNS LQGL9SIHW GKTEDFISQF KRIiTUAOHNY LIJJFbSSTHW MALRLEXMTN 
LVTLAVAIaFV AFGISSTPYS FKVWAVNIVL QLASSFQATA RIGLETEAQF TAVERHjQYM 
KMCVSEAPLH MEGTSCPGGW PQHGEIIFQD YHMKYRDNTP TVIiKGIML/TI HGHEWGIVG 
RTGSGK3SLG MALFRLVEPM AGRILIDGVD ICSIGIiEDLR SKLSVIPQDP VLIiSGTXRFN 
liDPFDRHTDQ QIWDALERTF 1>TKAI SKF PK KLHTDWEUG GWFSVQERQXj LCIARAVIiRJJ 
SKIILIDEAT ASIDMETDTI* IQRTIREAFQ GCTVLVIAHR VTTVLNCDHI LVHONGKWE 
FDRPEVLRKK PG9LtFAALJ*3A TATSSIiR 

Seq ID NO: 107 DMA sequence 
Nucleic Acid Access ion KMJJ24022 
Coding sequence : 202 .,1563 

1 11 21 31 41 51 

I I 1 I I I 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA SO 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TXCCGTGGTG 120 

OCATCTACAT TTTTGGGACT CQGGAATTAT GAGGTAGAGG TGGAGGOGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTQA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGITG CTGCACAGAT CCTGTCACTG CTGGCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCX3ACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATGG AGCTGAIAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGQAC GAGTACCGCT GTGTCCGGGT GGGTGGTCftG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCI TCGTGGAAGA OCATGTBCTC CGATGACTGG ■ 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCIGGAGGGG CAGTTCCGQG AGGAGTTTGT GTOCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACIGCA TTACACCACT CAGTATATGT GAGGGAGGOA 780 

TGTG CCTCTG GCCACGTGGT TACCTTGCAG TGCACAGOCT G1GGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TOGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAQGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTQTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CA7GGACCAT CCAGGTOGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAQAT TGTCTACCAC 1OS0 

AGCAAGTACA AGDCAAAGAG GCTGGGCAAT GACATOGCCC TTATGAAGCT GGCOGGGCCA 1140 

CTCACGTICA ATGAAATGAT CCAGCCTGTG TGCCTQCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TQTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGAOGCCTCC 1260 

CXZTGTCCTGA ACCACGOGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTADGGTG GCATCATCTC COCCTCCATG CTCTGCGOGG GGTACCTGAC GGGTGGOSTG 1380 

GACAGCTGCC AGGGQGACAG CGGGGGGCCC CTOGTGTGXC AAGAQAQGAG GCTGTGGAAC? 1440 

TTAGTGGGAG OGACCAGCTT TGGCATCGGG IGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACClGA GTXCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCGCCTGGAC TCCCQTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

GCTGCTTTTT OTTTTTTGTT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAQT 1800 

GCAGTGGGGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG I860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 19B0 

CAAATOATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCAOG 2040 

CCTAGCCTCA 05CTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCXZTTTC CCACTGGTCC ATCTGGTTTT CXCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCAGGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAG TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TIGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAA 

Seq ID NO- 10B Protein sequence 
Protein Accession #: NP_07G937 

1 11 21 31 41 51 

I t I I I I 

MGENDPPAVE AFPSFRSIiFQ U3DDKISPVA PDADAVAAQI L8LLPI1KFFP IIVIGIIALX 60 

LAT.MGLGIH PDCSGKYRCR SfiFKCIEIiIA RCDGWSDCKD GBDEYRCVRV GGQNAVLQVF 120 

TAA6WKTHCS DDWBSHYANV ACAQLGFPSY VSSDKLRVSS ItEGQFREEFV SIDHLLPDDK 100 

TTTAIOmSVYV REGCASGHW TLQCTACGHR RGYSSRTVGG NMSLIiSCJHPW QASLQFQGYH 240 
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I.OGG8VXTPL WIITAAHCVY DLYLPKSWTI QVGIiVSLUDN PAPSHEVEKI VYHSKYKPKfc 300 

LGNDIALMKL AG Pl/TFNE M I QPVCLPNSEE NFPDGKVCWT SGWGATEBGG DASPVLNHAA 360 

VPLISMKICM H&DVYGGIIfi PSMLCAGYU GGVDSCQGDS GGPLVOQERR LWKLVGATSF 420 
^ GIGCAEVNKP GVYTRVTSFL DWIHEQMJERD LET 

Seq ID HQs 109 DMA sequence 

Wucleic Acid Accession #: NM 000493.2 

Coding sequence : 97 2 13 9 

10 1 11 21 31 41 51 

I I I I I i 

CACCTTC1GC ACTGCTCATC TGGGCAGAGO AAGCTTCAGA AAGCTGCCAA GGCACCATCT 60 

CCAGGAACTC CCAGCACGCA GAATCCATCT GAGAATATGC TGCCACAAAT ACCCTTTTXG 120 

CTGCTAGTAI CCTTGAACTT GGTTCATGGA GIGXXXTACG CTQAADGATA CCAAATGCCC 180 

15 ACAGGCATAA AAGGCCCACT ACCCAACACC AAGACACAGT TCTTCATTCC CTACACCATA 240 

AAGAGTAAAG GTATAGCAGT AAGAGGAGAQ CAAGGXACXC CTGGTCCACC AGGCCCTGCT 300 

GGACCTCGAG GGCACCCAGG TCCTTCXGGA CCACCAGGAA AAGCAGGCTA CGGAAGTCCT 3E3 

GGACTCCAAG GAGAGCCAGG GXTGCCAGGA CCACOGGGAC CATCAGCTGT AGGGAAACCA 420 

GGTGTGCCAG GACTCCCAGG AAAACCAfiGA GAGAGAGGAC CATATGGACC AAAAGGAGAT 480 

20 GTTGGACCAG CTGGCCXACC AGGACCCOGG GGCCCACCAG GAOCACCTGG AATCCCTGGA S40 

CCGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGQ GAjCCCACAGG AGCCCCAGGA 600 

CCCAGGGGCT TTCCTGGAGA AAAGGGTGCA CCAGGAGTCC CTGGTATGAA XGGACAGAAA 660 

GGGGAAATOG GATATGGTGC TCCTGGTCGX CCAGGTGAGA GGGGXCTTCC AGGCCCTCAG 720 

GGTCCCACAG GACCATCTGG CCCTCCTGGA GTGGGAAAAA GAGGTGAAAA TGGGGTTCCA 780 
25 GGACAGCCAG GCATCAAAGG TOATAGAGGT XTTCCGGGAG AAATGGGACC AATTGGCCCA B40 
CCAGGTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAOCT 900 
GCTGGAGCCC CAGGCCAGCC AGGGAITCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960 

ATAGCTGGGC CCCCAGGGCC TCCTGGCTTT GGGAAACCAG 5CTTGCCAGG CCTGAAGGGA 1020 

GAAAOAGGAC CTGCTGGCCT TCCTGGGGGT CGAGGTGCCA AAGGGGAACA AGGGCCAGCA 1080 

30 GGTCTTCCTG GOAAGCCAGG TCTGACTGGA CCCCCTGGGA ATATGGGACC CCAAGGACCA 1140 

AAAGGCATCC CGGGTAGCCA TGGTCTCCCA GGCCCTAAAG GTGAGACAGG GCCAGCTGGG 1200 

CCTGCAGGAT ACCCTGGGGC TAAGGGTGAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 1260 

GGGTACCCAG GAAAACCAGG TCTCGATGGT CCTAAGGGTA ACCCAGGGTT ACCAGGTCCA 1320 

AAAGGTGATC CTGGAGTTGG AGGACCTCCX GGTCTCCCAG GCCCTGTGGG CCCAGCAGGA 1380 

35 GCAAAGGGAA XGCCOGGACA CAATGGAGAG GCTGGCGCAA GAGGTGCCCC TGGAATACCA 1440 

GGTACXAGAG GOCCTATTGG GCCACCAQGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT 1500 

CCAGGAAGTC CCGGTCCTCC TGGCCCAGCT GOCATAGCAA CTAAGGGCCT CAAXGGACCC 1560 

AOCGGGCCAC CAGGGCCXCC AGGTCCAAGA GGOCACTCTG GAGAGCCTGG TCTTCCAGGG 1620 

CCCCCTGGGC CTCCAGGCCC ACCAGGTCAA GCAGTCATGC CTGAGGGTTT TATAAAGGCA 1680 

40 GGCCAAAGGC CCAQTCTTTC TGGGACCCCT CTTGTTAGTG CCAACCAGGG GGTAACAGGA 1740 

ATGCCTGTGT CXGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGGAACTCCC 1B0D 

ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTQGAATC 1860 

TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 1920 

CATGTTTGGG TAGGCCTGXA TAAGAATGGC ACCCCTGTAA TOTACACCTA TGATGAATAO 1980 

45 ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGIGCCATCA TCGAXCTCAC AGAAAAIGAC 2040 

CAGGTGTGGG tccagcttcc caatgccgag tcaaatggcc tatactcctc TGAGTATGTC 2100 

CACTCCTCTT TCXCRGGATT CCTAGTGGCT CCAATGTGAG TACAOOCCAC AGAGCTAATC 2160 

TAAATCTTGT gctagaaaaa gcattctcta ACTCTACCCC ACCCTACAAA ATGCATATGG 2220 

AGGTAGGCTG AAAAGAATGT AATTTTTAXT TTCTGAAATA CAGATTTGAG CTATCAGACC 2280 

50 AACAAAOCTT OOCCCTGAAA AGTGAGCAGC AACGXAAAAA CGTATQTGAA GCCTCTCTTG 2340 

AATTTCTAGT TAGCAATCTT AAGGCTClTT AAGGTTTTCT CCAATATTAA AAAAXATCAC 2400 

CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 2460 

TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATQTGAAA TTTGATTTGA GAAACTCGGC 2520 

ATTTCCTTTT TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2S80 

55 AGOAGGTATC ATATAACTTT GTAGAACTTA AATACTXGAA TATTCAAATT TAAAAGACAC 2640 

TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2700 

CAATATCTAT TCAAATATAC AGGTGCATAT ATACTTGTXA AAGCTCTTAT ATAAAAAAGC 2760 

CCCAAAAIAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2820 

CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAGCA TAACTTGGAA ACAGGTATCT 2800 

60 GACCTATTCT TATTTAGTTA ACACAAGTGT GATTAATTTG) ATTTCTTTAA TTCCTTAXTG 2940 

AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACATQTAC CTTGTGCCTC 3000 
CCATTCAAOT GAAGTTATAA TTTACACTGA GGGTTTCAAA ATTCGACTAG AAGTGGAGAT 30fiO 
ATATTATTTA TTTATGCACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAGCTG 3120 
TGCCTCACTT AXTAAAGCAC AAAATGTTTT ACX7TACTCCT XATTTACGAC ACAATAAAAT 3180 

65 AACAT CAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 

TTCTTTCAAG GCTTTTCATT CGACACAATA AAATAACATC AATAG 



Seq 3D WO: 110 pyqtein sequence 
_ Protein Accession # : JJP 000484.2 

70 

1 U 21 31 41 SI 

1 I 1 1 I I 

MLPQIPFIJiL VSMILVHGVP YAERYQMPTG XKGPIaPNXKI QFFIFVTIKS KGIAVXGHQG 60 

TPGPPGPAGP UGHPGPSGPP GKPGYGSPGL QGEPGLPGPP QPSAVGKPGV PGLPGKPGBR 120 

75 GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 

VPGHHGQKGE MGVGAPGRPG ERGIiPGPQGP XGP8GPPGVG KROENGVPGQ PGIKGDRGFP 240 

GEMGPXGPPG PQGPPGSRGP BGIGKPGAAG APGQPGIPGX KGLPGAPGIA GPPGPPGFGK. 300 

POLPGLKGER GPAGLPGGPB AKGSQGPAGIj pgkpgltgpp GHMGPQGPKG JPGSHGLPGP 360 

XGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGItPGPKG DPGVGGPPGH 420 

80 PGPVGPAGAK GMPGHBGBAG PRGAPGIPGT HGPIGPPGIP GFPGfiKGDPG aPGPPGPAGI 480 

ATKGLNGPTG PPGPPGPRGH SGEPGLPGPP GPPGPPGQAV MPBGFIKAGQ RPSDSGTPLV S40 

SANQGVXGMP VSAFTWILSK. AYPAIGTPIP FPKILYNHQQ H3TOPHTGIFT CQIPGlYYPS 600 

YHVHVKGTHV ttVGLYKHGTP VMYTYBEYTK GYLDQAfiGSA XTDIjTENDQV WLQLPNAESN 660 
GLYSSKYVH3 SPSGPLVAPM 



995 



WO 03/042661 



Seq ID NO* 111 DMA sequence 
KFucleic Acid Accession tft NM_000 949 
Coding sequence: 285.. 2153 

1 11. 21 31 4.1 51 

111111 

QGAGGCTCAA ATCCCCAGAC GGCQGTTTTC TGQGCTOGGC TTTCTGCTTA CTCACTCCTT 60 

CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 

ATQTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 160 

TAAAQAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 

GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 

CA.TCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATC3GACAGT 360 

TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 

GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 460 

GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 

GCCAJCTTXGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 

CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTAXGT GGACGTGACT TACATAGTTC 660 

AGCCAGACCC TOCTTTGGAG CTGtSClJGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 

ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTOACTT AAAAACTGGT TGGTTCACGC 7 BO 

TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGQGAGATC CATTTTGCTG 840 

GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGB 900 

TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 

TACCTAGTGA CTTCACCA1G AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 

CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAQC ATGGTGACCT 10 B0 

GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TGATGCTCAT CTGTTGGAGA 1140 

AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 

ACTATGAGGA CTTGCTGGTG QAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260 

TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 

ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTQTGAGG 1380 

AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAOAAG CCAGAGAATC 1440 

CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 

TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 

CCAGATCCTC TEACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 

CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTIT AAAATCCTCT CAAACCATTA 1680 

AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CRTTCTGAGA 1740 

CTGACCAGGA TACGCCCTGG C1GCTG0CCC AGGAGAAAAC OCCCTTTGac TCCGCTAAAC 1800 

OCTTGGATTA TGTGGAGAT T CACAAOOTCA ACAAAGATGG TGCATlATCA TTGCTACCAA 1860 

AACAGAGAGA GAACAGCOGC AAGCOCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 

ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 

CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 

ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 

TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACAGA CTCCTTTCAC TGATAGCTTG 2160 

ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 

AATGCTCAAG AATGTAGTCA GACTGACACT ACTAAAGCTC OCAGCTCCTT TCA TOCT CCA 2280 

TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 

AACTCTGATT TGTAGATTTA C1TTTTGCTG T TAGTT ATAA AACTATGTGT TCAATGAAAT 2400 

AAAAGCACAC TGCTTAGTAT TCTTGAGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 

□CTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTAOCATAGA 2520 

AAGATGACAA AAQAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 

AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGGA TTACCCAAAT ATAACCATTT 2640 

AAAGAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 

Seq m NO: 112 Protein sequence 
Protein Accession #; NP_O0OS40 



l 
I 

MKEKVASATV 
SLTYHREGET 

vnrrvoEDPp 

EIHFAGQQTE 
VAVIiSAVICL 
FPPTSDXBDIi 



PSQHNPRSS* 



EMNKEYAKVS 
KCRDOJLGGLD 



11 
I 

FTLLLFLKTC 
TuMHECPDYIT 
LBLAVHVKQP 
FKrbSIiHPGQ 
I I VWAVALKQ 
LVEYLEVDDS 
PSTFYDPBVI 
KKITDVGELA 
PMLItPQEKTP 
GVMDMKILVIj 
YLDPACFTHS 



21 
1 

uaragiiPPGK 

GGENSCHPGK 

KYLVQVRCKP 
YBHVTC1PPP 
SDQHLMSVHS 
EKFBHPETm 
VGPAGAPATL 
FGSAKPLDYV 
VPDPHAKNVA 
FH 



31 
I 

PEIFKCRSPK 
OYTfiMHRTYI 
WSPPTEjIDUC 



41 51 
I I 

KETFTCHWRP GTDGGLPTWY 60 

MKVHAXUQHG 6SF8DSLYVD 120 

TGWFTLXjYEI KLKPEKAABW 180 

ATPlQIPfiDF TMNDTTVWI9 240 

AKLLEKGK5 E KLLSALGCQD 300 

TCLDFDTDSG RGSCDSPSIiL 360 

GKIPYFHAGG SKCSTHPLPQ 42 0 

5SGTIKSREE GKATQQREVB 480 

IjSIiLPKQREH SGKPKKPGTP 540 

CFEEBAKEAP PSLBQNQAHK ALAN FT ATS S 6D0 



VPGFKZKGFD 



TWDPQCISME 
LiJEAGKDABK 



Seq ID HOs 113 DMA sequence 
Nucleic Acid Accession #; XM_0628il 
Coding eeguence: 1..88B 



1 

\ 

ATGTGGGGOG 
CTGCIGCTGG 
TGGCTOGACG 
GGCGACGCCA 
GCGCGCCTGG 
CGGGCGGACA 
GTTGGCTCOG 



11 
1 

CTCGCCGCTC 
CTGCGCTGCT 
CGCAGGGCGT 
CCATCTGCTG 
ACCAGGGCGG 
AAGACGGCCC 
TQTTTQTQ3C 



21 
1 

GTCCGTCTCC 
QGCGGCGOGG 
CTGGCGCATC 
CGGCAGCTGC 
CTGCGACAAT 
CGACGGCTCX3 
CTTTATCATC 



31 
1 

TCATCCTGGA 
GGGAGGGCCA 
GGCTTCCAGT 
GCGTTGCGCT 
GACCGCCAGC 
GCAGTGCCCA 
TTQGGGTCCC 



41 
1 

ACGCCGCrrc 

GCGGCGAGTA 
GTCCCGAGCG 
ACTGCTGCTC 
AOGGCGCTGG 
TCTACGTGCC 
TGGTGGCAGC 



51 
I 

GCTCCTGCAG 
CTGCCACGGC 
CTTCGAOGGC 
CAGCGCCGAG 
CGAGCCTGGC 
GTTCCTCATT 



60 
120 
180 
240 
300 
36D 
420 



996 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



AGATGTCTCC 
ATGGAGACCA 
TCCAGCACAG 
AGGTCACAGA 
CCCACGAATT 
CAGTATCTGC 
GCTGTGCCAC 
CCTCACACCA 



GGOCTAAGCA 
TCCCCATGAT 
CTGCCAGTTC 
CCAAGTGTTG 
TCTCTGTGCT 
ATCCCCCATA 
CTTTCATGGA 
ACAGTGAACA 



GGATCCCCAG 
CCCCAOTGCC 
CAGCTCCAGC 
CTTGCCGGAA 
GAACTGTCAG 
CGTGGGGTAC 
CGGCCTGCAG 
GAAGATGTAC 



Seq ID MO: 114 Protein sequence 
Protein Accession ff: XP 062811 



CAGAGCCGAG CCCCAGGGGG TAACCGCTTG 
AGCACCTCCC GGGGGTCGTC CTCACGCCAG 
GCCAACTCAG GGGCCOGGGC gcccccaaca 
GGGACCATGA ACAACBTGTA TGTCAACATG 

caggccaccc AGATTGTGCC acatcaaggg 

ACGGTGCAGC ACGACTCTGT GCCCATGACA 
CCTGGCTACA GGCAGATTCA 
CCAGCGGTGA CTGTATAA 



460 
540 
600 
GGO 
720 
780 
840 



1 11 21 31 41 51 

t I I I 1 I 

MHGARRSSVS 3SWNAAGLLQ T.T.TiAALLAAG AUASGEYCHG WLDAQGVWRI GFQCPERFEG 
GOATXCCGSC ALRYGCSSAE AELEQGGCDN DRQQGAGEPG RADKDQPCQS AVPIYVPFLI 
VGSVFVAFII LG9I»VAACCC RCLRPKQDPQ QSRAPGGNRL KETIPMIPSA STSRGSSSSQ 
B6TAAS8SSS ANSGARAPPT BSQJTNCCLPE GTMNMVYVNM PTNFSVIiNGQ QATQIVPHQG 
QYLHPPYVGY TVQHDEVPMT AVPPFMDGLQ PGYRQIQSPF PHTMSEQKMY PAVTV 

Seq ID NOs 115 DHA sequence 
Nucleic Acid Accession fl: NM_013257 
Coding sequence! 223 . ,1512 

1 11 21 31 41 51 

I I I I I 1 

GGTGTGCTCT TGAGGGATTA AATGCAAAGA GATCACACCA TGGACTACAA GGAAAGCTGC 
CCAAGTGTAA GCATTCCCAG CICCGA1GAA CACAGAGAGA AAAAGAAGAG GTTTACTGTT 
TATAAAGTTC TGGXTTCAGT GGGAAGAAGT GAATGGTTTG TCTTCAGGAG ATATGCAGAG 
XXTGATAAAC TTTATAACAC TTTAAAAAAA CAGTTTCCTO CTATGGCCCT GAAGATTOCT 
GCCAAGAGAA TATTTGGTGA TAATTTTGAT CCAGATTTTA TTAAACAAAG ACGAGCAGGA 
CTAAACGAAT TCATTCAGAA CCTAGTTAGG TATCCAGAAC TTTATAACCA TCCAGATGTC 
AGAGCATTCC TTCAAATGGA CAGTCCAAAA CACCAGTCAG ATCCATCTGA AGATGAQQAT 
GAAAGAAGTT CTCAGAAQCT ACACTCTACC TCAGAGAACA TCAACCIGGG ACCGTCTGGA 
AATOCTCATG CCAAACCAAC TGACTTTGAT TTCTTAAAAG TTATTGGAAA AGGCAGCTTT 
GGCAAGGTTC TTCXTGCAAA ACGGAAACTG GATGGAAAAT TTTATGCTGT CAAAGTGTTA 
CAGAAAAAAA TAGTTCTCAA CAGAAAAGAG CAAAAACATA TTATGGCTGA ACGTAATGTG 
CTCTTGAAAA ATGTGAAACA TCCGTXTTTG GTTGGATTGC ATTATTCCTT CCAAACAACT 
GAAAAGCTTT ATTTTGTXCT GGATTTTGTT AATGGAGGGG AGGTTTTTTT CCACTTACAA 
AGAGAAC3GGT CCTTTCCTGA GCACAGAGCT AGGTTTTACG CTGCTGAAAT TGCTAGTGCA 
TTGGGTTACT TACATTCCAT CAAAATAGTA TACAGAGACI TGAAACCAGA AAATATTGTT 
TTGGASTTCAG TAGGACATGT TGTCTTAACA GATTTTGGGC TTTGTAAAGA AGGAATTGCT 
ATTTCTGACA CCACTACCAC AITTTGTGGG ACACCAGAGT ATCTTGCACC TGAAGTAATT 
AGAAAACAGC CCTATGACAA TACFGTAOAT TGGTGGTGCC TTGGGGCTGT TCTGTATGAA 
ATGCTGTATG GATTGCCTCC TTTTTAITGC CGAGATGTTG CTGAAATGTA TGACAATATC 
CTTCACAAAC CCCTAAGTTT GAGGCCAGQA GTGAGTCTTA CAGCCTGGTC CATTCTGGAA 
GAACTCCTAG AAAAAGACAQ GCAAAATCGA CTTGGTGCCA AGGAAGACTT TCTTGAAATT 
CAGAATCAIC CTTTTTTTGA ATCACTCAGC TGGGCTGACC TTGTACAAAA QAAGATTCCA 
OCACCATTTA ATCCTAATGT GGCTGGAOCA GATGATATCA GAAACTTTGA CACAGGATTT 
ACfiGAAGAAA CAGTTCCATA TTCTGTGTGT GTATCTTCTG ACTATTCTAT AGTGAATGCC 
AGTGTAT TGG AGGCAGATGA TGCATTCX5TT GGTTTCTCTT ATGCACCTCC TTCAGAAGAC 
TTATTTTTGT GAGCAGTTTG CCATTCAGAA AOCATTGAGC AAAATAAGTC TATAGATGGG 
ACTGAAACTT CTATTTGTGT GAATATATTC AAATATGTAT AACTAG2GCC TCATTTTTAT 
ATGTAATGAT GAAAACTATG AAAAAATGXA TXXTCTTCTA TGTCCAAGAA AAATAGGGCA 
TTTCAAAGAG CTGlTl'TGAT XAAAATTTAT ATTCTTGTTT AATAAGCTTA TTTTTAAACA 
ATTTAAAAGC TATTATTCTT AGCAITAACC TATTTTTAAA GAAACCTTTT TTGCTATTGA 
CTGTTTTTTC CCTCTAAGTT TACACTAACA TCTACCCAAG ATAGACTGTT TTTTAACAGT 
CAATTTCAGT TCAGCTAACA TATATTAATA CCTTTGTAAC TCTTTGCTAT GGCTTTTGTT 
ATCACACCAA AACTATGCAA TTGGTACATG GTTGTTTAAG AAGAAAOGGT ATTTTTCCAT 
GATAAATCAC TGTTTGAAAT ATTTGGTTCA TGGTATGATC GAAATGTAAA AGCATAATTA 
ACACATTGGC TGCTAGTTAA CAATTGGAAT AACTTTATTC TGCAGATCAT TTAAGAAGTA 
ACAGGCCGGG CGCGGTGGCT CACGOCTGTA ATCCCAGCAC TTTGGGAGGC TGAGGCGGGC 
AGATCACGTG AGGTCAGGAG TTGGAGACCA GCCTGACCAA CATGGACAAA CCCGGTCTCT 
ACTAAAAATA CAAAATTGGC AGGGTGTGGT GGCACATGCC TATAATCCCA GCTACTTGGG 
AGGCTAAGGC AGGAGAATCG CTTQAACCCG GGAGGCGGAG GTTGCAGTGA GCCQAjGATCG 
. CACCATTGCA CrCCTGCCTG GGCAACAAGA GTGAAACTCC ATCTCCAAAA A 

Seq ID NOi lis Protein sequence 
Protein Accession ft: NP_037389 



X 
I 

MAXiKIPAKRI 
PEEDEDER6S 
YAVKVliQKKl 
IiPFHLOjaERS 
CKEGIAISDT 
EMYDNILHKP 
VQKK1PPPFN 
APPSEDXjPXi 



11 21 

I 1 

FGDNPDFDFI KQRRAGLNEP 
QKLH8TSQNI NLGPSGKPHA 
VI2JHKEQKHI MAERUVtiLKN 
FPEERARPYA AEIA9ALGYI, 
TTTFCGTPEY lapevihkop 
LSLRPGVSLT AW8ILEELHE 
PHVAGPDDIR NFDTAFTEET 



31 
I 

XGNLiVRyPEIi 
KPTDFDPLKV 
VKHPFLVGliH 
HSIKIVYRDL 
TDNTVDWWCL 
KDRONRLGAK 
VPYGVCVSSO 



41 
I 

TNHEDVRAFL 
IGKGSFGKVIr 
YSFOTTEKLY 

GAVLYEMLYG 
BDFX>EIGHHP 
YSrVKASVBE 



60 
120 
180 
240 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
72 D 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
21 60 
2220 
2280 
2340 



51 

! 

QMDSPKHQSD 
IiAKRKLDGKF 
FVLiDPWSGGE 
GHWXjXDFGL 
LFPFYCRDVA 
FFESLSWADL 



60 
120 
1B0 
240 
300 
360 
420 



Seq ID NO s 117 UNA sequence 

Nucleic Acid Accession ft: NM_004004.1 



997 



WO 03/042661 



coding sequence: 1..S81 

1 11 21 31 41 51 

I 1 1 I 1 I 

ATGGATTGGG GCAOGCTGCA GACGATCCTG GGGGGTGTGA ACAAACACTC CACCAGCATT 60 

GGAAAGATCT GGCTCACCGT CCTCTTCATT TTTCGCATTA TGATCCTCST TGTGGCTGCA 120 

AAGGAGGTGT GGGGAGATGA GCAGGCCGAC TTTGTCTGCA ACACCCTGCA GCCAGGCTGC 160 

AAGAACGTGT GCTAOGATCA CTACTTCCCC ATCTCCCACA TCCGGCTATG GGCCCTGCAG 240 

CTGATCTTCO TGTCCAGCCC AGCGCTCCTA GTGGCCATGC ACGTGGCCTA CCOGAGACAT 300 

GAGAAGAAGA GGAAGTTCAT CAAGGGGGAG ATAAAGAGTG AATTTAAGGA CATCGAGGAG 360 

ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC AAGCAGCATC 420 

TTCTTCCGGG TCATCTTCGA AGCOGCCTTC ATGTACGTCT TCTATGTCAT GTACGACGGC 4 B0 

TTCTOCATGC AGOGGCTGGT GAAGTGCAAC GCCTGGCCTT GTCCCAACAC TGTGGACTGC 540 

TTTOTGTCCC GGCCCACGGA GAAGACTGTC TTCACAGTGT TCATGATTGC AGTGTCTGGA 600 

ATTTGCATCC TGCTGAATGT CACTGAATTG TGTTATTTGC TAATTAGATA TTGTTCTGGQ 660 
AAGTCAAAAA AGCCAGTTTA A 

Seq ID NO t 118 Protein sequence 
Protein Accession fli HF_003995.1 

1 ll 21 31 41 51 

I I i I 1 i 

MDWGTLQTIEi GGVNKHSTSI GKIKLTVLiFI FRIMILWAA KEVKGDEQAD FVCNTLQPGC 60 
KNVCYDHYFP ISHIRLHALQ LIFVSSPALL VAMHVAYRRH EKKRKFIKGE IK3EFKDISE 120 
IKTQKVRIEG SLWWTYTSSI FFKVIFEAAF MYVFYVMYDG FSKQRIiVKCU AKPCPHTVDC 180 
FVSRPTERTV FTVFWIAVSG ICIUJflVTEL CYLLIRYCSG KSKKFV 



Seq ID NO: 119 DNA sequence 

nucleic Acid Accession #i XM_061091.1 

Coding sequences 1 . .2431 

1 11 21 31 41 51 

I f I I I 1 

ATGCCAAATA CTTCAGOAAC AACCAGGATT GAAATTTGGC TTCTCCAAGA GCCGCCCGGQ 60 

CACCGAGCGC TGGTCGCCGC TCTCCTTCCG GTGAGTCCCA GCCCCGAGTT GGCTCTGGCG 120 

CCCGGGTACC CGCCAOTGCC GGCTGCGGAT GACCGATTCA CX5CTCCOGAT GATTGGAGGT 180 

CAGATGCATG GTGAGAAGGT AGATCTCTGG AGCCTTGGTG TTCTTTGCTA TGAATXTTTA 240 

GTTGGGAAGC CTCCTTTTQA GGCAAACGAA GTCCATGTAA GCAAAGAAAC CATCGGCAAG 300 

ATTTCAGCTG CCAGCAAAAT GATGTGGTGC TCQGCTGCAG TGGACATCAT GTTTCTGTTA 360 

GATGGGTCTA ACAGCGTCGG GAAAGGGAGC TTTGAAAGGT CCAAGCACTT TGCCATCACA 420 

GTCTGTOACG GTCTGGACAT CAGOCCCGAG AGGGTCAGAG TGGGAGCATT CCAGTTCAGT 480 

TCCACTCCTC ATCTGGAATT CCCCTTGGAT TCATTTTCAA CCCAACAGGA AGTGAAGGCA 540 

AGAATCAAQA GGATGGTTTT CAAAGGAGGG CGCACGGAGA CGGAACTTGC TCTGAAATAC 600 

CTTCTGCACA GAGGGTTGCC TGGAGGCAGA AATGCTTCTG TGOCCCAGAT CCTCATCATC 660 

GTCACTGATG GGAAGTCCCA GGGGGATGTG GCACTGCCAT CCAAGCAGCT GAAGGAAAGG 720 

GGTGTCACTG TGTTTGCTGT GGGGGTCAGG TTTCOCAGGT GGGAGGAGCT GCATGCACTG 780 

GCCAGCGAGC CTAGAGGGCA GCACGTGCTG TTGGCTGAGC AGGTGGAGGA TGCCACCAAC 640 

GGCCTCTTCA GCACCCTCAG cagctcggcc atctgctcca gcgccacgcc AGCTGGGAGC 9DO 

CCOGAGCTTG TCTTCATGGA GCGGTTAATG GGCATCTCTC TGATAGGCCC CTGTOACTCG 960 

CAGCCCTGCC AGAATGGAGG CACATQTGTT CCAGAAGGAC TGGACGGCTA CCAGTGGCTC 1020 

TGCCCGCTGG GCTTTOGAGG GGAGGCTAAC TGTGCCCTGA AGCTGAGCCT GGAATGCAGG 1060 

GTCGACCTCC TCTTCCTGCT GGACAGCTCT GCGGGCACCA CTCTGGACGG CTTCCTGCGG 1140 

GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG GCCGTGCTGA GCGAGGACTC TCGGGCCCGA 1200 

GTGGGTGTGG CCACATACAO CAGGGAGCIG CTGGTGGCGG TGCCTGTGGG GGAGTACCAG 1260 

GATGTGCCTG ACCTGGTCTG GAGCCTCGAT OGCATTCCCI TGCGTGGTGG CCCCACCCTG 1320 

ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG CGTGGCTTCG GGAGCGCCAC CAGGACAGGC 1380 

CAGGACCGGC CACGTAGAGT GGTGGTTTTG CTCACTGAGT CACACTCCGA GGATGAGGTT 1440 

GGGGGCCCAG OGCGTCACGC AAGGGOQCGA GAGCTGCTCC TGCTQGGTGl! AGGCAGTGA3 1500 

GCCGTGCGGG CAGAGCTGGA GGAGATCACA GGCAGCCCAA AGCATGTGAT GOTCTACTCG 156 0 

GATCCTCAGG ATCTGTTCAA CCAAATCCCT GAGCTGCAGG GGAAGCTGTO CAGCCGGCAG 1620 

CQQCCAGGGT GCCGGACACA AGCCCTGGAC CTCGTCTTCA TGTTGGACAC CTCTGOCTCA 1680 

GTAGGGCCCG AORATTTTGC TCAGATGCAG AGCTTTGT3A GAAGCTGTGC CCTOCAGTTT 1740 

GAGGTGAACC CTGACGTGAC ACAGGTCGGC CXGGTGGTGT ATGGCAGCCA GGTGCAGACT 1800 

GCCTTDGGGC TGGACACCAA ACCCACCCGG GCTGCGATGC TGCGGGCCAT TAGCCAGGCC 1660 

CCCTACCTAG GTGGGGTGGG CTCAQCCGGC ACCGCCCTGC TGCACATCTA TGACAAAGTG 1920 

ATGACCGTCC AGAGGGGTGC CCGGCCTGGT GTCOCCAAAG CTGTGGTGGT GCTCACAGGC 1980 

GGGAGAGGOG CAGAGGATGC AGCCGTTCCT GCCCAGAAGC TGAGGAACAA TGGCATCTCT 2040 

GTCTTGGTCG TGGGCGTGGG GCCTOTOCTA AGTGAGGGTC TOCGGAOGCT TGGAGGTCCC 2100 

CGGGATTCCC TGATCCACGT GGCAGCTTAC GCCGACCTGC GGTACCACCA GGACGTGCTC 2160 

ATTGAGTGGC TGTGTGGAGA AGCCAAC3CAG CCAGTCAACC TCTGCAAACC CAGCCCGTGC 2220 

ATGAATGAGG GCASOTGOGT CCTGCAGAAT GGOAGCTACC GCTGCAAGTG TCGGQATGGC 2260 

TGOGAGGGCC CCCACTGCGA GAACOGTGAG TGGAGCTCTT GCTCTQTATG TGTGAGCCRG 2340 

GGATGGATTC TTGAGACGCC CCTGAGGGAC ATGGCTCCOG TGCAGGAGGG CAGCRGCCST 2400 

A0COCTCCCA GCAACTACAG AGAAGGCCTG GGCACTGAAA T0GTGCCTAC CTTCTGGAAT 2460 
GTCTGTGOCC CAQGTCCTTA G 

Seq ID WQi 12 0 Probein eeguence 
Protein Accession #s XP_061031.1 

1 11 21 31 41 51 

I I I I I i 

MPNTSGTTH.I filWLLQEPPG HRALVAALLP VSPSPELAIA PGYPPVPAAD DRFTLPMIGG GO 

QMHGEKVDLM SLGVLCYBFL VGKPPPEANE VHVSKETIGK IBAASKMMWC SAAVDIMFLL 120 



998 



WO 03/042661 



PCT/US02/36810 



DGSNSVGKGS FERSKHFAIT VCDGLDISPE RVRVGAFQFS STPHLEFPLD SFSTQQEVKA 1B0 

RIKRMVFKGG RTCTELAUKY LrLHRGLPGGR NASVPQILII VTDGKSQGDV ALPSKQLXHR 240 

GVTVFAVGVR FPRWEELHAL A9EPRGQHVL IAEQVEDATN GLFSTLSSSA ICS SAT PAG 3 300 

PELVPHEKLM GISLIGPCDS QPCQKGGTCV PEGI»DGYQCL CPLAFGGEAN CALKLSIjECR 360 

5 VDtiliFLiLDSS AGTTLDQFLR AKVFVKRFVR AVLBBDSRAR VGVATtfSREL LVAVPVGEYQ 420 

DVEDLVWSLD G I PFRGGP TX» TGSAIiRQAAE RGFGSATRTG QDRPRRVWI* LTESHSEDEV 480 

AGPAKHARAR ELliLGVGSE AVRAEUBEIT G9PKHVMVYS DPQDIiPNQIP ELQGKLCSRQ 540 

RPGCRTQALD LVFMLDTSAS VGPENFAQMQ SFVRSCALQF EVHPDVTOVG LWYGSQVQT 600 

AFGLDTKPTR AAMLRAISQA PYIiGGVGSAG TALLHITOKV MTVORGARPG VPKAVWliTG 560 

10 GRGABDAAVP AQKLRNHGlS VLWGVGFVIi SEGLRRLAGP RDSJjIHVAAY ADLRYHQPVI* 720 

lEWLCGEAKQ PVNLCKPSPC MHEQSCVLQN G8YRCKCRDG WEGPHCENRE WSSCSVCVSQ 780 
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTBMVPTFWN VCAPGP 



Seq ID NOi 121 DMA sequence 
15 Nucleic Acid Accession #i Eos sequence 
Coding sequence: 1-.2424 

1 11 21 31 41 51 

„ I I I I I I 

ZU ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCX3GGAAGAT TTCAGCTGCC 120 
AGCAAAATGA TGTQGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TOGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACQGT 240 
CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300 
25 CTGGAATTOC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 
ATOGTCTTCA AAGGAGGGCG CACGGAGAOG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 
GGGTTCCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 460 
AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 
TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600 
30 AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 660 
ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAQ ACTGCAGGGT CGAGGCTCAC 720 
CCCTGTGAGC ACAGOACGCT GGAGATGGTC CGGGAGTTOG CTGGCAATGC CCCATGCTGG 780 
AGAGGATCGC GGOGGACCCT TGOGGTGCTG GCTGCACACT GTCCGTTCTA CAGCTGOAAG 840 
AGAGTGTTCC TAACOCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 
35 TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGAOGG CTACCAGTGC 960 

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCOC TGAAGCTGAG CCTGGAATGC 1020 

AGGGTCGAGC TCCTC1TCCT GCK3GACAGC TCTGCGGGCA CCACTCTGGA Q3GCTTCCTG 1080 

03GGCCAAAG TCTTCGTGAA GCGGTTTGTG OGGGCCGTGC TGAGGGAGGA CTCT0GGGOC 1140 

GGAGTGGQTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200 

40 CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260 

CTGACGGGCA GTGCCITGOG GCAGGCGGCA GAGGGTGGCT TCGGGAGCGC CACCAGGACA 1320 

GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC OGAGGATGAG 1380 

GTTGCGGGCC CAGCGCGTCA OGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440 

GAOGCGGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT QATGGTCTAC 1500 

45 TCGGATCCTC AGGATCTGTT CAACCAAATC CCTOAGCTGC AGGGGAAGCT GTGCAGCCGG 1560 

CAGOGGCXZAG GGTGCCGGAC ACAAGCCCTG GAOCT0GTCT TCATGTTGGA CACCTCTGCC 1620 

TCAGXAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680 

TTTGAGGTGA ACCCTGACGT GACACAOGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740 

ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGQCTGOGA TGCTGCGGGC CATTAGCCAG 1800 

50 GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCAC&T CTATGACAAA I860 

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCGCA AAGCTGTGGT GGTGCTCACA 1920 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCXGAGGAA CAATGGCATC 1980 

TCTGTCTTGG TCGTGGGCGT GGGGOCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCOGACC TGQGGTACCA CQH3GACGTG 2100 

55 CTCATTGAGT GGCTGXGTGG AGAAGCCAAG OVGCCAGTCA ACCTCTGCAA ACGCAGCCOQ 2160 

TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGGTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GOCCCCACTG OGAGAACOGT GAGTGGAGCT CTTGCTCTGT ATGTG1GAGC 2280 

CAGGGATGGA TTCTTGAGAC GCOCCTGAGG CACATGGCTC COGTGCAGGA GGGCAGCAdC 2340 

OGTACCCCTC CCAGCAACIA CAGAGAAGGC CTGGGCACTG AAATGGTGOC TACCTTCTGG 24.DO 
60 AATGTCTGTG OCCCAGGTCC TTAG 



Seq ID HOi 122 Protein sequence 
Prot-eixi Accession #: Eos sequence 

65 1 11 21 31 41 51 

I I 1 I I 1 

MPPFLIiZiEAV CVFLFSRVPP SLPLQEVHVfi KETIGKISAA SKMMWCSAAV DIMFLLDG3N 60 

SVGKGSFRR3 KHFAXTVCDG LBISPERVKV GAFQFSSTPH LBFPLD3PST QQEVKARIKR 120 

MVFKGGH.TET ELALKYLLER GLPGGRHASV PQIIillVTDG KSQGDVALPS KQLKERGVTV 180 

70 FAVGVRFPRW EELEALASEP RGQHVIiIiAEQ VEDATKGLPS TLS£SAICSS ATPDCRVEAH 240 

PCEHRTXjBKV REFAGNAPCW RGSRRTLAVIi AAHCPFYSWK. RVFLTHPATC YRTTCPGPCD 300 

9QPC0MGGTC VPEGLDGYQC LCPIiAFGGEA UCALKLSIiEC RVDLLFLLDS 9AGTTLDGFL 360 

RAKVFVKRFV RAVL3EDSRA HVGVATYSRE IiLVAVFVGEY QDVPDLVWSli DGI PFRGGP T 420 

LTG3ALRQAA ERGFGfiATRT GQDRPRRVW LLTBSHSEDE VAGPARHARA REXiLLLGVGS 480 

75 EAVRAELEEI TGSPKHVMVY GDPQDLFKQI PELQGKLCSR QRPGCRTQAIi DLVFMLDTSA 540 

SVGPEHFAOM QSFVRSCALQ FEWPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYIiGGVGSA GTAIiWHIYDK VMTVQRGARP GVPKAVWIiT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVBPV LSEGliRRIAG PRDSLIHVAA YADLRYHODV LIEWIiCGEAK QPVNLCKPSP 720 

. . CMNBGSCVLQ HG3YRCKCRD GWEGPHCBNR EMSSCSVCVE QGWILBTPLR HMASVQEGS8 780 

80 RTPPSNYREG LGTEMVPTFW KVCAPGP 



Heq ID NO: 123 UNA sequence 

Nucleic Acid Accession #- Sob sequence 

Coding sequencer 89- -2356 



999 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



i 
1 

GOCCCCTQQC 
GTOGCOGCTC 
TGTTTTCCTG 
AGAAACCATC 
CATCATGTTT 
GCACTTTQCC 
AGCATTCCAG 
ACAGGAAGTG 
ACTTGCTCTG 
CCAGATCCTC 
GCAGCTGAAG 
GGAGCTGCAT 
GGAGGATGCC 
CAOGCCAGAC 
GGAGITCGCT 
TGCACACTOT 
CZAGQACCA.CC 
TCCAGAAGGA 
CTGTGCCCTG 
TGOSGGCACC 
GGOOGTGGTG 
GCTGGTGGCG 
TGGCATTCCC 
GCGTQGCTTC 
GCTCACTGAG 
AGAGCTGCTC 
AGGCAGCCCA 
TGAGCTGCAG 
CCTCGTCTTC 



CCTGGTGGTG 
GGCTGCGATG 
CAOCGCCCTG 
TGTCCCCAAA 
TGCCCAGAAG 



CGCCGACCTG 

gccagtcaac 

TGGGAGCTAC 
CTTGAGACGC 
AGCAACTACA 
CCAGGTCCTT 
GAGGATGTCC 
AAACOATOTT 
GOCTTGTTGA 
GACTTAAATT 
TGACACAGTA 



11 21 

I I 

CCGAGCCGCG 
TCCTTCCGTT ATATCAACAT 
TTTTCCAGAG TGCCCCCATC 
GGGAAGATTT CAGCTGCCAG 

CTGTTAGATG ggtct aacag 
ATCACAGTCT QTGACGGTCT 
TTCAGTTCCA CTCCTCATCT 
AAGGCAAGAA TCAAQAGGAT 
AAATACCTTC 7GCACAGAGG 
ATCATCGTCA CTGATGGGAA 
GAAAGGGGTG TCACTGTGTT 
GCACTGGCCA GCGAGCCTAG 
ACCAAGGGCC TCTTCAGCAC 
TGCAGGGTCG AGGCTCACCC 
GG CAATGCCC CATGCTGGAG 
CCCTTCTACA GCTGGAAGAG 
TGCCCAGGOC CCTGTGACTC 
CTGGACGGCT ACCAGTGCCT 
AAGCTGAGCC TGGAA7GCAG 
ACTCTGGACG GCTTCCTGOG 
AGCGAGGACT CTCGGGCCCG 
GTGCCTGTGG GGGAQTACCA 
TTCCGTGGTG GCOCCACCCT 
GGGAGCGCCA CCAGGACAGG 
TCACACT CCG AGGATGAGGT 
CTGCTGGGTG TAGGCAGTGA 
AAGCATGTGA TGGTCTACTC 
GGGAAGCTGT GCAGCCGGCA 
ATGTTGGACA CCTCTGCCTC 
AOAAGCTGTG CCCTCCAG3T 
TATGGCAGCC AGGTGCAGAC 
CTGCOGGCCA TTAGOCAGGC 
CTGCACATCT ATGACAAAGT 
GCTGTGGTGG TGCTCACAGG 
CTGAGGAACA ATGGCATCTC 
CTGCGGAGGC TTGCAGGTCC 
CGGTACCACC AGGACGTGCT 
CTCTGCAAAC CCAQCCCGTG 
CGCTGCAAGT GTCGGGATGG 
CCCTGAGGCA CATGGCTCCC 
GAGAAGGCCT GGGCACTGAA 
AGAATGTCTG CTTCCCGCCG 
CAACTGCAGC CATGCTGCTT 
GTTGAAAAGT TTTGATGTQT 
GGCTA1GTCA TCTGCCACCT 
TAGCGGCCTG ACGTTCCTTT 
ATGCCCAGCA GAGGCCTTTA 



31 41 51 

I I 1 

TGAGrAGAGC CGCCCGGGCA CCGAGCGCTG 60 

GCCCCCTTTC CTGTTGCTGG AAGCCGTCTO 120 

TCTCCCTCTC CAGGAAGTCC ATGTAAGCAA 180 

CAAAATGATG TGGTGCTCGG CTGCAGTGGA 240 

CGTCGGGAAA GGGAGCTTTG AAAGGTCCAA 30 0 

GGACATCAGC CCCGAGAGGG TCAGAGTGGG 3 60 

GGAATTCOCC TTGGATTCAT TTTCAACCCA 42 O 

GGTTTTCAAA GGAGGGCGCA CGGAGACGGA 480 

GTTGOCTGGA GGCAGAAATG CTTCTGTGCC 540 

GTCCCAGGGG DATOTGGCAC TGCCATCCAA 600 

TGCTGTGGGG GTCAGGTTTC CCAGGTGGGA £60 

AGGGCAGCAC GTGCTGTTGG CTGAGCAGGT 720 

OCTCAGCAGC TOGGCCATCT GCTCCAGCGC 7BD 

CTGTGAGCAC AGGACGCTGG AGATGGTCCG B40 

AGGATCGCGG CGGACCCTTG CGGTGCTGGC 900 

AGTGTTCCTA ACCCACOCTG CCACCTGCTA 960 

GCAGCCCTGC CAGAATGGAG GCACATGTGT 1020 

CTGCCCGCTG GCCTTTGGAG GGGAGGCTAA 1080 

GGTCGACCTC CTCTTCCTGC TGGACAGCTC 1140 

GGCCAAAGTC TTCGTGAAGC GGTTTGIGCG 1200 

AGTGGGTGTG GCCACATACA GCAGGGAGCT 1260 

GGATGTGCCT GAGCTGGTCT GGAGCCTCGA 1320 

GAGGGGCAGT GCCTTGCGGC AGGCOGCAGA 1380 

CCAGGACCGG CCACGTAGAG TGGTGGTTTT 1440 

TGCGGGCCCA GCGCGTCACG CAAGGGCGCG 1500 

GGCCGTGCGG GCAGAGCTGG AGGAGATCAC 1560 

GGATCCTCAG GATCTGTTCA ACCAAATCCC 1620 

GCGGOCAGGG TGCOGGACAC AAGCCCTGGA 16BD 

AGTAGGGCCC GAGAA.TTTTG CTCAGATGCA 1740 

TGAGGTGAAC CCTGACGTGA CACAGGTCGG 1800 

TGCCTTCGGG CTGGACACCA AACCCACCCG 1860 

CCCCTACCTA GGTGGGGTGG GCTCAGCCGG 1520 

GATBACCBTC CAGAGGGGTG CCCGGCCTGG 1980 

CGGGAGAGGC GCAGAGGATG CAGCCGTTCC 2040 

TGTCTTGGTC GTGGGCQTGG GGCCTGTOCT 2100 

CCGGGATTCC CTGATCCAGG TGGCAGCTTA 2160 

CATTGAGTGG CTGTGTQGAG AAGCCAAGCA 2220 

CATGAATGAG GGCAGCTGCG TCCTGCAGAA 2280 

CTGGGAGGGC CCCCACTGCG AGAACOSATT 2340 

GTGCAGGAGG GCAGCAGCCG TACCOCTCCC 2400 

ATGGTGCCTA CCTTCTGGAA TDTCTGTGCC 2460 

TGGCCAGGAC CACTATTCTC ACTGAGGGAG 2520 

AGAGACAAGA AAGCAGCTGA TGTCACCCAC 2580 

AAGTAAATAC CCACTTTCTG TACCTGCTGT 2640 

TTOCCT7GAG GATAAACAAG GGGTCCTGAA 2700 

GCACACAATC AAXGCTCGCC AGAATGTTGT 2760 
CTAGAGCATC CTTTGGACGG 



Seq ID NO; 124 Protein sequence 
Protein Accession #s Eos sequence 



1 
1 

MPPFLLLEAV 



11 
I 

CVFIiFSRVPP 
KKFAITVCDG 
ELALKYIxLHR 



MVFfGSGRTBT 
FAVGVRFPRW 
PCEHRTIiEMV 
SQPCQNGGTC 
RAKVFVKRFV 
I/TGSALRQAA 
EAVRAELEEI 
SVGFEHFAQK 
APYLGGVGSA 
EVLWGVGPV 
CMBIEGSCVLQ 



Seq ID NO: 125 DNA ggquence 

Nucleic Acid Accession fts NM_031942.1 

Coding Bequences 145 ..1260 



REFAGNAPCH 
VPBGLDOTQC 
RAVLBEDSRA 
ERGFGSATRT 
TGSPKHVMVY 
QSFVRECAL0 
GTAUiHIYDK 
LSEGLRRLAG 



21 
I 

SLPLQSVHVS 
BDISPKRVRV 
GLPGGRNASV 
RGQHVLIJVEQ 
RGSRHTTaAVI* 
LCPIiAFGGEA 
RVGVATVSRE 
GQDEPRRWV 
SDPQDLFNQI 
FEVNPDVTQV 
■UMTVQRGARP 
PRDSLIHVAA 



31 
1 

KETIGKI6AA 
GAFQFSSTPH 
PQILIIVTDG 
VEDATNGIiFfl 
AAHCPFYSWK 
NCALKLSLEC 
IaLVAVPVGEY 
UjTESHSEDK 
FHLQGKLCSR 
GLWYGSOVQ 
GVPKAVWLT 
YADLRYHQDV 



41 
I 

SKMMWCSAAV 



KSQGDVALPS 
TLBSBArCSS 
RVPLTHPATC 
RVDLIiFliLDS 
QD VEDUVHSIi 
VAGPARHARA 
QRPGCRTQAI* 
TAPGLDTKPT 
GGRGABDAAV 
LIEWLCGEAK 



51 
I 

DIMFLIiDGSN 
QQSVKAUIXR 
KQLKERGVTV 
ATPDCRVEAH 
YKTTCPGPCD 
SAGTTIjDGPJj 
DGIPFHGGPT 
REUuLLGVGS 
DIjVPKLDTSA 
RAAMLRAISQ 
PAQKLRHNGI 
QFVKI.CKP9P 



60 
120 
180 
240 
30O 
360 
420 
480 
540 
6O0 
660 
720 



1 
I 

CCCGAGCCCC 
TGCTCCTCCT 
CCQATCTGGG 
GTAAAGAAGA 
TCCTCTOATG 
TCAGTTCGGG 
GCGATGAAGT 
CAGCCCTCAG 



11 
I 

GCOCCTCCGG 
GCTGTGGGAC 
CACCCGCCAC 
ACTTAAAGAA 
ACAGTTGTGA 
AAGGCTGTAG 
TTCCAGCGCG 
AGAATTCTGT 



21 
1 

GCCCGQGTOG 
CGCTGAOCGC 
CAGCATGGAC 
ATTCAGATAT 
CAGCTTTGCT 
GACCCGCAGC 



GACTGATTCC 



31 
1 

GCGGGCCCAG 
GCGGCTGCTC 
GCT0GCCGCG 
GTGAAGTTGA 
TCTQATAATT 
CAGTGCAGGC 
GGAGCAACCA 
AACTCCGATT* 



41 
I 

CCtGCCAGCC 
CGCTCTCCCC 
TGCCGCAGAA 
TTTCCATGGA 
TTGCAAACAC 
ACTCTGGACC 
ACAAAAAAGC 
CAGAAGATGA 



51 
1 

GCGCTGCTGC 
GCTCCAAGCG 
AGATCTCAGA 
AACCTCGTCA 
GAGGCTGCAG 
TCTCAGGGTG 
AGAGTCCCGC 
AAGTGGAATG 



60 
120 
1B0 
240 
300 
360 
420 
460 



1000 



WO 03/042661 



AATTTTTTGG AGAAAAGGGC TTTAAATATA AAGCAAAACA AAQCAATC3CT TGCAAAACTC 540 

ATGTCTGAAT TAGAAAGCTT COCTGGCTCG TTCCGTGGAA GACATCCCCT CCCAGGCTCC 60 0 

GACTCACAAT CAAGGAOACC GCGAAGGCGT ACATTCCOGG GTGTTOCTTC CAGGAGAAAC 660 

CCTGAACGOA GAGCTCGTCC TCTTACCAGG TCAAGGTCCC GGATCCTGGG GTCCCTTGAC 720 

GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAG AAAGAGGAAG 7 BO 

ACCGTGGATG GCTACATGAA TGAAGATGAC CTGCCCAGAA GCCGTCGCTC CAGATCATCC 840 

GTGACCCTTC CGCATATAAT TCGCCCAGTG GAAGAAATTA CAGAQGAGGA GTTGGAGAAC 900 

GTCTGCRGCA ATTCTCGAGA GAAGATATAT AACCGTTCAC TGGGCTCTAC TTGTCATCAA 960 

TGCCGTCAGA AGACTATTGA TACCAAAACA AACTGCAGAA ACCCAGACTG CXGGGGCGTT 1020 

CGAGGCCAGT TCTGTGGCCC CTGCCTTCOA AACGGTTATG GTGAAGAGGT CAGGGATGCT 1080 

CTGCTGGATC CGAACTGGCA TTGCCCGCCT TGTCGAGGAft TCTGCAACTG CAGTTTCTGC 1140 

CGGCAGCGAG ATGGACGGTG TGCGACTGGG GTCCTTQTGT ATTTAGCCAA ATATCATGGC 12 DO 

TTTGGGAATG TGCATGCCTA CTTGAAAAGC CTGAAACAGG AATTTGAAAT GCAAGCATAA 1260 

T ^gggft R ft> ATTTGCTGCC TGCCTTCTAC TTCTCAAATC TTTCTTQTAA AAGTTTCCAA 1320 

TTTTTTCACT GAAACCTGAG TTAAAAATCT TGATGATCAG CCTGTTTCAT AAGAAACTCC 1380 

AATCAAQTTA ATCTTAGCAQ ACATGTGTTT CTGGAGCATC ACAGAAGGTA TATTGCTAGT 1440 

TACACTXTQC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 1500 

TCCATTTCCA ATGCTCCTCT CCAACCGCIT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT IS 60 

TATGAAAGCA TATTTT AT TT ACXTOGTGIT GAAATAGOOC TCATAAAACC TAAGCACTTG 1620 

GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTOAAT TTCAGAGAAG AGCCTTCTAA 1680 

CTTGTTTACA CAAAAACGAG TATGATTTAG CACT CATACT AGTTGAAATT TTTAATAGAA 1740 

TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTXG TCACAGTTGA GACTTAATTT 1860 

CC ?!t^ AATTT CTTCTGCCOa AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920 

AAGGTTGGTG GGCAATQTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1580 

ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 2040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATT TAT AAACCAGGCA 2160 

OVAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 2220 

GTAACTTTTA GCAGTTTGTT AACCTGACAr CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 2280 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAC3GT ATTTGGTAGA GGTGGAATCT 2340 

AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAAC CAT TGCTGTGCCA TTCAATGTTT 2400 

GATGCATAAT TGGACCTTQA ATCGATAAQT GTAAATACAG " CTTTTGATCT GTAATGCTTT 2460 
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 

Seq ID NO: 126 Protein sequence 
Protein Accession &t NP_ii4i4B*i 

\ 11 21 31 41 51 

I I I 1 1 I 

MDARRVPQKD IJWKKEJLKKF RYVKLISMET SfiSSDDSCDS FASDNFAJJTR LQSVREGCRT 60 

RSQCRHSGPL EVAMKFPARS THGATHKKAE SRQPSENSVT DSNSDSKDES GHNFLEKRAI. 120 

NnCQSHKAMIA KLMSBLESPP GSFRGRHPLP GSDfiQSRRPR HRTFPGVASR KHFERRARPi* 180 

TRSRSRILGS LDAIiPMEEEE EEDJCTOLVHK RXTVDGYWNE DDLPRERKSR SSVTLEHIIR 240 

PVEEITEEEI, ENVCSKT3RBK IYNRSTjGSTC HQCRQKTIDT XTNCRNPDCW GVRGQPCGPC 300 

LRNRYGEEVR DAUJ>PN«HC PPCRGICNCS FCRQRDORCA TGVI1VYX1AKY HGFGNVHAYL 360 
KSURQEFEMQ A 

Seq ID NO! 127 DMA sequence 

Nucleic Acid Accession ft: AF305616.1 

Coding sequence z 1 . . 863 

] r t v 41 51 

ATGCACCGCT TGATGGGGQT CAACAGCACC GCCGCCGCCG GCQCCGGGCA GCCCAATGTC 60 

TCCTGCACGT GCAACTGCAA ACGCTCTTTG TTCCRGAGCA TGGAGATCRC GGAGCTGGAG 120 

TTTGTTCAQA TCRTCATCAT CGTGGTOGTG ATGATGGTGA TGGTGGTGGT GATCACGTGC 180 

CXGCTGAGOC ACTACAAGCT GTCIGCACGG TCCTTCATCA GCCGGCACAG CCAGGGGCGG 240 

AGGAGAGAAG ATGCCCTGTC CTCAGAAGGA TGCCTGTGGC CCTOGGAGAG CACAGTGTCA 300 

GGCAACGGAA TCCCAGAGCC GCAGGTCTAC GCOXGCCTC GGCCCACCGA CCGCCTGGOC 360 

GTGCCGCCCT TCGCCCAGCG GGAGCGCTTC CACCGCTTCC AGCCCACCTA TCCGTACCTG 420 

CaGCACGAGA TCGACCTGCC ACCCACCATC TCGCTGTCAG AOGGGGAGGA GCCC0CACCC 480 

TACCAGGGCC CCTGCACCCX OCAGCTTCGG QACCCCGAGC AGCAGCTGGA ACTGAACCGG 540 

GAGTCGGXGC GCGCACCCCC AAACAGAACC ATCTTCGACA GTGACCTGAT GOATAGTGCC 600 

AGGCTGGGCS GCCCCTGCCC CCCCAGCAGT AACTCGGGCA TCAGCGCCAC GTGCTACGGC 660 

ASCGGCGGGC GCATGGAGGG GCCGCCOCCC ACCtACAGCG AGGTCATCGG CCACTACCCG 720 

GGGTCCTCCT TCCAGCACCA GCAGAGCAGT GGOCCGCOCT CCTTGCTGQA GGGGACCCGG 7 B0 

CTCCACXaCA CACACATCGC GCCCCTAGAG AGCGCAOC5CA TCTGGAGCAA AGAGAAGGAT 840 
AAACAGAAAG GACACOCTCT CTAG 

Seq td NO: 128 Protein sequ« a ^"-e 
ProteijQ Accession #; AAL09357.1 

] j 1 21 31 41 51 

MHSLMGVNHT AAAAAGQPMV SCTOJCKafil, FQSMEITELB FVQIIUVW MKVMVWITC 60 

UjSHYKLSAR SPISRHSQGR RaBDADSSEG CLWPSESTVS GNGIPEPQVY APPRPTDRLA 120 

VPPFAQRERF HRFQPTYPYL QHEIDLPPTI SLSDGEEPPP YQGPCTIiQLR DPEQQLELNR 180 

ESVRAPPNRT IFDSDLKDSA RIGGPCPPas NSGISATCYG SGGRMEGPPP TYBEVIGHYP 240 
GSSPQHQQSS GPPSLLEGTR LHHTHIAPLE SAAIMSKEKD KQKOIIPr, 



Seq ZD 190: 129 DMA gggggncg 

Nucleic Acid Accession #: BM_004952.1 



1001 



WO 03/042661 



PCT/US02/36810 



Coding sequence* 1..718 



5 

10 

15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



1 
I 

ATGGCGGCGG 

CTGGCCCAAG 

AACCAGCACC 
ATTTACTGCC 

GGCGGGGCAG 
GCCAGCCaGg 
AAGTTCTCGG 
GGCCACGAGT 
ATQAAGGTGT 
CTCCCCCAGT 
GAGAACCCTC 
CACCTGCCCC 



11 
1 

CTCCGCTGCT 
GGCCCGGAQG 
TGCGGCGAGA 
CGCACTACAA 
AGCAGTACGT 
GCTTCAAGCG 
AGAAGTTCCA 
ACTACTACAT 
TCQTCTGCTQ 
TCACCATBGG 
AGGTGCCCAA 
TQGCCGTGGG 



21 
I 

GCTGCTGCTG 
GGCGCTGGGA 
G GG CT ACACC 
CAGCTCGGGG 
GCTGTACATG 
CTGGGAGTGC 
QOGCTACAGC 
CTCC3U3GCCC 
CGCCTCCACA 
CGOCAATOTG 
GCTTCAGAAG 
CATCGCCTTC 



31 
I 

CTGCTCGTGC 
AACCGGCATG 
GTGCAGGTGA 
GTGGGCCCCG 
GTGAGCGGCA 
AACCGGCCGC 
GCCTTCTCTC 
ACTCACAACC 
TCBCACTCCG 
AAGATCAACG 
AGCATCAGCG 
TTCCTCATGA 



41 
I 

COGTGCCGCT 
CX3GTOTACTG 
ACGTGAACGA 
GGGCGGGACC 
ACGGCTACCG 
ACGCCCCGCA 



TGCACTGGAA 
GGGAGAAGCC 
TGCTGGAAGA 
GGACCAGCCC 
CGTTCTTGGC 



51 
I 

GCTGCCaCTG 
GAACAGCTCC 
CTATCTGGAT 
GGGGCCCGGA 
CACCTGCAAC 
CAGCCCCATC 
GTTCCACGCC 
GTGTCTGAGG 
GGTCCCCACT 
CTTTGAGGGA 
CAAAGGGGAA 
CTCCTAG 



Seq ID NDt 130 Protein sequence 
Protein Accession ftt NP 004943,1 



51 



1 11 21 31 41 

i i i 1 I i 

MAAAPLfcLLL LLVPVPLLPL lAQGPGGALG HEHAVYWNSS 29QHI.REEGYT VQVNVNDYLD 
IYCPHYMSSG VGPGAGPGPa GGAEQYVLYM VSRNGYRTCM ASQGFKRWEC HRPBAPHSPI 
KFSEKFQRYS AFSliJYEFHA GHEYYYISTP THKU1WKCLR MKtfFVCCAST SHSG3SK5VPT 
LPQFTMGPHV KINVI^EDFBG BHPQVPKLEK SISGTSPKRE ELPUWGIAF FLHTFXAS 

Seq ID MO i 131 DMA sequence 

Nucleic Acid AccesHion #; NM_0l244S.l 

Coding sequence: 276.. 1271 



l 
t 

GCACGAGGGA 
GCGGCAGCCC 
TCTCGCTGGA 
GGCCCGGGGC 
QCTGCCGGCC 
CCCXGGGCAA 
TTGGGGGAGA 
GCAAGTGGAG 
GGTCTTCGCT 
ACGTCAGTAA 
AGATCGAGGC 
TCC CCAGC GG 
TCTOGTTTGT 
ACCTGTGCGA 
CCGGGACGGA 
CGGTGACCGA 
GGCTGAAGGC 
GGGCCTTCAT 
CCTCAGTTCC 
GCGGAGGCCA 
CCGCCAACAA 
ACTGOGTCTA 
GGCTCCTGTG 
GACCGCGOTG 
GQCATTGGGA 
TCTGGTCTCA 
AATTATGGTC 
CCTGGCTCCC 
CTCTCCCGAG 
GGAAGCGTCA 
TGCTCAC 



Seq ID KOs 132 Protein sequence 
Protein Acceeoion 4 = MP 036S77.1 



n 
I 

AGAGGGTGAT 
COGCCGCCCC 
GGCCAGGCGG 
GCOGGCCTCG 
□CGCTCCCGC 
GGCCCTCTGC 
GTCCATCXGT 
CCAGACGGCC 
GCTGGGGGCC 
CGGGCTGOGC 
GGCGGGGGAG 
CACCGGGCAG 
GGTGCGCATC 
CGGGGACOGT 
CAGCGGCTTC 
GATAACGTCC 
CCTGCCTCCC 
CGCTCCCGCC 
AGAAAOGCCG 
CTGTGGGAGG 
CGGGAGCCCC 
AGACCAOAGC 
CAGGCTCATG 
AGGCCGCGCC 
AACAGCCTCC 
GOCTCCTCCT 
ICCTTATAAG 
ACGTGGTTGC 
GGCGCATCCA 
OTOTTTCCAT 



21 
I 

OCGACCCGGG 
CGCAGCCCCT 
TQCAGCATOG 
GGCTTAAATA 
TGCTCCTGCC 
GCTCTCCTCC 
TCCGCCAGAG 
TTCCCCAAGC 
GCGCATAGCT 
GACTTTGOGG 
GCGCTGCAGA 
ACGTCGGCGG 
GTGCCCAGCC 
TGGCGGGAAC 
ACCTTCTCCT 
ICCTCTCCCA 
ATCGCCAGGG 
CCAGTCCTGC 
CTGGACTGCG 
CTCGOGACCA 
TGCCCCGAGC 
CCCGCAQCCC 
CTGCAGGCGG 
GACCATCTCT 
TCCTTTCCCA 
CCTGCAGGAT 
TTRTTGCTQC 
AGATACCTCA 
AGCGGGGGCC 
GTTATGGATC 



31 
I 

GAAGGTCGCr 
TCTCCTCCTT 
AAGACAGGAG 
GGAGCTCGOG 
GGGTGATGGA 
TGGCCACTCT 
CCCCGGCCAA 
AGTACCCCCT 
CCGACTACAG 
AGCGCGGCGA 
GCGTGCACGC 
AGCTGGAGGT 
CGGACTGGTT 



CCCCCAACTT 
GCCACCCGGC 
TGACACTGGT 
CCAGCAGGGA 

AGAGCAGGAC 
TCQAAGAAGA 
CTGGGGCCCC 
CCGAGGCACA 
GCACTGAAGG 
ACCTTGCTTC 
AAAGTCATCC 
TCCAGGAGAT 
GACCTGGTGC 
ACTTGAGAAG 



41 

I 

GGGCAGGGCG 
TCTCCCACGT 
GAACTGGAGC 
GCTCTGGCTG 
AAACCCCAGC 
CGGCGCCGCC 
ATACAGCATC 
GTTCCGCCCC 
CATGTGGAGG 
GGCCTGGGGG 
GGTGTTTTGG 
GCAGCGCAGG 
CGTGGGCGTG 
GGACCTGTAC 
CGCCAGCAXC 
CAACTCCTTC 
GCGGCTGOGA 
CAATGAGATT 

TCGC1ACGTC 
GSCTGAGTGC 
CGGAGCCATG 
GGGGGTXTCG 
GCCGTCTGGT 
TTAGGGGCCC 
CCAAGGCTCC 
TGTCCTTCAT 



51 
1 

AGTTGGGAAA 
CCTATCXGCC 
CTCATTGGCC 
GGACCCGACC 
CCGGGCGCCG 
GGCCAGCCTC 
ACCTTCACGG 
CCTGCGCAGT 
AAGAACCAGT 
CTGATGAAGG 

GcacccGCCG 

CACTCGCTGG 
GACAGCCTGG 
CCCTACGACG 
CCGCAGGACA 
TACtACCCGC 
CAGAGCCOCA 
GTAGACAGCG 
TGGGGACTGT 
OGGGTCCAGC 
GTCCCTGATA 
GGGTGTCGGG 
CGCTGCTCCT 



TGAATAAATG 



CCGTGTCCOG 
AGCTACTCTA 
CGTCCAGGGG 
GCTOAGOCCA 
GGGCGGTTTC 
CTATCTCTGT 



1 11 
I t 
MBHPSPAAAli GKALCALIiLA 
PLFRPPAQWS SLLGAAHSfiD 
HAVP8APAVP SGTGQTSAEL 
ALDLYPYDAG TDSGFTFS&P 
ItVRIiSQSPEA PIPPAPVUS 
RTRYVHVQPA KNGSPCPKLE 



21 31 41 51 

1111 

TLGAAGQPLG GESICflAHAP AKYSITFTK3K HSQTAFPKQY 
YfiPMHKWQYV fiMG^KDPABR GBAWA1KKBI BAAGEALQSV 
BVQRRHSLVS FWRIVPSPD WPVGVD&IiDL CDGDRWREOA 
NFATIPQDTV TEITSSSPSH PANSFYYPBIj KALPP JAEVT 
RDHElVDSAS VPBTPLDCEV SLWSSWGLOG GHCGRLGTKS 
EEAECVPDNC V 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
9G0 
1020 
10B0 
1140 
1200 
12 SO 
1320 
13S0 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



60 
120 
180 
240 
300 



Seq ID HO: 133 DMA sequence 
HUcleic Acid Accession tf : HMJJ19894 
Coding sequence; 1. .1314 

} I* 21 31 41 51 

I 1 I 1 1 | 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCOATGTCAA ACOOCTGCGC 60 

1002 



WO 03/042661 



PCT/US02/36810 



10 

15 
20 



AAACCCCGTA 
CTGAGCCTGQ 
TACTTCCTCT 
CTGGACTGTC 
GCftGTGGCAG 
GGGAACTGGT 
AGGCAOAIGG 
GATCTGGATG 
GGGCCCTGTC 
AAGACGCCCC 
AGCATCCAGT 

GGCTCAGACA 
TTCAACCCCA 
ACTTTCTCAG 
GCCACCCCAC 
GACATACTGC 
GCGTACCRGG 
GACACCTGCC 
GTGGGCATCG 
AAGGTCTCAG 



TCCCCATGGA 
CGAGTATCAT 
GCGGQCAGCC 
CCTTGGGOGA 
TCCGCCTCTC 
TCTCTGCCTG 
GCTACAGCAG 
TTGTTGAAAT 
TCTCAGGCTC 

ACGACAAACA 
K3CACTGCTT 
AACrGGGCAG 
TGTACCCCAA 
GCACAGTCAQ 
TCTGGATCAT 
TGCAGGCGTC 
GQGAAGTCAC 
AGGGTGACAG 
TTAGCTGGGG 
CCTATCTCAA 



GACCTTCAGA 
CATTGTGGTT 
TCTCCACTTC 
GGACGAGGAG 
CAAGGACCGA 
TTTOGACAAC 
CAflACCCACT 
CACAGAAAAC 
CCTGGTCTCC 
TGGGGAGGAG 
GCAD3TCTGT 
CAGSAAACAT 
CTTCCCATCC 
AGACAATGAC 
GCCCATCTGT 
TGGATGGGGC 
AGTCCAGOTC 
CGAGAAGATG 
TGGTGGGCCC 
CTATGGCTGC 
CTGOATCTAC 



AAGGTGGGGA 
GTCCTCATCA 
ATCCCGAGGA 
CACTGTGTCA 
TCCACACTGC 
TTCA.CAGAAG 
TTCAGAGCTG 
AGCCAGGAGC 
CTGCACTGTC 
GCCTCTGTGG 



ACGGATGTGT 

ATCGCCCTCA 
CTGCCCTTCT 
TTTAGGAAGC 
ATTOACAGCA 
ATGTGTGCAG 
CTQATGTACC 
GGGGGCCCGA 
AATGTCTGGA 



TCCCCATCAT 
AGGTGATTCT 
AGCAGCTGTG 
AGAGCTTCCC 
AGGTGCTGGA 
CTCTCGCTGA 
TGGAGATTGG 
TTCGCATGCG 
TTGCCTGTGG 
ATTCT XGGCC 
TCCTGGACCC 
TCAACTGGAA 
CCAAGATCAT 
TGAAGCTGCA 
TTQATGAGGA 
AGAATGGAGG 
CACGGTGCAA 
GCATCCCGGA 
AATCTQACCA 
GCACCOCAGG 
AGGCTGAGCT 



CATAGCACTA 
GGATAAATAC 
TGACGGAGAG 
OGAAOGGCCT 
CTCGGCCACA 
GACAOCCTGT 
CCCAGACCAG 
GAACTCAAGT 
GAAGAGCCTG 
TTGGCAGGTC 
CCACTGGGTC 
GGTGCGGGCA 
CATCATTGAA 
GTTCCCACTC 
GCTCACTCCA 
GAAGATGTCT 
TGCAGAOGAT 
AGGGGGTGTG 
GTGGCATGTG 
AGTATACACC 
GTAA 



120 
18 D 
240 
300 
3SD 
420 
4B0 
540 
€00 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 



25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



Seq ID WO: 134 Protein aequance 
Protein Accession #? NP_063947,1 

1 11 21 31 

I I t I 

MLQDPDSDQP LNSLDVKPLR KPRIBMETFE KVGIPIIIAI* 
YFLCGQPIjHF IPRKQMTC5 LDCPLGEDEE HCVKSFPEGP 
GNWFSACEDN FTEALRETAC KQMGYSSKPT FBAVE IC7PE Q 
GPCXaSGSLVS IiHdiACGKSL KTPKWGGEE ASVDSWPWQV 
LTAAHCFHFH TDVFNWKVRA QSDKLGSFPS IiAVAKIIIIB 
TPSGTVHPJC LPFFDEELTP ATPLWIIGWG PTKQHGGKMS 
AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP I24YQSEQWHV 



Seq ID NO; 135 DMA 
nucleic Acid 



41 51 

I I 
IiSIASIIIW VfclKVIlDKY 
AVAVfcLSKDA BTLOVLt)SAT 
DIiDWEITEN SQELRMHNSS 
SIQYDKQHVC GGSILDPHWV 
FNPMYPKDND IALMKLQFPL 
DltiliQASVQV IDSTROIADD 
VGIV9WGYGC GGPSTPGVYT 



Coding gequence? 



bs ion fti HM_003045 
148, .2037 



1 
I 

CGATCCTGCC 
CTGAGACATC 
OGTCATATTC 



CTGAACACTT 
GTCCTGGCTG 
ATOGCTGCGC 
CCCAAGACGG 
ATCAOCGGCT 
TGGAGCGCCA 
ATGACTCTGA 
ATTCTCATCT 
ATATTCBETT 



TGTTTGAACA 
TTCTCTGGTG 
ATCGCCACCA 
GCGTCCCICT 
ATGCCCTACT 
TGGGAAGGTG 
CTAGGTTCCA 
TTTAAATTCT 
TCGGGTGCOS 
ATGTCCATTG 
TACCAGOCAG 
CCAGCAQRCC 
GCAGAGATGT 
TCTGQGCT7AA 



11 
1 

GGAGCCCCGC 
TTTGCTGCAA 
CAGGTCTGAA 
GGCGGAAGGT 
TTGATCTGGT 
GAGCTGTGGC 
TGGCCTCAGT 
GCTCAQCTTA 
GGAACTTAAT 
OCTTCQACGA 
ACGCCCCCGG 
TGACRGGACT 
GTATTAACGT 
AAAACTGGCA 
ATGACACAAA 
TCCTGTCGGG 
CAGGTGAAGA 
TGATCTGCTT 
TCTGCCTGGA 
CCAAGTACGC 
TGTTIOCCAT 
TAGCCAACGT 



21 
I 

CGCOGCCGGC 
GATCGAGGCT 
CBE3CAACATG 



CCGTGAGAAT 
GCTGGCTGGC 
CCTCTACAGC 
CCTCTCCTAC 
GCTGATAGGC 
CGTGCTGGCT 
TTTAACTCTT 
CCTGGTCCTG 



31 41 

I I 

ntSGATTCTG AAACCTTCCT 
GTCCTCTGGT GAGAAGGTGG 
GGGTGCAAAG TCCTGCTCAA 
AGCCGGGAGG AGACGCGGCT 



GCAGGCCCTG 
CTGTQCTATG 
TATGTCACOG 
ATCATCGGTA 
AGACCCATCG 
GAAAACCCCG 
GGTGTGAAAG 
GGCTTCATAA 



GCACTCTCCT 
AGCAGCCTAA 
AAAATGAATT 

tctctttgaa 

TTGTGAACAT 



CTOGCAGGGT 
AGCAAGACCA 
TTCGTGAACG 
TGGATGCTOA 
TCOCTGGATG 
ACAGCCCCGC 
CACCCCACCC 



CTGCCCTCCT 
AGCTCPCATT 
TCTATCTCAT 
TAGGCTTCAT 
CCGACGAAGC 
CCCCCGOAGG 
TCCCCACCAG 



AGAAGGGAAG 
GGCAGCGACT 
GGTGAAGAAC 
CATCGCCTAC 
CAATAACAGC 
AGTGGCCGTG 
GCCTCGGGTT 
CAATGATAGG 
GATGGCCTTC 
GGCTTACTCG 
CCTQGTATAC 
GGCAAGCACC 
AACCATACTC 
TTCAACCAGC 
GGAGGCTCTC 
CTGTGCCQTG 
TAAGGTTCCC 
GATGCAGCTG 
CATCTACTTT 
AAGOACTCCT 
TOGCAGCAGC 
TGCAACAGAA 



CCCGGTGTTG 
TGCTTCTATG 
CCACAGAAGG 



CCCCTGCCCG 
GGCTCCCTCT 
ATCTATGCCA 
ACCAAAACAC 
CTCT1TGACC 
TTGGTGGCTG 
CAGATOGCCA 
AATGATTCCC 
TCACCCAAAA 
CTTATAGCTG 
ACCAAAGGGG 



TTCCTGCCAG 
GACCAGGGCA 
GGCTATGGCC 
GACOQCAACT 
CCCGAGGGAC 



CCATTGTCAT 
GCGAGTTTGG 
TTGGAGAGCT 
CTTCAAGGGT 
GGGAGTTCTC 
ACATATTCGC 
AGTCGGCCAT 
TGGTGTCAGG 
GQAACACATC 
GTGGATTCAT 
CCTTCGTGGG 
OCATOCCCGT 
CGQCTGCCCT 
ACGCCTTTAA 



TGGCTGAGGA 
CAATAATCGC 
TGAAGGACTT 
CCTGTGTGXX 
GTACTTCCGA 
AGCTGGGGTT 
ACATGGAGCC 
TTCTCATCAT 
GGCTGTGGGC 
TCATCTGGAG 
TGCTOCCCAT 
CCTGGGTCCG 
TGTGGCACAG 



51 
I 

TGTATCCCTC 
TGAGGCTTCC 
CATTGGGCAG 
GTCTOGCTGC 
TGGTGTCTAC 
CTCCTTCCTG 
TOCTCGGGTC 
CTGGGCCTTC 



ACGGACACAC 
AGTGATCATA 
GGTCAACAAA 
ATTTGTGAAA 
AGGCCS3TCTC 
GCC3CTTCGGG 
CTTTGACTGC 
GGGGATCGTG 
C3VCGCTCATG 



GCCCCCAGAG 



CQCCAGTCTT 
TGGACTGCTA 
CACATTAGCC 
GGTGGACCTC 
GGTCTTACGG 
CGAGTTAGAT 
TTTACCAGAG 
TTCCAAAATC 
CACCTTCTGC 
AGTCT1TCTG 
GCAGCCCGAG 
CCTGAGCATC 
GTTTGCTGTG 
GGAGGAGGCG 
CAAOTQACGC 
GACCGGGAGG 



Seq ID NO: 136 Protein sequence 
Protein Accession #: HM_003045 



11 
I 



21 

I 



31 
I 



41 



51 



60 
120 
180 
240 
300 
3 SO 
420 



60 
120 
180 
240 
300 
360 
420 
460 
54 0 
600 
tifiO 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1330 
1380 
1440 
1500 

lseo 

1620 
1680 
1740 
1800 
1660 
1920 
1960 
2040 
210O 



1003 



WO 03/042661 



MGCKVLLNIG QQMC.RRKWD CSREETRLSR CLNTFDLVAL GVGSTLGAGV YVLAGAVARE 60 

NAGPAIVISF LIAALASVLA GLCYGEFGAH VPKTGSAYLY SYVTVGE LWA FlTOWBLILS 12D 

YIIGTSSVAR AWSATPDELr GRPIGEFSRT HMTLNAPGVL AENPDIFAVI IILILTGI.LT 180 

LGVXESAMVN KIPTCINVLV LQF IMVSGFV KGSVKMWQLT EEDFGNTSGR LCLNNDTKEG 240 

KPGVGGFMPP GFSGVLSGAA TCFYAFVGFD CIATTGEEVK NPQKAIPVGI VASLLICFIA 300 

YFGVSAALTL MMPYFCLDNN SPLPDAFKHV GWEGAKYAVA VGSLCAL8AS LLGSMFPMPK 3 GO 

VIYAMAEDGX, LFKFLANVND RTKTPIIATL ASGAVAAVMA FLFDLKDLVD LMSIGTLLAY 420 

SLVAACVLVL RYQPEQPNLV YQMASTSDEL DPADONELAS TNDSQLGFLP EAEMFBLKTI 4BO 

LSPKHMEPSK ISGLIVNIST SLIAVLIITF CIVT7LGREA LTKGALWAVF LLAGSALLCA 540 

WTGVIWHQP ESKTKLSFKV PPLPVLPILS IFVHVYLMMQ LDQGTWVRFA VWMLTGFIIY 600 
FGYGLWHffHET ASLDADGART PDGNLDQCK 

Seq ID NO i 137 DMA sequence 

Nucleic Acid Accession *i NM_032044.l 

Coding sequence: 1B2 . .658 

} H 31 41 51 

1 I I f I 

AAGATATAAA AGCTCCAGftA ACGTTGACTD GGACCACTGG AGACACTGAA GAAGGCAGGG 60 

GCCCTTAGAG TCTTGGTTGC CAAACAGATT TGCAGATCAA GGAGAACCCA GGAGTTTCAA 120 

AGAAGCGCTA GTAAGGTCTC TGAGATCCTT GCACTAGCTA CATCCTCAGQ GTAGGAGGAA 180 

GATGQCTTCC AGAAGCATGC GGCTGCTCCT ATTGCTGAGC TGCCTGGCCA AAACAGGAGT 240 

CCTGGGTGAT ATCATCATGA GACCCAGCTG TGCTCCTGGA TGGTTTTACC ACAAGTCCAA 300 

TTGCTATGGT TACTTCAGGA AGCT3AGGAA CTGGTCTGAT GCCGAQCTCG AGTGTCAGTC 360 

TTACGGAAAC GOAGCCCACC TGGCATCTAT CCTGAGTTTA AAGGAAGCCA aCACCATAGC 420 

AGAGTAGATA AGTGGCTAT C AGAGAAGCCA GCCGATATGG ATTGGOCTGC ACGACCXZACA 480 

GAAGAGGCAG CAGTGGCAGT GOATTGATGG GGCCATGTAT CTGTACAGAT CCTGGTCTGG 540 

CAAGTCCATG ggtgggaaca agcactgtgc tgagatgaqc TCCAATAACA ACTTTTTAAC 600 

TTGGAGCAGC AACGAATGCA ACAAGCGCCA ACACTTOCTG TGCAAGTACC GACCATAGAG 660 

CAAGAATCAA GATTCTGCTA ACTCCTGCAC AGCCCCGTCC TCTTCCTTTC TGCTAGCCTG 720 

GCTAAATCTG CTCATTATTT CAGAGGGGAA ACCTAGCAAA CTAAGAGTGA TAAGGQCCCT 780 

ACTACACTGG CTTTTTTAGG CTTAGAGACA OAAACTTTAQ CATTGGCCCA GTAGTGGCTT 840 

CTAaCTCTAA ATGTTTGOCC COCCATCCCT TTCCACAGTA TCCTTCTTCC CTCCTCCCCT 500 

GTCTCTGGCT GTCTCGAGCA GTCTAGAAGA GTGCATCTCC AGCCTATGAA ACAGCTGGGT 360 

CTTTGGCCAT AAGAAGTAAA GATTTGAAGA CAGAAGGAAG AAACTCAGGA GTAMCTTCT 1030 

AGACCCCTTC AGCTT CTACA CCCTTGTGCC CTCTCTCCAT TGCCTGCACC CCACCCCAGC 1080 

CACTCAACTG CTGCTTGTTT TTCCTTTGGC CATAGGAAGG TTTACCAGTA GAATCCTTGC 1140 
TAGGTTGATG TGGGCCATAC ATT CCTTTAA TAAACGATTQ TGTACATAAG AAAAAAAAAA 

Seq Tt> WO i 138 Protein sequence 
Protein Accession #j NP_114433.1 

1 11 21 31 41 51 

I ! I 1 I | 

MA5BSMRLLL LLSCLAKTGV LGDIIMRPEC APGWFYHKSN CYGYFRKLEN MSDAELECQS 60 

YGNGAHLASI LSLKEASTXA EYISOYQRSQ PIWIGLHDPQ KRQQWQH3DG AMYLYHSWSG 120 
KSMEGNKHCA EMSSNNNFLT WSSNECNKHQ HFLCKYRP 

Seq ID NO; 139 DMA sequence 

Nucleic Acid Accession #; XM_051860,2 

Coding sequences 52.. 3 042 

\ 11 21 31 41 51 

I 1 ) i | | 

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60 

GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120 

GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 160 

CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCTTTCTGAA CTTGGAGGAT 240 

AATGTACAGT CATGGAAACC TGGAGATACC CrGGTCATTG CCAGTACTGA TTACTCCATG 300 

TACCAGGCAG AAQAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCOCAA CCAGGTCAAA 3 60 

GTGGCAGOGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420 

GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAOTGATGG GGGAGATGGA GGACAAATGC 480 

TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540 

AAGTTTaCTC TGGGATTTAA GGCAGCACAC TTOGAGOGCA CGGAGCTGAA GCATATGGGA 600 

CAOCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660 

GGAGGTTATG ACCCACOCAC ATACATCW3G GACCTCTCCA TCCATCATAC ATTCTCTCGC 720 

TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTOTGGG CTATAACTCT 780 

TTGGGCCACT GCTTCTTCAC GGAASATGGG C0GGAGGAAC GCAACACTTT TGAOCACTQT 840 

CTTGGCCTCC TTGTCAAGTC TGGAACOCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 

AAGATBATCA CAGGAGACTC CTACDCRGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 

GCTGTGTCCA CCTTC1GOAT GGCCAATCCC AACAACAACC TCATCAACTG TGCOQCTGCA 1020 

GGATCXGAGG AAACTGGATT TTGOTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 1080 

GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAAOCGA 1140 

GCACATTCCA ACTACCGGGC TGGCATQATC ATAGACAAOG GAQTCAAAAC CACCGAGGCC 1200 

TCTGCCAAGG ACAAG0GGCC GTTCCTCTCA ATCATCTCTG CCAGATACAO CCCTCACCAG 1260 

GAOGOCGACC C3GOTGAAGCC CCGGGAGCCG GOCATCATCA GACACTTCAT TGCCTACAAG 1320 

AACCflGGAOC AOGGGGCCTG GCTGGGOGGC GGGGATGTGT □□CTGGACAG CTGCCGGTTT 1380 

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 

tccaagcaaq agataaagaa cagcttgttt gttggooaqa gtggcaaogt GGGGACGGAA 1500 

ATGATGGACA ATAGGATCTG GGGCOCTGGC GGCTTGGACC ATAGCGGAAG GACCXTOCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 

AACTGCACTT TOGGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGOGC CCTGGCdTC 1680 

OGCCTGAATA ATGCCTGGGA GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG 1740 

GACGTTCCGA TTACTTOCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CftACCAGCTG 1800 



1004 



WO 03/042661 



GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCQ AOGGCTCCGT GTCCGAGTAC IB 60 
CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920 
GTTCCCGACX GGAGAGGGGC CATTFGCAGT GGGTGCTATG CACAGATGTA CATTCAAG.CC 19 SO 
TACAAGACCA GTAACCTGOG AATGAAGATC ATCAAGAATG ACTTCOCCAG CCACCCTCTT 2040 
TACCTGGAGG GGGCGCTCAC CAQGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100 
CTQCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCOSAACT CGCCATCTGG 21 SO 
CTCAXCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 
AOVTTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 
GTCTTCGTGA GOACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 
TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 
GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTQAGAGGA TAAAGATTAA AGCTCTGATT 24 SO 
CCAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACOGAGAGG 2520 
GCTGTOGTAG AC5GTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTOAA AACAAAGGAC 2580 
CATTTCTTGG AGGTGAAGAT GGAGAGXTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640 
TTCGCTTACA TTGAAGTGGA TGGGAAQAAG TACCOCAGTT GGGAGGATGG CATCCAGGOT 27 OO 
GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760 
CTGCAAOGCA TACCATGGCA GCTTTTCAAC TATOTGGCOA CCATCCCTBA CAATTCCATA 2820 
GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGOC CATGGACCAG AGTGCTGGAA 2880 
AAGCTTGGQG CAOACAGGGG TCTCAAGTTG AAAGAGCAAA TGQCATTCGT TGGCTTCAAA 2940 
GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000 
CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTBT GAGGACAGCT GCCGCCOGGT 3O60 
GCCACCTCGT GGTAGACTAT GAOGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 
OTCCCCCAGC CCCTGCCftGC AGCTOCCTGG QAAGQCD37G TTTCAGCCCT GATGGGCCAA 3180 
GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 
CCCCTGGGGC GQTGCraaCC AATGCTQGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 
TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 
GTGCTGACAG CAAAGATCCA CTTTQGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 
GGGCTGGTCA TTCACAGATC CCCATQGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 34 BO 
GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540 
QATTAGOAGC TGGGGTAGAA CTGOCTATCC TTGGGGAAQA GGCAAGCCCT GCCTCTGGCC 3600 
GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 
AGGCCCTTTT AGTTCTGAQA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 
AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 
GAGAGGTGAO AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAQTAQGGC AGGCAGTCAG 3840 
GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900 
CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAQG CTGGGOGCAT 3 960 
TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020 
CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GOCCTTTGCX CACGTCTCTC TGGCCCRCTC 4O80 
ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 
ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 
TGCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCAGAGGG GGCCTCCAGG 4260 
AGACCCTAGA TGTGCTCGTA CTCGCTCGGC CTGGGATTTC AGAGCT-GGAA ATATAGAAAA 4320 
TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 
AACCACACAG CTAAGGGAGG GOCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACAOCACA 4440 
TTGCCTCAAC AACOGGCGGC AQABTGCCCA GGGACTCCTB AGGTAGCTTC TGGAAATGGG 4500 
GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560 
CCCXCCTGCT CCCAGCGCAC ACAAACCCQC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620 
CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 
TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 
GQGCTCGCCA TGTTTC1GGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 
GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCr GGCCCACXTTT 4860 
ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 
ACGAGGCACC AGAGTCTCCC TGGGTCTTGr GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 
CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGQCT CCCTCCACCC AACTGCACCC S040 
ATGAGACTCG GTCCRAGAGT CCAT^CCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 
CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 
CIGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 
AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCAOGGT TTTGTTQAGT TTTCACTCTT 5280 
CTAATGCAAG GGTCTCACAC TGXGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 
AATGTTGAAT GTCTTTGGCT C31GTTCATTT AAAAAAGATA TCIATTTGAA AGTTCTCAGA 5400 
GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 
AGCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TUMTCTTATT GCTTAGAAAA 5520 
TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580 
ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640 
GTTAGATGTA TAGAOTQTTT GTATGTAAAC ATTTCTTGTA QOC3VTCACCA TGAACAAAGA 5700 
TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 
AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 



Seq TD NOi 140 Pro tela sequence 
Protein Accession #: XP_051860.2 

1 11 21 31 41 51 

1 1 I \ I I 

MDGVNLSTBV VYKKGQUYRF ACYDRGRACR SYRVRFLCGK PVRPKLTVTI DTNWNSTIRT 60 

LEENVQSWKP GDTLVIAfiXD Y3KYQAEEFQ VLPCRSCAPBf OVKVAGKPKY LHIGEEIDGV 120 

DMRASVGLL9 RNITVMGEME DKCYPYRNHI CNFFDFDTPG GH1KPALGPK AAHLEGTBLK 180 

HMGQQLVGQV PIHFHLAGDV DERGGYDPPT YIHDIjGIHHT FSRCVTVHGS KGLL I KD WG 240 

"YNSLGHCFFT EEGPBHSNTF DHCLGliliVXS GTL^PSDRDS KMC0OM2TGDS YPGY1PKPRQ 300 

BCmVSTFWM ANPNNNLINC AAAGSEBTGP WFIPHHVPTG P3VGMYSPGY SEHIPLGKFY 360 

MMRAS3WYPA GMIXDHGVKT TBASAKDKHP FX9IXSARYS PHQDADPLKP REPAIJRHPI 420 

AYK2JQEHGAW I*RGGDVWU>5 CRPADKGIGL TLASGGTFFY DDGSKQEIKKT SIjFVGBSGNV 480 

GTEflMDWRIW GPGQUJHSGR TliPIGQKPPI RGIQLYDGPI WIQNCTPRKF VAbEGRHTSA 540 

1AFRLNNAWQ SCPHNSVTGI APED VP ITS R VFFGEPQPWF NOLDMDGDKT SVPEDVDGSV 600 
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SEYPGSYLTK NDNWLVRHPD COJVPDWRGA JCSGCYAQMY IQAYKTSNLR MKIIKNDFPS 660 

HPLYLEGAliT RSTHYQQYQP WTLQKGYTI HWDQTAPAEL AIWLINFNKG DWIRVGLCYF 72 0 

HGTTFSIIiSD VHNRLLKQTS KTGVFVRTLQ HDKVBQSYPG RSHYYWDED3 GLLFLKLKAQ 780 

NEREKFAPCS MKGCERIKIK. ALIPKNAOVS DCTATAYPKF TERAWDVPH PKKLFGSQLK B40 

TKDHFLEVKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IOVWIDGNQ GRWSHTSFR 900 

NSILQGIPWQ LFNYVATIPD N3IVLMA3KG HYVSRGPWTR VLEKLGADRG LH&KEQMAFV 9 SO 
GFXGSFRPIW VTLDTEDHKA KIFQWPlPV VKKKKIi 

Sfeq ID NO: 141 DMA sequence 

IJucleic Acid Accession # : Eos sequence 

Coding sequence c 2 61. .2861 

1 11 21 31 41 51 

1 I I I I I 

QAGCTAGCGC TCAAGCAGAG CCCAGOGCGG TQCTATCGGA CAGAGCCTGG CGAGCGCAAG GO 

CGGCOCGGGG AGCCAGCGGG QCTGAGCG03 GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 

AQAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGQGAGGCA GQACTTCCTC TTCAAGGCCA 300 

TGCT3ACCAT CAGCTGGCTC ACTCTGACCT GCTTCOCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGOCAT GACCAAGACC 420 

AOCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT GOO 

GCCCTTTCCA GGGCAATTTC AOCATCAITT TGTATGGAAG GGCTGATGAA GGTATTCAQC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGOGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGQ CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TOSACCCCAA ATCAGGCACA GTCATCCATT CTGACOGGTT TGACACCTAT AGATCCAAGA 900 

ARGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAQOAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATT TAG ACA CCCTTGGAGT TTTCTAACTG 1080 

TCAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGG CCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAOCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA " 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATOTA CAGTCATGGA IS 60 

AACCTGGAGA TACCCTGGTC ATTGOCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 

TCCAGGTGGT TCCCTGCAGA TCCTGCGCCC CCAACCAGGr CAAAGTGGCA GGGAAACCAA 16SO 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGOACAT GCGGGCGGAG GTTGGGCTTC 1740 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG OOCACGGAGC TGAAGCATAT GGGACAGCAG C1GGTGGGTC 1920 

AGTACCCGAT. TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCITCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTAXAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAABA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

AGrCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 222 0 

ACTCCTACOC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GABGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCOCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA OCGAGCACAT TCCAACXACC 2460 

GGGCTGGCAT GATCATBSAC AACGGAOTCA AAACCACCGA GGCCTCTGCC AAQGAGftAGC 25 20 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACOCGCTGA 2580 

AGCCCCGGGA ©CCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2640 

CCTGGCTGCG C GGCG GGGAT GTGTOGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGaCTTCTT GCTIACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCT CAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTOACTA 2820 

CGGGGTCGCC CTTTGCTCAC GTCTCXCTBQ CCCACTCATG AOGGAGAAGT GTGGTCAGAG 2880 

GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAOAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAQGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 

TCTTCATCCA GGGAACTGAG CAGAGGGGGC CTCCAGGAGA CCCTAGATGT OCTCGTACTC 3060 

CCTCGGCCTq ggatttcaga gctggaaata tagaaaa-eat ctagcccaaa gccttcattt 3120 

TAACAGATGG QGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 31B0 

TGGGGASCCC CACCCTAGCC CTTGCTGCCA CACCACATTQ CCTCAACAAC CGGCCCCAGA 3240 

GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCGCTC GAAGGAAAGQ 3300 

AAATGACTAG AGTAGAATGA CAGCTMCAG ATCTCTTCCC TCCTGCTOCC AGCGCACACA 3360 

AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGOCTT CACTTTGTTC ACTACCTGTC 3420 

AGCOCAGCCT GGGTOCACAO TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 3480 

TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGRGTAGGG CTCGCCATOT TTCTGGTGAG 3540 

CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 3600 

TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3660 

TAAGAGGGAG AACTCTATCT G T GG T TTATA ATCTTGCACG AGGCACCAGA GTCTCCCtGG 3720 

GTCTTGTGAT GAACTACATT TATCCOCITT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780 

AGAGGHCCTG GCTOGCTOCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 3840 

TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3900 

GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA 3960 

ggaaggactt cttccaggga gattagtggx gatggagagg agagttaaaa TGACCTCATQ 4020 

TCCTTCTTGT CCACGGTTTT GITGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 4080 

GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG 4140 

T1CATTTAAA AAAGAT AT CT ATTTGAAAGT TCTCAGAGTT GTACATATGT TTCACAGTAC 42 OO 
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AG GAT CT G TA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 4260 

TITCITGGTA GCACAAATTT TCTTATTGCT TAQAAAATTG TC CTCCT TQT TATTTCTGTT 4320 

TGTAAGAC TT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 4380 

TTTTTCTGTT GCCGAAATAQ CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 4440 

TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCEATT TATTTATTAT 45 00 

ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4560 

GAGATGTCCT TTGCATTGCT TGGAAGGGGT BTACCTAGAG CCAAGGAAAT TGGCTCTGGT 4620 

TTGGAAAAAT TTTGCTGTTA TTATAOTAAA CATACAAAGG ATGTCAAAAA AAAAAAAAAA 4680 
AAAAAAAAAA AAAAAAAAAA AA 
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Seq ID NO: 142 Protein sequence 
Protein Accession 4t s Bos sequence 



MQAAGRQDPIi FKAMHTISKL TLTCPPGATS TVAAGCPDQS PELQFWNPGH DQDBHVHIGO. 60 

GKTLLLrSSA TVYSIHISEG GKLVIKDHDE PIVLRTRBIL IDNQGEI^HAG SALCPFQGNF 120 

TIILTTGRADE GIQFDPYYGL KYIGVGKGGA LEfcHGQKKLS WTFLNKTiHP GGMAEGG*FF 180 

ERSKGHEGVL VHVIDPKSGT VZHSDRFDTY RSKKBBERLV QY1KAVPDGR I1»SVAVNDBG 240 

Z\J SRNIjDDMARK AMTKLGSKHF LHLGFRHPHS PLTVRGNPSS BVEDHIEYHG HRGSAAARVP 300 

KLFQTBHGE3f FNVSLSSBWV QDVEWTEWPD HDKVSQTKGG HKISDLWKAH PGKICNRPID 360 

IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYKVR FLCGKPVRPK IVTVTIDTNVN 420 

STIUOlrEDNV QSWKPGDTIiV IA3TDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHIGE 480 

EIDGVDMHAE VGLL9RNZIV MGEMBDKCYP YRBHlCNFFD FDTFGGHIKF AI4GFKAAHLE 540 

ZD GTELKKMGQQ X.VGOYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHBSNGI»I>I 600 

KDWGYBTSLG HCFFTBDGPE ERNTFDHCU3 LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 

PKPRQDCNAV STPWMANPNN KLIHCAAAGS EETGFWFIFH HVPTGPSVGM YSPGYSEHJP 720 

LGKFYNNRAH SNYRAGMIID MGVKTTERSA KDKRPPliBlI 9ARYSPHQDA DPLKPREPAZ 780 

^ A IHHFIAYKNQ DHGAWLRGGD VHU3SCHPRG EAQBGFUiTG MKAGGILXGG J5EAAEGMAOG 840 
FSPPCRGliLK LVTTGSPFAH VSLAHS 



Seq ID NO: 143 KM DNA sequence 
Nucleic Acid Accession #1 NM_J)06G17 
Coding secjuence; 38*. 2635 

1 11 21 31 41 51 

I I 1 I } { 

CCAAGTTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCCTCGTAC TOTGCTCCCT 6D 

GTTGCTGCTG GGGCTGTGCG GGAACTOCTT TTCABGAGGG CAGCCTTCAT CCACAGATGC 120 

TCCTAAOaCT TGGAATTATG AATTGCCK3C AACAAATTAT GAGACCCAAG ACTCCCATAA 180 

AGCTGGACOC ATTGGCATTC TCTTTGAACT AGTQCATATC TTTCTCTATG TGGTACAGCC 240 

GCGTGATTTC CCAGAAGATA CTTTGAGAAA ATTCTTACAG AAGGCATATG AATCCAAAAT 3 00 

TGAT TATGAC AAGCCAGAAA CTOTAATC TT AGOTCTAAAG ATTGTCTACT ATGAAGCAGG 360 

GA ^^ TTCTA TGCTGTG TCC TGGGGCTGCT GTTTATTATT CXGATGCCTC TGGTGGGGTA 420 

TTTCTTTIGT ATGTGTOGTT GCTGTAACAA ATGTGGTGGA GAAATGCAGC AGCOACAGAA 480 

GGAAAATGGG CCCTTCCTGA GGAAATGCTT TGCAATCTCC CTGTTGGTGA TTTGTATAAT 540 

AATAAGCATT GGCATCTTCT ATGGTTTTGT GGCAAATCAC CAGQTAAGAA CCCGGATCAA 600 

AAGGAGTGGG AAACTGGCAG ATAGCAATTT CAAGGACTTG CGAACXCTCT TGAATGAAAC 660 

TCCAGAGCAA AT CAAATAT A TATTGGCCCA GTACAACACT ACCAAGGACA AGGOSTTCAC 720 

AGATCTGAAC AGTATCAATT CAGTGCTAGG AGGGGGAATT CXTGACCGAC TGAGACCCAA 780 

CATCATOCCT GTTCTTGATG AGATTAAGTC CATGGCAACA GCGATCAAGG AGACCAAAGA 640 

GGOGTTGGAG AACATGAACA GCACCTTGAA GAGCTTGCAC CAACAAAGTA C31CAGCTTAG 900 

CAGCRGTCTG ACCAGCGTGA AAACTAGCCT GCGGTCATCT CTCAATX3ACC CTCTGTGCTT 960 

GGTGCATCCA TCAAGTGAAA CCTGCAACAG CATCAGATTG TCTCTAAGCC AG CTG AATAG 1020 

CAACCCTGAA CTGAGGCAGC TTCCACCCGT GGATGCAGAA CTTGACAACG TTAATAACGT 1080 

TCTTAGGACA GATTTGGATG GCCTGGTCCA ACAGGGCTAT CAATCCCTTA ATGATATACC 1140 

TGACAGAGTA CAACGCCAAA CCACGACTGT OGTAQCAGGT ATCAAAAGGG TCTTGAATTC 1200 

CATXGGTTCA GATATOGACA ATGTAACTCA GCGTCTTOCT ATTCAGGATA XACTCTCAGC 1265 

ATrCTCTGTT TATGTTAATA ACACTQAAAG TTACATDCAC AGAAAXTTAC CTACATTGGA 1320 

AGAGXATGAT TCATACTGGT GGCTGGGTGG CCTGGTCATC TGCTCTCTGC TGACCCTCAT 1380 

CGTGATTTTT TACCACCTGG GCTTACTGTG TGGCGTGTGC GGCTATGACA GGCATGCCAC 1440 

CCCGAOCACC OaAGGCTGTG TCTCCAACAC CGGAGGCGTC TTCCTCATGG TTGGAGTTGG 1500 

ATTAAGTTTC CTCTTTTGCT GGATATTGAT GATGATTGTG GTTCTTACCT TTGTCTTTGG 1560 

TGCAAATGTG GAAAAACTGA TCTGTGAACC TTACACGAGC AAGGAATTAT TCGGGGTTTT 1620 

GGATACACGC TACTTACTAA ATGAAG ACTG GGAATACTAT CTCTCTGGGA AGCTATTTAA 1680 

rAAATCAAAA ATGAAGCTCA CTTTTGAACA AGTTTACAGT GACTGCAAAA AAAATAGAGG 1740 

CACTTACGGC ACTCTTCACC TGCAGAACAG CTTCAATATC AGTGAACATC TCBUUJAVTAA 1800 

TGAGCATACT GGAAGCATAA GCAGTGAATT GGAAAGTCTG AAGGTAAATC TTAATATCTT I860 



TCTGT1QGGT GCAGCAGGAA GAAAAAACCT TCAGGATTTT GCTGCTTGTG GAATAGACAG 1920 
AArGAATTAT GACAGCTACT TGGCTCAGAC TGGTAAATCC COCOCAGGAG TGAATGTTTT 1980 

75 



ATCATTTGCA TATGAXCTAG AAGCJAAAAGC AAACAGTTTG CCCCCAGGAA ATTTGAGGAA 2040 

C^TCC3CTGAAA AGAGATGCAC AAACTATTAA AACAATTCAC CAGC3VACGAG TCCTTCCTAT 2100 

AGAACAATCA CTGAGCACTC TATACCAAAG OGTCAAGATA CTTCAAGGCA CAGGGAATGG 2160 

ATTGTTGGAG AGAGTAACTA GGATTCTAGC TTCTCTGGAT TTTOCTCAGA ACTTGATCAC 2220 

AAACftATACT TCCTCTGlTA TTATTGftG3A AACTAAGAAQ TATGGGAGAA CAATAATAGG 2280 

ATATT TTGAA CATTATCTGC AGTGGATCGA GTTCTCTATC AGTGAGAAAG TGGCATCGTG 2340 

CAAACCTGTG GCCACOGCTC TAGATACTGC TGTTGATBTC TTTCTGTGTA GCTACATTAT 2400 

CGACCCCTTG AATTTGTTTT GOrrTGGCAT AGGAAAAGCT ACTGTATTTT TACTTGCGGC 2460 

TCTAATTTTT GCGGTAAAAC TGGCTAAGTA CTATCGTCGA ATGGATTCGG AGGACGTGTA 2520 

CGATOATGTT GAAACTATAC OCATGAAAAA TATGOAAAAT GGTAAXAATG GTTATCATAA 2580 

AGATCATGTA TATGGTATTC ACAATOCTGT TATGACAAGC OCATCACAAC ATTGATAGCT 2640 

GATGTTGAAA CTGCTTGAaC ATCAGGATAC TCAAAGTGGA AAGGATCACA GATTTTTGGT 2700 

AGTTTCTGGG TCTACAAGGA CTTTCCAAAT GCAGGAGCAA OGCCAGTGGC AACGTAGTGA 2760 

CICAGGCGGG CACCAAGGGA ACGGCACCAT XGGTCTCTGG GTAGTGCTTT AAGAATGAAC 2620 
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ACAATCACGT 
TATTTTTGTT 

ACATATTGCC 
GTGTACAGTA 
TTATAGGACT 
AAATAATCCA 
TTCATCTATT 
CCATCATAGA 
GAGAGAGAAA 
ACAOTTTAAC 
GAGTATCATC 
TTC T TAG ATC 
AAAGCACAAG 
AAGTGATCAG 
GATGTAGGCA 
AAAAAGGAAC 



TATAGTCCAT 
TTTTACTTTT 
ATTGGATGCA 
ATGTGTGGAG 
AACGGTQTAT 
TTCTTCTAAA 
TTTTCACTAA 
ATACXTTATC 
GAAACCTGOG 
TGCTGTTCGT 
AGAGTTTTCT 
AGACTAQTAT 
TGGTGTCCAG 
GTAAATATT C 
CAATGAAGAA 
TTCCCCCCAG 
TTGG 



GGTCCATCAC 
TTACACTQAG 
TTTCTATCAA 
TGTC3CTGAAC 
ATACCTTTTG 
TGASCTAAAT 
AAGTGTGTGA 
AAAGATTGGC 
TAACTCCATC 
TCAAAAGTGG 
GTTGCATTAG 
CAAGTGTTCT 
CATGGATGAA 
ATTTGCTTCA 
CTGGTCGGAC 
GTCTTTTCAT 



TATTCAAGGA 
TTTCTATTTA 
AACTCTATCA 
ACACACCAGT 
TTACCACAGA 
AAGTCACCAT 
AACCTACAGC 
CATGTTCCAC 
TGACAAATXC 
AGTTGTTTTA 
GATAAACATT 
AAAATGAAAT 
ACCTTTGAGT 
GGAGTTTCAT 
AAAATTTAAC 
GTGCAGATTG 



TGACTCCCTC 
GACACTACAA 
AATGTGATGG 
TTACAGGAAA 
GTTTTTTAAA 
TG ACT TCTTG 
ATATTCTTCA 
TTGGAAATGG 
AAAAGAGAGA 
ACAGATGCCA 
AATTGGAGTG 
ATGAGAAGAT 
TTGCJTCCCTA 
GTTGGATCTG 
GT TGATGTAA 
CAGTTCTGAT 



Seq in NO: 144 NP protein sequence 
Protein Acceosion NP OO60O8.1 



1 
I 

MALVLGSLLL 
IFLYWQ PHD 
ILMFLVGYFP 
HQVRTEIKRS 
ILDRLRPNII 
SLNDPLCLVH 
YQ9LKDIPJ7R 



VFLMVGVGLS 
YLSGKLFNXS 
IiKVUIJJIFLl, 
LPPGNLHNSl* 
DFAQNP I TNN 

vPLceyiiDP 

NGNNGYHKDH 



11 
I 

LGLCONSFSG 
FPBDTIiRKFL 
CMCRCClffKCG 
RKIADSNFKD 
PVIiDEIKSMA 

VQRQTTTWA 
D9YWWUGGX»V 
FLFCWILMII 
KMKI/TPSQVY 
GAAGSKNIiQD 
KRDAQTIK.TI 
TSSVtlEETK 
LNIiPWPGIGK 
VYGIHMPVMT 



21 



GQPfiSTOAPK 
QKAYE9KIDY 



IiPTLlLNETPE 
TAXKBTKEAli 
LSLSQLNfiNP 
GISRVXiNSIG 
ICSLLTLIVI 
WIiTFVPGAW 
SDCKKKRGTY 
FAACGIDHMN 
KQQRVL>PTKQ 
KYGRTIIGYF 
ATVFIiLPAliI 
SPSQH 



31 41 

I 1 

AWNYELPATN YBTQDSHKAG 
PKPETVILGL KIVYYEAGII 
GPFLRKCPA1 5LLVICTII8 
QI1CYIIAQYN TTKDKAPTDL 



KLRQLPFVDA 
SDIDNVTQRL 
PYYLGLLCGV 
VKKLICSPYT 
GTKHIiQNSPK 
YDSYLAQTGK 
SLSTLYQSVK 
EHYLQWIEPS 
FAVKLAKYYR 



Seq ID NO: 145 DMA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequences 38.. 2605 



1 

l 

CCAAGTTCTA 
GTTGCTQCTO 
TGCTAAGGCr 
AGCTGGACCC 
GCGTGATTTC 
TGATTATGAC 

gtttattatt 
atgtggtgga 
tccaatctcc 
ggcaaatcac 
caaggacttg 
gtacaacact 
aggcggaatt 
catggcaaca 
gagcttgcac 
gcggtcatct 

CATCAGATTG 

goaxgcagaa 

ACAGGGCTAT 
CGTAQCAGQT 
GCGTCTTCCT 

ttacatccac 

CCTGGTCATC 
TGGOSTGTGC 
CGGAGGCGTC 
GATCATTGTG 
TTACACGAGC 
GGAATACTAT 
AGTTTACAGT 
CTTCAATATC 
GGAAAGTCTG 
TCAGGATTTT 
TGGTAAATCC 
AAACAGTTTG 
AACAATTCAC 
QGTCAAGATA 
TTCTCTGGAT 
AACTAAGAAG 
GTTCTCTATC 



11 
1 

CCTCATGTTT 
GGGCTGTGCG 

tggaattatg 
attggcattc 
ccagaagata 

ATTGTCTACT 
CTGATGCCTC 
GAAATGCACC 
CTGTTGGTQA 
CAGGTAAGAA 



ACCAAGGACA 
CTTGACCGAC 
GCGATCAAGG 
CAACAAAGTA 
CTCA&TGACC 
TCTCTAAGCC 
CTTGACAACG 
CAATCCCTTA 
ATCAAAAGGG 
ATTCAGGATA 
AGAAATTTAC 
TQCTCTCTGC 
GGCTATGACA 
TTCCTCATGG 
GTTCTTACCT 
AAGGAATTAT 

ctctctggga 
gactgcaaaa 
agtgaacatc 

AAGGTAAATC 

GCtgcxtgtg 

COCGCAGGAG 
COCCCAGGAA 
CAGCAACGAG 
CTTCAAOGCA 
TTTGCTCAGA 
TATGGOAGAA 
AGTGAGAAAG 



21 
I 

GGAGGATCTT 
GGAACTCCTT 
AATTGCCTGC 
TCTTTQAACT 
CrrTGAGAAA 
ATGAAGCAGQ 
TGGTGGGGTA 
AGCGACAGAA 
TTTGTATAAT 
CCCGGATCAA 
TGAATX3AAAC 
AGGCGTTCAC 
TGAQACCCAA 
AGACCAAAGA 
CACAGCTTAB 
CrCTGTGCTT 
AGCTGAATAG 
TTAATAACGT 
ATGATATACC 
TCTTQAATTC 
TACTCTCAGC 
CTACATTGGA 
TGACCCTCAT 
GGCATGCCAC 
TTGGAGTTGG 
TTGTCTTTGG 
TCCGGGTTTT 
AGCTATTTAA 
AAAATAGAGG 
TCAACATTAA 
TTAATATCTT 
GAA TAGAGAG 
TGAATCTTTT 
ATTTGAGGAA 
TCCTTCCTAT 
CAGGGAATGG 
ACTTCATCAC 
CAATAATAGG 



31 
1 



TTCAGGAGGG 
AACAAATTAT 
AGTGCATATC 
ATTCTTACAG 
GATTATTCTA 
TTTCTTTTGT 
GGAAAATGGG 
AATAAGCATT 
AAGGAGTCGG 
TCCAGAGCAA 
AG AT CTGAAC 
CATCATCCCT 
GGCGTTGGAG 
CAGCAGTCTG 
GGTGCATCCA 
CAACCCTGAA 
TCTTAGGACA 
TGACAGAGTA 
CATTGGTTCA 
ATTCTCTGTT 
AGAGTATGAT 
CGTGATTTTT 
CCCGACCACC 
ATTAAGTTTC 
TGCAAATGTG 
GGATACACCC 
TAAATCAAAA 
CACTTACGGC 
TGAGCATACT 
TCTGTTGGGT 
AATGAAT TAT 
ATCATTTGCA 
CTCCCTGAAA 
AGAACAATCA 
ATTGTTGGAG 
AAACAATACT 
ATATTTTGAA 
CAAACCTGTG 



ELDNVNNVLR 
PIQD1LSAFS 
CGYDRfiATPT 
9KELFRVLDT 
ISEHLNINEH 
SPAGVHDDSF 
ILQRTGNGLI* 
IfiEKVASCKP 
EMDSEDVYDD 



41 
I 

GCCCTCGTAC 
CAGCCTTCAT 
GAGACCCAAG 
TTTCTCTATG 
AAGGCATATG 
TGCTGTGTCC 
ATGTGTCGTT 
CCCTTCCTGA 
GGCATCTTCT 
AAACTGGCAG 
ATCAAATATA 
AGTATCAATT 
GTTCTTGATG 
AACATGAACA 
ACCAGCGTGA 
TCAAGTGAAA 
CTGAGGCAGC 
GATTTGGATG 
CAACGCCAAA 
GATATOGACA 
TATGTTAATA 
TCATACTGGT 
TACTACCTGG 
CGAQGCTGTG 
CTCTTTTGCT 
GAAAAACTGA 
TACTTACTAA 
ATGAAGCTCA 
ACTCTTCACC 
GGAAGCATAA 
GCAGCAGGAA 
GACAGCTACT 
TATGATCTAG 
AGAGATGCAC 
CTGAGCACTC 
AGAGTAACTA 
TCCTGTGTTA 
CATTATCTGC 
GCC31CCGCTC 



CCTTCCTGTC 
CATATGGGGT 
CTAGATT C T A 
GATGCATTTT 
CAAATGAGTA 
GTGCTGTTGA 
CGCAGAGATT 
GATGCAAAAG 
GAGAGATCTT 
ATTACGGTGT 
CAGCTAACAT 
CCTGTGACAA 
AATTTGCATG 
TCATTATCAA 
TGGAATTCCA 
TCATTTGAAT 



SI 
I 

pigilfelvh 
lccvlgli.fi 

IGIFYGFVAN 
NSIHSVLGGG 
LTSVKT8LHS 
TDLDGLVQQG 
VYVHHTESYI 
TUGCVSHTGG 
PYLIiNEDWBY 
TGSISSELBS 
AYDLEAKANS 
BRVTRXLASI* 
VATALDTAVD 



2980 
2940 
3000 
3060 
3120 
31HQ 
3240 
3300 
3360 
3420 
34B0 
3540 
3600 
3660 
3720 
3780 



60 
120 

iao 

240 
300 
360 
420 
400 
540 
60 0 
660 
720 
780 
640 



51 
I 

TOGGCTOCCT 
CCACAGATGC 
ACTCCCATAA 
TGGTACAGCC 
AATGCAAAAT 
TBGGGCTGCT 
GCTGTAACAA 
GGAAATGCTT 
ATGGTTTTGT 
ATAGCAATTT 
TATTGGCCCA 
CAGTGCTAGG 
AGATTAAGTC 
GCACCTTBAA 
AAACTAGCCT 
OCTGCAACAG 
TTCCACCCGT 
GCCTGGTCCA 
CCACGACTGT 
ATGTAACTCA 
ACACTGAAAG 
GGCTGGGTGG 
GCTTACTGTG 
TCTOCAACAC 
GGATAT TGAT 
TCTGTGAACC 
ATGAAGACTG 
CTTTTGAACA 
TGCAGAACAG 
GCAGTGAATT 
GAAAAAAC CT 
TGGCTCAGAC 
AAGCAAAAGC 
AAACTATTAA 
TATACCAAAG 
GGATTCTAGC 
TTATTGAGGA 
AGTGGATCGA 
TAGATACTGC 



60 
120 
1B0 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



1008 



WO 03/042661 PCT/US02/36810 



TGTTGATGTC TTTCTGTGTA GCTACATTAT OHACOCCTTG AATTTGTTTT GGTTTGGCAT 2400 

AGGAAAAGCT ACTGTATTTT TACTTCCGGC TCTAATTTTT GOOGTAAAAC TGGCTAAGTA 2460 

CTATCGTCGA ATGGATTCGG AGGACGTGTA CGATGATGTT GAAACTATAC CCATQAAAAA 2520 

5TATGGAMAT GGTAATAATG GTTATCATAA AGATCATGTA TATGGTATTC ACAATCCTGT 2 580 

TATGACAAGC CCATCACAAC ATTGATAGCT GATGTTGAAA CTGCTTGAGC ATCAGGATAC 2S40 

TCAAAGTGQA AAGGATCACA GATTTTTGGT AGTTTCTGGG TCTACAAOGA CTTTCCAAAT 2700 

CCAGGAGCAA CQCCAGTGGC AACGTAGTGA CTCAGGCGGG CACCAAGGCA AOGGCACCAT 2760 

TG3TCTCTGG GTAGTGCTTT AAGAATGAAC ACAATCACGT 1ATAGTCCAT GGTOCATCAC 2820 

„ TATTCAAGGA TGACTCCCTC CCJTCCTGTC TATTT TTGT T TTTTACTTTT TTACACTGAG 2 960 

' 1\) TTTCTATTTA GACACTACAA CATATGGGGT GTTTGTTCCC ATTGGATGCA TTTCTATCAA 2940 

AACTCTATCA AATGTGATGG CTAGATTCTA ACATATTGCC ATGTGTGGAG TGTGCTGAAC 3000 

ACACACCAGT TTACAGGAAA GATGCATTTT GTGTACAGTA AACGGTGTAT ATACCTTTTG 3060 

TTACCAGAGA GTTTTTTAAA CAAATGAGTA TTATAGGACT TTCTTCTAAA TGAGCTAAAT 3X20 

AAGTCACCAT TGACTTCTTG GTGCTGTTGA AAATAATCCA TTTTCACTAA AAGTGTGTGA 3180 

ID AACCTACAGC ATATTCTTCA CGCAGAGATT TTCAT CT ATT ATACTTTATC AAAGATTGGC 3240 

CATGTTCCAC TTGGAAATGG CATGCAAAAS CCATCATAGA GAAACCTGCG TAACTCCATC 3300 

TGACAAATTC AAAAGAGAGA GAGAGATCTT GAGAGAGAAA TGCTGTTCGT TCAAAAOTOG 3360 

AGTTGTTTTA ACAOATGCCA ATTACGGTGT ACAGT TTAAC AGAGTTTTCT GTTGCATTAG 3420 

„ GATAAACATT AATTGGAGTG CAGCTAACAT G AGTAT CATC AGACTAGTAT CAAGTGTTCT 3480 

ZU AAAATGAAAT ATQAGAAGAT CCXGTCACAA TTCTTAGATC TGGTGTCCAG CATGGATGAA 3540 

AOCTTTBAGT TTGGTCCCIA AATTTGCATG AAAGCACAAG GTAAATATTC ATTTGCTTCA 3600 

GGAGTTTCAT GTTGGATCTG TCATTATCAA AAGTGATCAG CAATQAAOAA CTGGTCGGAG 3 6 B0 

AAAATTTAAC GTTGATGTAA TGOAATTCCA GATGTAGGCA TTCCCCCCAG GTCTTTTCAT 3720 
GTGCAGATTG CAGTTCTGAT TCATTTGAAT AAAAAGGAAC TTGG 



25 



45 



50 



Seq ID W0: 146 Protein sequence 
Protein Accession #: 



™ 1 11 . 21 31 41 51 

30 | i i | | | 

MALVLGSLLL LGLCOCTSFSG GQP8STDAPK AWNYKKPATN YETQDEHXAG PIGILFELVH 60 

IFLYWQPRD FPJSDTLUKFIi QKAYESKIDY DIVYYKAGII LCCVLGIiLFX IIjMFIiVGYFF 120 

CWCRCCHKCG GEMHQRQKBN GPPLUKjCFAr SLIiVICIIIS IGIFYGFVAN HQVRTRISORS 100 

„ RKLADSHPKD IjRTLLNETPE QIKYILAQYN TTKDKAFTDL KSINSVLGGG ILDHLRPNJI 240 

D 3 PVTiDSIKSMA TAIKBTKEAIi ENMNSTLKSli HQQgTOLSSS LTSVKTSLRS SUJDPLCLVH 300 

PSB KTCNS IR LSLSQLNSNP ELRQLPPVDA ELDHVNHVM TDLDG1.VQQG YQSLKDIPDR 360 

VQRQTTTWA GIKHVLNS3G SDIDNVTQaL PIQDII^AFS VYVMHTESYI EKNLPTLEEY 420 

DSYWWU3GI.V ICSLLTLIVI FYYIxGDLCGV CGYDRHATPT TRGCVSNTGG VFLMVGVGLS 480 

Af\ FX^FCH ILMI I Wr.TFVPGAW VEKLICEPYT SKEfcFHVLDT PYLLNEDWEY YXjSGKLFNKS 540 

W KMKIiTFEQVY SDCKKHRGTY GTLHLQNSFN ISEHLSINEH TGS IS SELES IJCVNLKTFU^ 600 

OAAG5RKNLQD FAACGIDRWN YDSYLAQTGK SPAGVNLLSF AYDLEAKRNS JLPPGNIiHNSTj 660 

KRBAQTIKTI HQQKVLPIEQ SLSnOTQSVJC ILQftTGNGLI* EKVTRILASI. DFAQNFITNN 720 

TSSVIIEETK KYGRTJIGYF EHYLQWIEF3 1SEKVASCKP VATALDTAVD VFLCSYIIDF 780 

LNLFWFGIGK ATVFLLPAil FAVKLAKYYR RMDSEDVYDD VETIPMKNME NGNHGYHKDH 840 
VYGIHNPVMT SP8QH 



Seq ID NO; 147 UNA sequence 

Nucleic Acid Accession #: BC012089.1 

Goding sequence i 236.. 2806 



1 11 21 31 41 51 

I 1 I I I 1 

ATGCTCTCAG CTCTCCOGCC GCGGGATGGT GCCTTGAGTG AATGACOCCG TTGGAGAACA 60 

_ ^ TTCTTCOGCA TCCCTCGCCT CARGCCAGCC TCAGACAGAA AACTGAAGAT TCAGCAGATC 120 

DD CAGTGCTTCC TGCTCCTCTT CIGGOCAGGA AC3U30CTTGC CTTGOOCAAG GCTTCCAGAA 100 

GCTCTGAGGC AGGA0GCACC AAGTTCTACC TCATGTTTGG AGQATCTTGC TAGCTATGGC 240 

CCTCGTACTG GGCTCXXTGT TGCTGCTGGG GCTGTGCGGG AACTCCTTTT CAGGAGGGCA 300 

GOCTTCATOC ACAGATGCTC CTAAGGCTTG GAATTATGAA TTGCCTGCAA CAAATIATGA 360 

GACCCAAGAC TOCCATAAAG CTGGACCCAT TGGCATTCTC TTTGAACTAG TGCATATCTT 420 

OU TCTCTATGTG GTACAGCOGC GTGATTTCCC AGAAGATACT TTQAOAAAAT TCTTACAOAA 480 

GGCATATGAA .TCCAAAATTG ATTATGACAA GATTGTCTAC TATGAAGCAG GG ATT ATT CT 540 

ATGCTGTGTC CTGGGGCTGC TGTTTATTAT TCTGATGCCT CTGGTGGGGT ATTTCTTXTG 600 

TATGTGTCGT TGCTGTAACA AATGTGGTGG AGAAATGCAC CAGCOACAGA AGGAAAATGG 660 

GCCCTTCCTG AGGAAATGCT TTGCAATCTC CCTGTTGGTG ATTTGTATAA TAATAAGCAT 720 

Oj TGGCATCTTC TATGGTTTTG TGGCAAATCA CCAGGTAAGA ACCCGOATCA AAAGGAGTCG 780 

GAAACTGGCA GATAGCAATT TCAAGGACTT GCGAACTCTC TTGAATGAAA CTCCAGAGCA 040 

AATCAAATAT ATATTGGCCC AGTACAACAC TACCAAGGAC AAGGCGTTCA CAGATCTQAA 9O0 

CAGTATCAAT TCAGTGCfTAG GAGGCGGAAT TCTTGACOGA CTGAGACCCA ACATCATCCC 960 

TOTGTTCTTGAT GAGATTAAGT OCATGGCAAC AGCGATCAAG GAGACCAAAG AGGCQTTGGA 1020 

GAACAT GAAC AGCACCTTGA AGAGCTTGCA CCAACAAAGT ACACAGCTTA GCAGCAGTCT 1080 

GACCAGCGTG AAAACTAGCC TGCGGTCATC TCTCAATGAC CCTCTGTGCT TGGTGCATCC 1140 

ATCAAGTGAA ACCTQCAACA GCRTCAQATT GTCTCTAAGC CAGCTGAATA GCAACCCTGA 1200 

ACTGAGGCAG CTTCCACCCG TGGATGCAGA ACTTGACAAC GTTAATAACG TTCTTAGGAC 1260 

AGATTTGGAT GGCCTGGTCC AACAGGQCTA TCAATCCCTT AATGATATAC CTGACAGAGT 1320 

tO ACAACGOCAA ACCACGACTG TC53TAGCAGG TATCAAAAQQ QTCTTGAATT CCATTGGTTC 1380 

AGATATCGAC AATGTRACTC AGCGTCTTCC TATTCAGGAT ATACTCTCAG CATTCTCTGT 1440 

TTATGTTAAT AACACTGAAA GTTACATCCA CAGAAATTTA CCTACATTGG AAGAGTATGA 1500 

TTCATACTGG TGGCTGGGTG GCCTGGTCAT CTGCTCTCTG CTGACCCTCA TCGTGATTTT 1560 

opj TTACTACCTG GGCTTACTGT GTGGCGTGTG CGGCTATGAC AGGCATGCCA CCCCGACCAC 1620 

COG^GCTGT GTCTCCAACA CCGGAGGCGT CTTCCTCATG GTTGGAGTTQ GATTAAGTTT 1680 

CCTCTTTTGC TGGATATTGA TGATCATTGT GGTTCTTACC TTTGTCTTTG GTGCAAATGT 1740 

GGAAAAACTG ATCTGTGAAC CTTACACGAG CAAGGAATTA TTCCGGGTTT TGGATACAOC 1800 

CTACTTACTA AATGAAGACT GGGAATACTA TCTCTCTGGG AAGCTATTTA ATAAATCAAA 1860 

AATGAAGCTC ACTTTTGAAC AAGTTTACAG TGACTGCAAA AAAAATAGAG GCACTTACGG 1920 

1009 



WO 03/042661 



PCT/US02/36810 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



CACTCTTCAC 
TGGAAGCATA 
TGCAGCAGGA 
TGACAGCTAC 
ATATG AT CT A 
AAGAGATGCA 
ACTGAGCACT 
6AGAGTAACT 

ACATTATCTG 
GGCCACCGCT 
GAATTTGTTT 
TGCGGTAAAA 
TGAAACTATA 
ATATGGTATT 
ACTeCTTGAQ 
GTCTACAA6G 
GCACCAAOGC 
T13VTAGTCCA 
TTTTTRCTTT 
CATTGGATGC 
CATGTOTGGA 
AAACGGTGTA 
TTTCTTCTAA 
ATT3TCACTA 
TATACTTTAT 
AGAAACCTGC 
ATGCTGTTCG 
CAGAGTTTTC 
CAGACTAGTA 
CTGQTOTCCA 
GGTAAATATT 
GCAATGAAGA 
ATTOOCCCCA 
CTTGGAAAAA 



CTGCAGAACA 
AGCAGTGAAT 
AGAAAAAACC 
TTGGCTCAGA 
GAAGCAAAA& 
CAAACTATTA 
CTATACCAAA 
AGGATTCTAG 
ATTATTGAGG 
CAGTGGATCG 
CTAGATACTG 
TGGTTTGGCA 
CTGOCTAAGT 
CCCATGAAAA 
CACAATCCTQ 
CATCAGGATA 
ACTTTCCAAA 
AACGGCACCA 
TGGTCCATCA 
TTTACACTGA 
ATTTCTATCA 
GTGTGCTGAA 
TATACCTTTT 
AT GAG CT AAA 
AAAGTGTGTG 
CAAAGATTGG 
GTAACTCCAT 
TTCAAAAGTQ 
TGTTGCATTA 
TCAAGTGTTC 
GCATGGATGA 
CATTTGCTTC 
ACTGGTCGGA 
GGTCTTTTCA 
AAAAAAAAAA 



GCTTCAATAT 
TGGAAAGTCT 
TTCAGGATTT 
CTGGTAAATC 
CAAACAGTTT 
AAACAATTCA 
GCGTCAAGAT 
CTTCTCTGGA 
AAACTAAGAA 
AGTTCTCTAT 
C7TGTTGATGT 
TAGGAAAAGC 
ACT AT CGTCG 
ATATGGAAAA 
TTATGACAAG 
CTCAAAGTGG 
TCCAGGAGCA 
TTGGTCTCTG 
CTATTCAAGG 
GTTTCTATTT 
AAACTCTATC 
CACACACCAG 
GTTACCACAG 
TAAGTCACCA 
AAACCTACAQ 
CCATGTTCCA 
CTGACAAATT 
QAGTTGTTTT 
GGATAAACAT 
TAAAATGAAA 
AACCTTTGAG 
AGQAOTTTCA 
CAAAATTTAA 
TQTGCAGATT 
AAA 



CAGTGAACAT 
GAAGGTAAAT 
TGCTGCTTGT 
CCCCGCAOGA 
GCCCCCAGGA 
CCAGCAACGA 
ACTTCAACGC 
TTTTGCTCAG 
GTATGGGAGA 
CAGTGAGAAA 
CTTTCTGTGT 
TACTGTA1TT 
AATGGATTCG 
TGGTAATAAT 
CCCATCACAA 
AAAGGATCAC 
ACGCCAGTGG 
GGTAGTGCTT 
ATGACTCCCT 
AGACACTACA 
AAATGTGATG 
TTTACAQGAA 
AGTTTTTTAA 
TTGACTTCTT 
CATATTCTTC 
CTTGGAAATG 
CAAAAGAGAG 
AACAGATGCC 
TAATTGOAOT 
TATGAGAAGA 
TTTGGTOCCT 
TGTTGGATCT 
CGTTGATQTA 
GCAGTTCTGA 



CT CA ACATTA 
CTTAATATCT 
GQAATAGACA 
GTGAATCTTT 
AATTTGAGGA 
GTCCTTCCTA 
ACAGGGAATG 
AACTTCATCA 
ACAATAATAG 
GTGGCATCGT 
AGCTACATTA 
TTACTTCCGG 
GAGGACGTGT 
GGTTATCATA 
CATTGATAGC 
AOATTTTTGG 
CAACGTAGTG 
TAAGAATGAA 
CCCTTCCTGT 
ACATATGGGG 
GCTAGATTCT 
AGATGCATTT 
ACAAATGAGT 
GGTGCTGTTG 
ACGCAGAGAT 
GOATGCAAAA 
AGAGAGATCT 
AATTACGGTG 
QCAGCTAACA 
TCCTGTCACA 
AAATTTGCAT 
GTCATTATCA 
ATGGAATTCC 
TTCATTTGAA 



AT GAG CATAC 
TTCTGTTGGG 
GAATGAATTA 
TATCATTTGC 
ACTCCCTGAA 
TAGAACAATC 
GATTGTTGGA 
CAAACAATAC 
GATATTTTGA 
GCAAACCTGT 
TCGACCCCTT 
CTCTAATTTT 
ACGATGATGT 
AAGATCATGT 
TGATGTTGAA 
TAGTTTCTGG 
ACTCAGGCGG 
CACAATCACG 
CTATTTTTGT 
TGTTTGTTCC 
AACATATTGC 
TGTGTACAGT 
ATTATAGGAC 
AAAATAATCC 
TTTCATCTAT 
GCCAT CAT AG 
TGAGAGAGAA 
TACAGTTTAA 
TGAGTATCAT 
ATTCTTAGAT 
GAAAGCACAA 
AAAGTGATCA 
AGATGTAGGC 



Seq ID NDi 148 Probe In sequence 
Protein Accession #t AAH120S9-1 



1 
1 

MALVLGSLLL 
IFLYWQFRD 
FCMCRCCKKC 
SRKLADSNFK 
IPVLDEIKfiM 
HPSSETCN8I 
RVQRQTTTW 
YDSYWWLGGL 
SFLFCWI1MI 
SKMKXiTFEQV 
IiGAAGRKNLQ 
LKRDAQTXKT 
KTSSVIIEET 
PLNLFHFGIG 
HVYQIHNPVM 



11 
I 

LGLCGHSP3G 
FPEDTLRKFL 
GGEMHQRQKE 
DLRTLLNETP 
ATAIXETKEA 
RLSLSQLNSN 
AGIKHUIJigi 
VICBLLTLIV 
IWLTPVFOA 



DFAACGIUHM 
IHQQRViiPIE 
KKYGRT11GY 
KATVFIiLPAIr 
TSPSQH 



21 
I 

GQPSSTDAPK 
QKAYESKXDY 
WGPFIiRKCEA 
BQIKYIIiAOOr 
LBKMNSTLKS 
PELEQLPPVD 
GHDIDNVTQR 
IFYYLGLIiCG 
NVEKLICEPY 

ygHjHIjQnsf 

NYPSYIAQTG 
QSLSTLYQSV 
FEHVTJQHIEP 
IFAVXLAKYY 



31 
I 

AWNYELPATN 
DKXVYYEAGI 
IfiLLVICIII 
NTTKDKAFTD 
LHQQfiTQLSS 
ASLEHTVKKVI/ 
LPIQDILSAF 
VCGYDRHATP 
TSKKTiFRVIiD 



KSPAGVNLL6 
XHjQRTGNHL 
SISEKVASCX 
SKMDSEDVYD 



41 
I 

YETgDSHKAG 
II^CCVLGLIxF 
SIGrFYGFVA 
LNSZHSVLGG 
SLTSVKTSliR 
RTDLDGLVQQ 
SVYVNNTESY 
TTRGCVSNTG 
TPYLIJBEDWE 
HTGSISJBELE 
PAYDLEAKAK 
IiERVTRIkAS 
PVATALDTAV 
DVETIPKKNM 



LDFAQNFITN 
UVFLCSY1ID 



Seq ID NO s 149 DMA sequence 

Nucleic Acid Accession # ; NJ4_033049 . 1 

Coding sequence? 28..1S66 



1 
1 

CCACGCGTCC 
CTCCTTTCTG 
GAAACTGCGA 
GAAACTGCTA 
CCCATAATTA 
ACACATAGTT 
GTAAATTCAT 



CAATCATCAG 
GGTCCCAGCA 
CATAATACAA 
AAAGGAAAGG 
GAGAAACATT 
GTATTTGGCA 
CCAAGATCTG 
GCAGAAACCA 
AGXAGCTCAA 
AACCAGACTG 
AGGCCTAACC 
AACGCACAGC 
GTGCCCOGCT 



11 
1 

GAGCAAGAAC 
TAAACACAGC 
CTAGTGGTCC 
GCACCACAGC 
GTACACATAG 
CCTCCACAAT 
TAGCTAOCTC 
CTTCTGAAAC 
GGCCTCCCAC 
ATCCTTGCCA 
GTTTTTGCCT 
TATTCCCTGG 
CCATGGCCTA 
CATCTGTTTA 
AAATGCGTGC 
CAAGTGACAA 
GCAACTTTCT 
CGGATGACTG 



21 
1 

AGCTAAAATG 
CACCAACCAA 
TACAGTAGCT 
AAATACACCT 
TTOCTGCACA 
TCCTATACCT 
TGACATAATC 
ACAAAGTAAC 
TGG CACCGCT 
AGATGATCCC 
GTGTTTAGAA 
GAAGATTTCA 

TCAAGACrra 

TGGACAGACT 
TGATGACAAG 
TGAGAAGACT 
AAACTATGAT 
CCTCAATGGT 



31 

i 

AAAGCCATCA 
GGCAACTCAG 
GCAGCTGATA 
TCXTTCCCAA 
ATtCCTACAC 
ACTGCTGCAG 
ACCdCTTCAT 
AATGAAATGT 
TTATTGGAGA 
TGTGCAGATA 
GQQTAT TACT 
GTGACAGTAT 



41 
I 

TTCATCTl'AC 
CTGATGGTGT 
CCACTQAAAC 
CAGCTACTTG 

ACAGTGAGTC 
CTCCAAATGA 
CXK5GCACCAC 
CCAGCACOCT 
ATTCGTTATG 
ACAACTCTTC 
GAGAAACATT 



GTAATTCTTA 
TTTGTTAATG 
GTGACTGAQA 
TTGACCCTTC 



ACAAGCAATG CTTAATAAAG 
ACCAGGAAGA TOCTAATGGG. 



GCTTCCAGTC 
AAGAGTGGTG 
AACTGCCAAA 



CTGTAAGCAC 
TAACAATAGT 
AAATTAATAA 
GGTGTGATTA 
ATTGCAAATC 
TCAAGTGTCC 
GGGCCCCTGA 
AGTGTGCATT 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3 540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



51 
1 

PIGILPELVH 
IILMPLVGYP 
NHQVHTRIKR 
GIXjDRLKFNI 
SSIJ7DPLCX.V 
GYQSLNDIPD 
IHSNUPTLHE 
GVFLMVGVGL 
YYLSGKLFNK 
SLKVNLHIEL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



51 
I 

TCTTCTTGCT 
AACAACCACA 
TAATT T CCCT 
ACCTGCTCCC 
CATAATTAGT 
AACCACAAAT 
TGGATTAATC 
AGAAGACAAT 
AAACAGCACA 
TQTTAAGCTG 
TACATGTAAG 

tgacccagaa 

GTTTAAAGAT 
ATCTCTGTCA 
AACAATTTTG 

agcaattaga 
ttatggctgt 
tgacctgcaa 



GTGTGCGTBC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
€60 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1010 



WO 03/042661 



GGACTCGACT GTAAGGACAA ATTTCAGCTG ATCCTCACTA TTGTGGGCAC CA.TCG CTGGC 1320 

ATTQTCATTC TCAGCATGAT AATTGCATTG ATTGTCACAG CAAGATCAAA TAACAAAACG 1380 

AAGCATATTG AAGAAGAGAA CTTGATTGAC GAAGACTTTC AAAATCTAAA ACTGCGGTCG 1440 

ACAGGCTTCA CCAATCTTGG AGCAGAAGGG AGCGTCTTTC CTAAGGTCAG GATAACGGCC 1500 

TCCAGAGACA GCCAGATGCA AAATCCCTAT TCAAOACACA GCAGCATGCC CCGCCCTGAC 1560 

TATTAGAATC ATAAGAATGT GGAACCCGCC ATGGCCCCCA ACCAATGTAC AAGCTATTAT 162 0 

TTAGAGTOTT TAGAAAGACT GATGGAGAAG TGAGCACCAG TAAAGATCTG GCCTCCGGGG 1680 

TTTTTCTTCC ATCTGACATC TGCCAGCCTC TCTGAATGGA AQTTGTGAAT GTTTGCAACG 1740 

AATCCAGCTC ACTTGCTAAA TAAQAATCTA TGACATTAAA TGTAGTAGAT GCTATTAGCG 1800 

CTTGTCAGAG AGGTGGTTTT CTTCAATCAG TACAAAGTAC TGAGACAATG GTTAGGGTTG I860 

TTTTCTTAAT TCTTTTCCTG GTAGGGCAAC AAGAACCATT TCCAATCTAG AGGAAAGCTC 1920 

CCCAGCATTG CTTGCTCCTG GGCAAACATT GCTCTTQAGT TAAGTGACCT AATTCCCCTG 1980 

GGAGACATAC GCATCAACTG TGGAGGTCCG AGGGGATGAG AAGGGATACC CAjCCACCTTT 2040 

CAAGGGTCAC AAGCTCACTC TCTGACAAGT CAGAATAGGG ACACTGCTTC TATCCCTCCA 210Q 

ATGGAGAGAT TCTGGCAACC TTTGAACAGC CCAGAGCTTG CAACCTAGCC TCACCCAAGA 2160 

AGACTGGAAA GAQACATATC TCTCAGCTTT TTCAGGAGGC GTGCCTGGGA ATCCAGGAAC 2220 

TTTTTGATGC TAATTAGAAG GCCTGGACTA AAAATGTCCA CTATGGGGTG CACTCTACAG 2280 

TTTTTGAAAT GCTAGGAGGC AGAAGGGGCA GAGAGTAAAA AACATGACCT GGTAGAAGGA 2340 

AOACJAGGCAA AGGAAACTGG GTGGGGAGGA TCAATTAGAG AGGAGGCACC TGGGATCCAC 2400 

CTTCTTCCTT AGGTCCCCTC CTCCATCAGC AAAGGAGCRC TTCTCTAATC ATGCCCTCCC 2460 

GAAGACTGGC TGGGAGAAGG TTTAAAAACA AAAAATCCAG GAGTAAGAGC CTTAGGTCAG 2520 

TTTGAAATTG GAGACAAACT GTCTGGCAAA GGGTGCOAGA GGGAGCTTGT GCTCAGGAGT 2 580 

CCAGCCGTCC AGCCTCGGGG TGTAGGTTTC TGAGGTGTGC CATTGGGGCC TCAGCCTTCT 2640 

CTGGTGACAG AGOCTCAGCT GTGGCCACCA ACACACAACC ACACACACAC AACCACACAC 2700 

ACAAATGGGG GCAACCAGAX CCAGTACAAG CTTTTACAAA TGTTATTAGT GTCCTTTTTT 27 GO 

ATTTCTAATG CCTTGTCCTC TTAAAAGTTA TTTTATTTGr TATTATTATT TOTTCTTGAC 2820 

TGTTAATTGT GAATGGTAAT GCAATAAAGT GCCTTTGTTA GATGGTGAAA AAAAAAAAAA 28 BO 
AAAAAAA 

Seq ID HO; 150 Protein sequence 
Protein Accession #: NP_149038.1 

1 11 21 31 41 51 

I I 1 1 t | 

MKAllHLTLIi ALIiSVNTATN QGtJSADAVTT IBTATflGPTV AAADTTECWF PETASTTANT 60 

PSPPTATSPA PPIISTHS3S TIPTPAPFII STHSS8TIPI PTAADSEfiTT NVNSLATSDI 120 

ITASSPNDGI* ITMVPSETQS NNEMSPTTED NQSBGEPTGT ALLETSTIiNS TGPSNPCQDD 180 

PCADNSLCVK LHKTSPCLCL EGYYYNSSTC KKGKVFPGKI SVTVSETFDP EEKHSMAYQD 240 

LHSEITSIiEK DVFGTSVYGQ TVILTVfiTSL SPRSEMRADD KFVHVTJVTI LAETT8DNEK 300 

TVTEKINKAI RSfiSStfFlJNY DLTIiRCDYYG OSQTADDCLN GLACDCXfiDL QRPNPQSPFC 360 

VASSI*KCFDA CNAQHKQCLI XKSGGAPECA CVPGYQEDAN GNCQKCAFGY SGLDCKDKFQ 420 

LILTIVGTIA GXVILSMXIA IiIVTARSNNK TKHIEESNIil DEDPQKLKLR STGFTSTLGAE 480 
GSVFPKVHIT A3RDSQMQKP YSRHSSKPfcP DY 

fieg; id NO s X51 DMA sequence 

Nucleic Acid Accession #: Bos sequence 

Coding sequence: 87. .514 

1 11 21 31 41 51 

I \ I I I I 

AGTACGCGGG GOCGCGTTTT GCCTOCGCAG CAGCTCTGGG CT C T T CTCRG CTGCGOGAGC 60 

AaCTGCTOCA ATGCCCCGGA GTGGCCATGG GCGCCCCGGA CTGGTGGGftC CAGCTGCAGG 120 

CTGGTAGCTC GGAGGTGGAC TGGTGCGAGG ACAACXACAC CATCGTGCCT GCTATCGCCG 180 

AGTTCTACAA CACGATCAGC AATQT CT TAT TTTTCATTTT ACCGCCCATC TGCATGTGCT 240 

TGTTTCGTCA GTATGCAACA TGCTTCAACA GTGGCATCTA CTrAATCTGG ACTCTTTTGG 300 

tfTGTAGTOGG AATTGGATCC GTCTACTTCC ATGCAACCCT TAGTTTCTTG GGTCAGATGC 360 

TTGATGAACT TGCAGTCCTT TGGGTTCTGA TGTGTGCTTT GGCCATGTGG TTOOCCAGAA 420 

GGTATCTACC AAAGATCTTT CGGAATGACC GGGGTAGGTT CAAGGTGGTG GTCAGTGTCC 480 

TGTCTGCGGT TAGGACGTGC CTGGCATTTG TCAAGCCTGC CATCAACAAC ATCTCTCTGA 540 

TGACCCTGGG AGTTCCTTGG ACTGCACTGC TCATCGCAGA GCTAAAGAGG TGTGACAACA 600 

TGCGTGTGTr «EAAGCTGGGC CTCTTCTCGG GCCTCTGGTG GACCCTGGCC CTGTTCTGCT 660 

ggatcagtga ccgagctttc tgcgagctgc tgtcatcctt caacttcccc TACCTGCACT 720 

GCATGTGGCA CATCCTCATC TOCCTTOCTG CCTACCTGGG CTGTGTATGC TTTGCCTACZT 780 

TTGATGCTGC CTCAGAGATT CCTGAGCAAG GOCCTGTCAT CAAGTTCTGG CCCAATGAGA 840 

AATGGGCCTT CATTGGTGTC CCCTATGTGT CCCTCCTGTG TGCCAACAAG AAATCATCAG 900 

TCAAGATCAC GTGATGGCAA GATGGTGGCT GGCTTCTCTG CXTArCGCCC CTCATGCAGT 960 

GCK3CTTCCTT TGCTAGGAAG ACAGCCAAGG GAGTXCGAAT AGTTGGGGTG TGGGCTATCT 1020 

TTTCAAAAAT CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTQTA TGTAGGGATT 1080 

TAAACTTTGT CATATGGTAC AAATATTCCC TGCCCCCCTG CAGTTTOOCA TTTGTCTTTC 1140 

AGTATGTTAA TATTTTTGTG CCATACTGGT TTTAAACTTT CATGTTGTCA CATCTGTTAA 1200 

TCTTTTCTTT AGGATTTCTG GATTTTQTGT AATTTTTAAA AAGGTCCOCT OCTCCTCOCT Z260 

AATGTGTCTG TGGACCACCT GGATTCCACT GTACAAGGGG AAAAGTGTCT ATTCCTTTCC 1320 

CAAAGATGGA AAATGGAGGG CTTAQGOACA CTAGATGCAT CTTTCTCASC ATCACTTCCA 1380 

GATGCAGTGA CTTGTTGQGC TGCGTCCTTA ATGGCCATGG CAGAGCAGTC CCTTGGGGGA 1440 

TCCAGOCCTG TACAATGCAT CTCTTCCTGG AGAAAGCTGG CCTGCTCCAG ACCCCACCAT 1500 

TCCCAGGCGC CCTTGGAGTG GACTCTACTG ATOACAGACA GACCCTCTGA GAGACAAGAC 1560 

CCTCTQACTC TGTGATGGAA GATtSCCAQAG ATTTTCCTTT GGGGTAATTG TCCTTAAACA 1620 

AAACCAAACA GATQAAACAC ACACAGGACT TGTGGCTAAA AAGGCTAGTT TTTCACT TGC 1680 

ATTXCrCAAC TAACCCAGGT TTTACATGCA TCTGTGAATC CTTTTACTAC TACCTCTGTG 1740 

GAGAGATGOA GAGACTTCAG ATAAACGTGA AGCTAATGAG TAAAAGCCTC TCTGCCAAAA lfiOO 

CCTACACTCC ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTTTAAAXA CTGAGCACAA 1860 

TTTTTAAATA CTGACATCAC TTCCTCTTCC CCCTCCCACX: CCAGCTCAGC AGCCTCAAAT 1920 

CTACAGAGAA GAAGAATTAT GGCATGAACA TXGCCACAGA CCCACCATCT TTAAGACTTG 1980 

ACCTCTGTAA GTTTACCAAA GGGCTCCTCA CAATTGTGGT GGGGGTTCTG GTTCAAAATT 2040 



1011 



WO 03/042661 



PCTAJS02/36810 



TGGAGCAAAC ATGAAGTTTT TGGAAACGTT TTCTCATTTG AAGCCTCCAG TATGCTGTAC 2100 

TATTCTGGAA ATTACCTTCA AGAGTCTCAC TTCTTGTTTC TGTTGTGTTT TCTGTGGGCA 2160 

TCATGTTCTT CACGCTTGCA GTAOAAGGTG CTTTCTCGGT TTCCCAGAGT ATCCAACGGC 2220 

TCACCTTTCT CAAGTHCTGG CAGTAGCTAT GCACTCACGG GCTGGTTTGG GTC3CTGGTG 2280 

5 CAGCAGCGCA AATCTGTTGC CTTCTGAATT TTTCTCRCCT AATGTGACAC TGGCTACAAT 2340 

GAATCTTCTC TTCATCGGGC TGAATGAAAG ATTCAAGAAC CATCTTCAAG GTGCATGGTG 2400 

GGAATTATCA ACCTCAGGGA TACTCATTTT AACTCAGGCG TGTCCTGCTT TGTAACATTC 2460 

CATTGTTGGG AGAGGGCAGG ACAGGT&TGT TCTTCTGTQG GCAGGAGTCA TGTCACTGTC 2520 

CTACATATGT AAGAGTTGGG AAGGTGACGA TTTTTGACAC ATCCAGGAAC TCTTACTCTA 2S80 

10 GTTAGAATTT GTACCAGATC CAAGGTGAAA ACCCCAATAA GCAACTGAAT TTAGAGTTTA 2640 

AAAATGAATG ACTTTATGCT ACATCTGTGG TTATCAAATT ATATAGGTTG TTGAGAAGCA 2700 

GAACGCTGTT TGTAGTAAGA AATCT T TGTG GAACCCCAGT GTGTGAAGTA AATTGTATGT 2760 

TATTAAATTT ATTTAAGGTT AAATTTATGG CATTTACTTA AT AAT AT AT G AGGTGGTGAA 2820 

AATGCAAATT AACAAATTGG TAATTTCCAA GGTAGAAAAA TTAGGTGTTG AATGAATGTA 2880 

15 TATGTTGGTT TATTATAGTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGG 2940 

GTCTTGCTCT ATTACCCAGG CTGGAGTGCA GTGGTGCAGT CATGACTGAC TQTAGCCTCT 3000 

GTCTCCCAGG CTCAAGTGAT CCTCTCACCT CAGCCTCCGC AGGAGCTGGG AGTACTGGTG 3 060 

CGTGCCTCCA CTCCCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCATTTT 3120 

GCCCAGGCTG GTCTCAAACT TCTGGGCTCA AGAGATTCGC CCGCCTCGGC CTCCCAAAGT 3180 

20 GCTGGGAATA GGCATGAGCC GTCACGCCTG GCCTAAAAAA T ATTT T T AAA ATGATCTTTG 3240 

AATTAAATAT TCGTAGAATT TCTAATGTAT CTCTTTGAGA CCTAGGAGOT TGATGGAACA 33 DO 

GAACTGCTGT TAAGTCCTTT OGGTT lCAAG TCTAGAATTT TTTAAAGGCA AATATCAGCT 3360 

CATTCTTATT TTAGATTGAC CTTATCAGGC ATGGATTCTG GTCTCATCTA CTTTATGQTA 3420 

TAAATGCTCC AAGGTAGGGG GTTTGGTATA TATTTTAAGC CCGGCTTTTT TTTTTTTTTT 3480 

25 TTTTTTTTTT TTTTTAATGT GAGAAGCAGA ATGTGCTTCT AGAAACTGGT TTTAAAOAGA 3540 

TGAGCTGAGA AAGAAATGTG GAATGGAGTA TATTTGAGGA GGACAAAACA TAAjCTTCACT 3600 

TTTGAACRGA AATCSJCTCTA GCTTGCCAGC ATGGGATGTA AACCAAGAGA GTAGAAATAT 3660 

ACCCATCTTA TTTTAAGTTG GGTTTATGGC ATCGCTCATA TATGTAAAAG CACTACAAAC 3720 

TCTTTAAAGA AAATTGGGAA ACTACAGAGA AGTCAAAGAA AAAAAAAAGT AACCCATATT 3780 

30 TCTATTGCCC AGGTATAATC CTTGTTAATA TTTTGGTTTG GTCTCCTCTT TTTTTCCCCC 3840 

AATATAGTTG TAAATAAATG ATGTCTTTCA GAOTTGACAT TTATCCTGTA 'gCTTGAATGG 3900 

CATGTAAATG CCAGTTGTAT ATTTTTTCAT GAAGTGTAGG TTTGOAATAC ACTAGAGTTA 3960 

GCTATATGCT TGAATGCTGA TCACTGGATT CTGAGACTGA CTACTGAGTC TACCTTTTTA 4020 

ATCAAGCCTA ACATGAATGG GCTCCAAAAA GTAATGAATG TAATTGTACT TTTTGATGTG 40 B0 

35 OCTCTGCACT TGGCTTGGTG AGTOATCATA AATAGCTGTT AAATATGTGA CTTTACAGAT 4140 

TTTGATATGT TCAGATTGTA AAAAATGAAT AGTTTATTTC ATTAATTGAT GGGCAGTCAA 4200 
GAATCTCCCT CC 
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Soq ID KO; 1S2 Protein aemience 
Protein Accession #; Eos sequence 

1 11 21 31 41 51 

I I I 1 I 1 

KGAPHWWDQI* QAGSSEVDHC EDNYTIVPAI AEFYKTISNV LFFIIiPPTOfl clfrqyatcf 
NSGIYLIWTIj I*VWGIGSVY FHATIiSFIKSO; MLDELAVLWV IMCALAMWFP RRYLPKIFRN 
HRG5LFKWV3 VI»SAVTTCIxA FVKPAUUNIS IiMTLGVPCTA LLIAE&KKCD BMEVFKLGLF 
SSLWWTIiALF CHISDRAFCB LLSSFNFPYI, HCMWHIIjICI. AAYLGCVCFA YFDAASEIFE 
OGPVIKFWPW EKHAFIGVFY VSI*LCANKKS SVKIT 



Seq XD KQ: 153 DMA sequence 
nucleic Acid Accession #i HI 
Coding sequence : 167 - . 676 



001432-1 



1 11 21 31 41 

I I I I I 

TCACTTGCCT GATATTTCCA GTGTCAOAGG GACACAGGCA ACGTGGGGTC 
TGACAGCCGC TCTCCAGCCA CTGCCGCGAG CCOGTCTGCT CTOGCCCTGC 
TCCGCAGCCG CCCTCCGCCA AGCCCCAGCG CCCGCTCCCA TCGCCGATGA 
GAGGATGGAG ATGCTCTGTG CCGGCAGGGT CCCTGCGCTG CTGCTCTGCC 
TCTTCTACAG GCAGTCCTCA GTACAACTGT GATTOCATCA TGTATOCCAG 
TGATAACTGC ACAGCTTTAG TTCAGACAGA AGACAATCCA CGTGTGGCTC 
AACAAAGTGT AGCTCTGACA TGAATGGCTA TTGTTTGCAT GGACAGTGCA 
GGACATGAGT CAAAACTACT GCAGGTGTGA AGTGGGTTAT ACTGGTGTCC 
CTTCTTTTTA ACCGTCCACC AACCTTTAAG CAAAGAGTAT GTGGCTTTGA 
TATTATTTTG TTTCTTATCA CAGTCGTQGG TTCCACATAT TATTTCTGCA 
AAATCGAAAA AGTAAAGAAC CAAAGAAGGA ATATGAGAGA GTTACCTCAG 
GTTaCCGCAA GTCTGAATGG CGCCATCAAA CTTATCGGCA GGQATAACAG 
TAATATTAAT ATTCCATTTT ATTAATAATA TTTATGTTGG GTCAAGTGTT 
CACTGTATTT TAATGTACTT GAAAAATGTT TTTATTTTTG TTTTATTTTT 
TTGCTAATGT ATAATGTGCA GAAAATATTT AATATCAAAA GAAAATTGAT 
AAGTAATTTC CTGAGCTAAA TGCTTCATTG AAAGCTTCAA AGTTTATATG 
AGTGCTTAGA AGTAAGCAAT TCCCAGGTCA TAGCTCAAGA ATTGTTAGCA 
TTCTGTAAGC CTATATATAT AGTCAAATCG ATTTAGTAAG TATGTTTTTT 
AATCAGTGAT AATTGGTTTG ACTGTACCAT GGTTTGATAT GTAGTTGGCA 
ATATATTAAA ACAATAATGC AATTAGAATT TGGGAGAAGC AAATATAGGT 
ACACTACACA TTTGAAACAA GCTAACCCTG GGGAGTCTAT GGTCTCTTCA 
AQCTATAATT CTGTTATATG AGGGGCAGTG GACAGTTCCC TATGCCAACT 
ACAGGTACTA GTGACTCATC TACCAGATTC TGCCTATGTA AAATGAATTG 
TTCTGTAATC TTTTATTTAA GTAGTGGGCA TTTCATAGCT TCACAATGTT 
ATATTACAAC ATTTATGTGA GGTAATTATT GCTCAACAGA CAATTAGAAA 
CTTGAAGCCT AAATTTGTGC TTTTTAAGAA TATTTTTAGA CTATTTCTTT 
TTTGCTGAAT TCTAACATTA AATCACAGCC CAAAATTTGA TGGACTAATT 
AATATATGAA GACAATAATT CTACATGTTG TCTTAAGATG GAAATACAGT 
TTTATTCAAG GAAGTTTTAA CTTTAATACA GCTCAGTAAA 



60 
120 
LBO 
240 



51 
1 

CCTTCTAGGC 
CCGTGCACTC 
GCGCGGGGAG 
TGGGTTTCCA 
GAGAGTCCAG 
AAGTGTCAAT 
TCTATC1GGT 
GATGTGAACA 
CCGTGATTCT 
GATGGTACAG 
GGGATCCAGA 
TGTGCCTGGT 
AGGTCAATAA 
GAGAGACTAT 

atttttatac 

GCTGGTGCAC 
AATGACAGAT 
ATGTTCCTCA 
CCATGGTATG 
CCTGTGTTAA 
CTCAGGTCTC 
CACGACTCCT 
AAAAAGAATT 
CCTTTTTTGT 
AAAGTCCACA 
TTATAGGGGG 
ATTATTTTAA 
TATTTCATCT 
TAGAATGTAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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5 
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AGTTATGTAT 
TGGTGTATTT 
CACACCTTCC 
TTTTTQCJAAG 
AGGAGTATTT 
GGAGGCTCCT 
TCAGGGAjGQA 
ATAATAAGCC 
CATGCATTAT 
TTACTATTAG 
TATATATATT 
GTCATTTGGT 
AATT AG AT CT 
AGAACACATT 
GATTCAACCA 
ATTATAACAA 
CAGGTTGGAG 
GATCCTCCIG 
CTAATTTTTG 
CTCCTAACCT 
CCACCGCACG 
AGGAGGATGT 

CAAGGTACAA 
TGATCTTGGC 
ATTTCATTTA 
GCATGTCAAT 
AAGCTAATCC 
CACATTAC.TT 
GAQTAAACAT 
TTAGTTCACA 
AAQQGOCTAT 
AATTTATGGT 
ATATTAAGTT 
TAACCTTAAA 
T QAATATTTT 
TGGTATGACT 
ACCACCAGGT 
GTTAAATTCA 
OCAATATCCA 
TGACCGTTAA 
TATTTCTGGT 
TCAAATTTGA 
TTGCACTTTT 
AACATAAAQT 



GGGAATTtSAA. 
CACTGAC 



TTAAAGTTGT 
TTCCAAATGA 
TGTGGAGGCT 
AGACAAAGAT 
TTGGGCTATT 
TCATOSAATG 
TCTGTTCTTC 
ATGGCTGACC 
ATACCCTGGT 
GAAAATAAAC 
CATATACAAA 
GCTCTGCTAA 
AAACAGTTAT 
TTAAATACTC 
ACCAAGGAOG 
CTATTTACTT 
TGCAGTGGCA 
CCTCAGCCTC 
TATTTTTAGT 
TGAGATCCAC 
TAGCATTTAC 
GAATAGGTTA 
ATTTTAGTAT 
CTGTATTTAT 
TGTTTCAAAA 
CAAAAGATTT 
TCATAAAAAC 
CTAAAATAGT 
TTTATGGGTG 
AAAAACTGAT 
CTTAGTCTAA 
GCATATTCAA 
CTTCAACCAA 
CTATAACTAG 
ATTGATATTA 
TTGCTCTAAT 
ACGTCTTAAG 
ATCAAATCAA 
GTAATGCAGT 
TTCTGTAGAC 
AAAATTCTTG 
GTATATTTTA 
TQATGTTAAA 
TAAGTTTGAA 
TGTATTGCAT 
GTATCAATTT 
GAGAAAATAG 
AAAAATTGAA 



ATCTTGACAC 
AAAATCTCAA 
GAGATGAAAA 
TTCTTCTGCA 
GCATAAGGAG 
CTAAACCTTT 
CTCTACGTTT 
TCTGGAGCAC 
GCAATCACAC 
AGACAAAAAC 
CATGTATGTA 
CCATTTATAT 



TACTTACAGT 
GAAAATGCTT 
TTTTTTTTTT 
CGATCTCGGC 
CTGAGCAGCT 
AGAGGCGGGG 
CCTCCACAGC 
ATTAGGTATT 
TATGCRAGCA 
GTGCAGGGGG 
TAACGCTTAC 
AAATCTATTG 
ATTGTAAGCC 
AAGTCATTTT 
TAGATCTTTG 
AAAATAAGAC 
TTGCTCTCAG 
CTCCCCCATT 
TGTATTGAGA 
GGCCACATAA 
TACTCAAGGT 
TATTAAACAT 



TCAGATTTTT 
CTTTTATGCA 
TTTTAAAAAC 
TTXTGAAAAA 
TTCATATGQG 
ATGTTTTTAA 
TGTAATATAA 
GAGCCATTTT 
GCAATTTAAA 

ATAATTTACA 
ATAAATAAAA 



AGGAAATGGG 
TTGAAAGCTT 
CTAGGGCTCA 
CTCTGAGCCC 
CCACTGCTGC 
GAGTAGAGTC 
ATCCTG G CAT 
CAGGTGCCAG 
GACTGTCATC 
AAQTAAATAT 
TACATGACCT 
AAAACTTAAA 
TATTTAATAT 
TGGCCCTCTG 
AAAAAATAAT 
CTTTTTQAGA 
TCACTGCAAC 
GGOACTACAG 
TTTCACCATG 
CTCCCAAACT 
ACAAGTAATG 
CTATGCCCTT 
GCGATCTGGG 
XA&ATGTGAG 
ACTTTTCAAT 
TCTCAATCTT 
TGTATTTTTC 
TAAATGCATA 
AAAAATAATA 
CTGATGTGTC 
TTACAGATTT 
ACCAAAGCAA 
TAACCCAGTT 
TTAAGCTTAA 
ACATAATACA 
AAAGTCAAAG 
ATTTATGAGT 
GCAAATATAT 
CTQTATCTGA 
AAAGTTTTTA 
AGAAGGGCGA 
AAAGAGTAAT 
TGTATTTTCT 
GGTAAACGGT 
GTAACTTATT 
AATATCAGCT 
AGATATTATT 
ATGCRTTBAA 



AAAAAACTTA 
TTAAAATGTA 
TTXTCCTGAC 
ATAGGTCTCA 
CAOCACTTTT 
TCCCTGGATC 
GTGCTAGGGT 
GACTTGTCTC 
TAAAGTCCTG 
ATATGGTCCT 
TAAT9GATCA 
AACAAGAGAA 
AGCTGAAGTC 
TGGTTAGTTC 
ACAACAACAA 
TGGAGTCTCG 
CTCACCTCCC 
GCGCATGCCA 
TTGGCCAGGA 
GCIGGGATTA 
TAAAGATGAT 
TTATATAAGT 
AATCAGTCOC 
GAGAGTCTGA 
AAATCAQCTQ 
GGTTTTTCAG 
ATCTTTAAGA 
TTAAATAATA 
GTTTTAGTGA 
CTGTACACAG 
CTCACTATAT 
CCACAAATGC 
AACTTACTOT 
AATTAAGATT 
ATGTAACTCC 
TGATGGGAGA 
CTTTGAGACT 
GATTCTAGTG 
CCCACTTTGT 
ATTTGATGCC 
GTAATGACTT 
TTCATTTAAA 
TTTTATTTTG 
TTTTATTAAA 
TGACTATGAA 
TTGATAATTG 
AATTTTTATT 
CATCTTGCAT 



AAAATTAATA 
GAAACTTAAA 
ATTTGTXTAT 
GAGAGTTAAT 
GGATTTIATG 
ACATACCAGG 
AAACGAAGGC 
CATGTGTATC 
GCCCTGGCCC 
ATACATATTG 
TAGAATTGCA 
AAGAAAAATC 
AAAATATGTA 
CACATCTGTG 
CAAAAAATAC 
CTCTGTTGCC 
GGGTTCAAGA 
CCATGCCCAG 
TGGTCTCAAT 
CAGaCGTGAG 
TTAAGTATAC 
GACTTGAACA 
CTGTGGATAC 
ATATTTTCAG 
CAATCCATTT 
TTGATCTTAA 
ATGCTTAAAA 
AAGTATGACC 
GGATGGTGCT 
TGGGAAGATT 
ATATTTCTAG 
ATAAATGCAT 
TTAACCAGGA 
TCCTTAACCT 
ACTGTTCTCC 
ATTGGTATAC 
AAATTCAATC 
TCTGACTTTT 
AATTTTTGCT 
CAATATATTC 
QTACAAACAG 
TATCTGTTAT 
CACTCTGTAA 
GATGCTATGG 
TATTATCGGA 
TGTACCTTAA 
TATTTTTCTT 
TCAAAATCTT 



Scg ZD WO; 154 Protein secra-ence 
Protein Accession #= NP_001423-1 

1 11 21 31 41 51 

1 I I I I | 

MTAGRRHEMX* CAGRVPALLL CLGHHLLQAV I*STTVIPSCJ PGESSTOJCTA IrVQTEDNPRV 
AQVSITKCBS DMNGYCLHGQ C I Y"LVDMS QN YCRCEUOYTG VRCEHFFLTV EQPLSKBXVA 
LTVlLIILFfc ITWGSTYYF CRWYKBEXSK EPKKKYERVT SGdEHLPQV 

Seq ID NO* 155 DMA sequence 

Nucleic Ada Accession #s NW_013282.2 

Coding sequence: 85.. 2466 



l 
I 

CGACTCCTTA 



ACCCACACGG 
CAGGAGCTGT 
GAGGACGGCC 

gtccqccaga 

ACCGACTCCG 
GGGGOGGCCX3 
GGGCTGTACA 
GAGGCGCAGG 
AOGTCCAGGC 
GAGAACGGCQ 
AAGTGGCAGG 
AAGGAGCGGG 
CGGGAACTCT 
TTCGTGGACG 
CCCATGAGAC 
TGOOGGGTCT 
TGCGATGAGT 



11 
I 

GAGCATGGCA 
TCAGOGCOGA 

tggactcgct 
tgcacgtgga 
ataccctctt 

GCCTCXJTGCT 
GcraCTGCCT 
AGACTGACAG 
AGGTCAATGA 
TGGTGAGQGT 
asGCGCTGGA 
TGGTCCAGAT 
ACCTBGAGGT 
GCTTCrGGTA 
ACGCCAACGT 
AAGTCTTCAA 
GGAAGAGCGG 



GOGACATGGC 



21 
I 

TGGCTCAGAG 
CACCATGTGG 
GTCCAGGCTG 
GCCAGGC2CTG 
CGACTAGGAG 
OOCOCACAGC 
GGGC3CAGAGT 
CAGGCCAGCC 
GTACGTCGAT 
GAGGCGGAAG 
GGAGGACGTC 
GAACTCCAGG 
GGGCCAGGTG 
CGACGCGGAG 



GCCGTCCTGC 
CCTGTGCGGG 
CTTOCaCATG 



31 
I 

gtgctggtaa 
atccaggttc 
accaaggtgg 

CAGAGGCTGT 
GTCCGCCTGA 
ACCAAGGAGC 
GAGTCAGACA 

gatgaggaca 

GCTCGGGACA 
GCCCCCICCC 
ATTTACCAOG 
GAOGTCCGAG 
GTCATGCTCA 
ATCTCCAGGA 
GATGATTCTC 
CCQOGTGAAG 
AAGCACTGCA 
GGCCZOGCAGG 
TACTGCCTGG 



41 
I 

AACTGATGGG 
GGACCATGGA 
AGGAGCTGAG 
TCTACAGGGG 
ATGACACCAT 
GGGACrCCGA 
AGTCCTCCAC 
TGTGGGATGA 
CGAACATGGG 
GGGAOGAGCC 
TGAAATACGA 

ACTACAACCC 
AGCQOC3AGAC 
TOAAOGACTG 
GGAGCCCCAT 
AGGACGAOGT 
ACCCCOACAA 
AOCGGCCCCT 



51 

1 

GGTTTTTGCT 



GCGGAAGATC 
CAAACAGATG 
CCAGCTCCTG 
GCTCTCCGAC 
CCAC3GGCGAG 
GACGGAATTG 
GGCGTGGTTT 
CTGCAGCrCC 
CGACTACCCG 
CACCATCATC 
CGACAACCCC 
CAGGACGGCG 
TCGGATCATC 
GGTTGACAAC 
GAACAGACTC 
GCAGCTCATG 



iaoo 
1060 
1920 
1980 
2040 
2100 
2160 
2220 
2230 
2340 
2400 
2460 
2520 
2580 
2G40 
2700 
2760 
2820 
2BB0 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3 900 
3 960 
4020 
4080 
4140 
4200 
4260 
4320 
43B0 
4440 
4500 
4560 
4620 



60 
120 



60 
120 
180 
240 
300 
360 
420 
4 BO 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



1013 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CCCAGCGAGG ACGAGTGGTA CTGCCCTGAG TGCCGGAATG ATGCCAGCGA GGTGGTACTG 
GCGGGAGAGC GGCTGAGAGA GAGCAAGAAG AAGGCGAAGA TGGCCTOGGC CACATCGTCC 
TCACAGCGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 
GTCCCGTCCA ACCACTACGG ACCCATCCCG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 
CGAGTCCAGG TCAGCGAGTC GGGTGTCCAT OGGCCCCACG TGGCTGGCAT ACAGGGCCGG 
AGCAACGACG GAGCGTACTC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 
GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC TTTCCGGCAA CAAGAGGACC 
GCGGAACAGT CTTGTGATCA GAAACTCACC AACACCAACA GGGCGCTGGC TCTCAACTGC 
TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGOCGGTC GGGGAAGCCG 
GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA AGTACGCCCC CGCTGAGGGC 
AACCGCTACG ATGGCATGTA CAAGGTTGTG AAATACTGGC CCGAGAAGGG GAAGTCCGGG 
TTTCTCGTGr GGCGCTACCT TCTGCGGAGG GACGATGATG AGCCTGGCCC TTGGAGGAAG 
GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 
GAAGCCCTGG CCAAGCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 
CAGSAGGGGG GCTTCGCGTC CCCCAGGAOG GGCAAGGGCA AGTGGAAGOG GAAGTOGGCA 
GGAGGTGGCC CGAGCAGGGC CGGGTCCCOG CGCCGGACAT CCAAGAAAAC CAAGGTGGAG 
CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 
CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 
TTGTTOCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 
CGGCCCATCA CGACCGTQTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATOCTTT 
CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 
CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCX3GGCTA CGGCAATGGC 
OGGTGATCTC CAAGCACTTC TCGACAGGOG TTTTGCTGAA AACGTGTCGG AGGGCTCGTT 
CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 
CCTAAAAAGO TTTGTCTTCC TTTTTTTTTA TTTTTAT T TT TCAAATCTAT ACATTTTCAG 
GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 
CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 
ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 

CAAcrcrrrA agaaggcgac aggatcagtc cttctctagg gttctggccc ccaaggtcag 

AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC AGTOACATGa TTCCCCGTGG 
TGGCCCGTGG CAGCCCGTGG catggcgtgg CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 
AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC CCCACCAGAC 
TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG GAGTTGGGGC 
CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GOCAGGOGTO TGACTGACGC 
TGTCCGACGA AGGCGGCCAC GGACGGAGGC CAGCAGACGA AGTCACGTGC AAGTGGCTTT 
GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 
GTCAACCAGA TTCIAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 
GGAAOCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 
CTTACAAGAG GGTTTTTTTT TAATTTTTTT ITCTCTTAA!! GAACACATTT TCTAAATGAA 
TTITTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA AOGTTAGGGT TTGGTTaTTT 
TTGTTTTTGT A1TTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 
TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 
GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 
AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTQAAAAAA AAAAAAAA 

Seq ID HO: 156 Prptein, sequence 
Protein Accession #? HP_037414.2 



i 11 
I 1 

MWlQVRTMDG RQTETVDSIiS 
YEVRLNDTIQ LLVRQ SLVliP 
PAKKDMWDET ELGLYKVNEY 
OVIYEVKYDD YPENGWQKK 



SCK3CKDDVW RLCRVCAGHL 



IPGXPVGTMW RFRVQVSESG 
SGGRDIiSGUK. RTAEQSCDQK 
GGKN3KYAPA EGNRYDGIYK 
LGLTKQYPEG YLKALANRSR 

sprhtskktk vepysltaqq 
TPQCICCQEEf vfrpittvoq 



21 

I 

KUTKVEBIjRR 
HSTKBRDSEL 
VDARDTHMGA 
SRDVRARART 
IiGDDSLHDCR 
CGGRQDPDKQ 
KKKAKKASAT 
VHRPKVAGIH 
LTNTNRAIiAIi 
WKYWPEKGK 



Seq ID NO: 157 DMA sequence 

Nucleic Acid Accession #: NM_0007Sfi- 

Coding sequence: 186. .776 



1 
I 

AGAAACTCAG 
GAGAQTGGGA 
TCTCTGCAGA 
CTAACATGCG 
GCCXjGCCATG 
AGCACCCTCA 
OOCAGGCTCG 
TCAACAAGAG 
GCAGCCGCCC 
TGCTGCTGCC 
ATGCCCTCGG 
TCTCCCTGOA 
AGTTAGCACA 



11 
I 

AGACCAAGTC 
ACAGTAAAQA 
GAGGOGGCAG 
GCTGCCGCTG 
CAGGGCGCTC 
GOCCTTGGAT 
GCCGGTCCTG 
CCCGGCCGCT 
TTCGCCGGAA 
TCGGOGCTCG 
CGGCCACCAG 
TCTCACCTTC 
GCAAGCTCAC 



21 

i 

CATTGAGAGA 
GAAAGGAAGA 
CACCC30GCTC 
CTTGTGTCCG 
CTGAGCCGCG 
TTCTTCCAGC 
CTCCGCATGG 
CCCCTTTCGC 
CAGGCGACCG 
CTGGACAGCC 
GAGGCACCGG 
CACCTCCTCC 
AGCAACAGGA 



31 
I 

CTGAGGGGAA 
CAACCTCCAG 
ACCTGOGAAG 
CGGGAGTCCT 
GGCCGGTCCC 
CGCGGCCGCA 
GAGAGGAGTA 
CCGCCTCCTC 
CCAACTTTTT 
CCGCGGCTCT 



41 
I 

AGAGAGGAGA 
AGAAAGCCCC 
CGCCTGGGAA 

GGGAGCTCGG 
GTCCOAGCAG 
CTTCCTCCGC 
GCTCXTTCGCC 
CCGCGTGTTG 



1200 
1260 
1320 
13B0 
1440 
1500 
1560 
1620 
1680 
1740 
1B00 
I860 
1920 
1900 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
25 B0 
2640 
2700 
2760 
2820 
2B80 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
34B0 
3540 
3600 
3660 
3720 
37BD 



31 41 51 

i I I 

KIQEIiFHVEP GXiQRIiFYRGK QHEDGRTIiFD 
etmjSGCCLG OSESDKSSTH GEAAAETDSR 
H7BAQWRVT RKAP3RDEPC SSTSRPALEE 
IIKWQDLEVG QWMLNYNPD NPKERGPWYD 

LMCDECDMAF KIYCLDPPLS SVPSKDEWYC 
SSSQKUWGKG MACVtSlTKEC TXVPfiNHYGP 
GRSMDGAYSL VLAGGYBDDV DEGNFPTYTG 
NCFAPJNDQE GAEAKDWRSG KPVRWHNVK 
SGFLVWRYU* EKDDDEPGPW TKBGKDRIKK 
EQQBGGFASP KTGKGKWKRK SAGGGPBRAG 
AKLWNEVIA3 LKDHPASGSP PQI.FL3KVEB 
HHVCKDCLDR SFRAQVFSCP ACRYDLGRSY AMQVNQPLQT 



GGGAAGTCTT 
AACTCATGGA 



GCGGTCCGAG 
GGAAATGGCC 
GATEATTGGG 



60 
120 
180 
240 
300 
360 
420 
4 BO 
540 
600 
660 
720 



51 
I 

GAAAGAAAAA 
CGGAGACGTC 
GCGAGTGCCC 
CTCCTGCCCT 
CAGGCGCCGC 
CCCCAGCAGC 
CTGGGGAACC 
GGAGGCAGCG 
CTGCAGCAGC 



GAGCCTCCCA 
AGGGCCGAGC 
AAATAAAACG 



60 
120 
1B0 
240 
300 
360 
420 
400 
540 
600 
660 
720 
780 



1014 



WO 03/042661 



PCT/US02/36810 



10 
15 

20 
25 
30 
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GTGCGTTTGG 
TCTGTACCAT 
GGGAGAGAGG 
GCAGCAACAC 
TGGTGTTTTA 
AGTCACTCAA 
TAAGACTAAA 
ACATCTTTCC 
TAATAAAAGT 



CCAAAAAGAA 
AGOGCTGCTC 
GAGAGAGCCT 
AATGTTATTC 
AAGAGAATGT 
TTGTTTTTGT 
TCTGTAAGCT 
AGTCCTTGGT 
1QCAAAGGT 



TCTGCATTTA 
TTATGCCATT 
ATACCCCTTA 
GTTTTGTCTA 
AGACCTGTGA 
TGTGGTCTGA 
CTTTGAAACA 
CAGTTTGGTT 



GCACAAAAAA 
TGTTTATTTT 
CTTAGCATGC 
CGTTTAGTTT 
GAAAACGTTT 
GCCAAAGAGA 
ACTTTCTCTT 

GTGTAAGAGA 



AATTTAAAAA 
TATATAGCTT 
ACAAAGTGTA 
COGTTTCCAG 
TGTTTGAAAA 
ATGCCATTCT 
GTAAACGTTT 
ATGTTGAATA 



AATACAGTAT 
OAAACATAGA 
TTCACGTGCA 
QTGTTTATAG 
AGCAGACAGA 
CTTGGGTGGG 
CAGTAATAAA 
CTTATATTTT 



fieq n> NO: 15B Protein sequence 
Protein Accession #: NP_00D747.l 

1 11 21 31 41 SI 

I I ? ] | | 

MRIiPLtiVSAG VLLVAIiIiPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPP PQSEQFQQPQ. 
ARPVLTiRMGE EYFLRLGWLN KSPAAPLSPA SSLLAGGSQS JRPSPBQATAH FFRVLLQQIiE* 
X.PKRSLDSPA ALAEHGARNA LGGHQEAPER ERHfiEEPPIS LDLTFHIiljRB VLEMARAEOIi 
AQQAHSNRKL MEIIGK 

seq[ ID WD: 159 DNA sequence 

Nucleic Acid Accession #; NM_C 01200.1 

Coding sequences 32S..1514 



1 
I 

GGGGACTTCT 
TGCCCCAGCG 
TGCOCGACRC 
GAGAAGGAGG 
AGAGTTTTTC 

CTGC3GGTCTC 
TTOOOCAGGT 
TOGOGGCGGC 
TCGAGTTGCG 
COGTGGTGCC 
CCGCCCCAGA 
ACCATGAAGA 
TCTTTAATTT 
TOCQAGAACA 
TTTA1GAAAT 
ACACCAGGTT 
TGATGCGGTG 
TGQAGGAGAA 
ATGAACACAG 
GGCATCCTCT 
AGTCCAQCTG 
GGATTGTGGC 
TGGC1GATCA 
ACTCTAAGAT 
ACCTTGAjCGA 
GTGGGTGTGG 



11 
1 

TGAACTTGCA 
GAGCCTGCTT 
TGAGACGCTG 
AGGCAAAGAA 
CATGTGOACG 
CTAAAGGTCG 
CCTCCTGGGC 
QTCGTCGGGC 
GCTGCTCRGC 
CCCCTACATG 
OCACCGGTTG 
ATCTTTGGAA 
AAGTTCTATC 
GATGCAAGAT 
CATAAAACCT 
aOTGAATCAG 
GACTGCACAG 
ACAAGGTGTC 
CTGGTCRCAG 
CCACAAAAGA 
TAAGAGACAC 
TOOCCCGGGG 
TCTGAACTCC 
TCCTAAGGCA 
GAATGAAAAG 
CTAGTACAGC 



21 
1 

GGOAGAATAA 
CGCCATCTCC 
TTCCCAGOGT 
AAGGAACGGA 
CTCTTTCAAT 
ACCATGGTGG 
GGCGC3GGCTG 
CGCCCCTCAT 
ATGTTCGGCC 
CTAGACCTGT 
GAGAGGGCAG 
GAACTACCAG 
CCCACGGAGG 
GCTTTAGGAA 
GCAACAGCCA 
AATGCAAGCA 
GGACAOGCCA 
TCCAAGAOAC 
ATAAGGCCAT 
GAAAAACGTC 
CCTTTGTACG 
TATCACaCCT 
ACTAATCATG 
TGCTGTGTCC 
GTTOTATTAA 
AAAATTAAAT 



31 
1 

CTTOCGCACC 
GAGCCCCACC 
GAAAAGAGAG 
CATTOGGTCC 
GGAGGTGTCC 
CCGGGACCCG 
GCCTCGTTOC 
CXXAGCCCTC 
TGAAACAGAG 
ATGGCAGGCA 
CCAGCCGAGC 
AAACGAGTGQ 
AGTTTATCAC 
ACAATAGCAD 
ACTCGAAATT 
GGTGGGAAAG 
ACCATGGATT 
ATGTTAGGAT 
TOCTAGTAAC 
AAGCCAAACA 
TGGACTTCAG 
TTTACIGCCA 
CCATTGTTCA 
CGACAGAACT 
AGAACTATCA 
ACATAAATAT 



41 

I 

CCACTTTGCG 
GCCCCTCCAC 
ACTGCBCGGC 
TTGCGCCAGG 
CCGCGTGCTT 
CTGTCTTCTA 
GGAGCTGGGC 
TGAGGAGGTC 
ACCCACCCCC 
CTCAGGXCA3 
CAACACTGTG 
GAAAACAACC 
CTCAGCAGAG 
TTTCCATCAC 
CCCOGTGACC 
TTTTGATGTC 
CGTGGTGGAA 
AAGCAGGTCT 
TTTTGGCCAT 
CAAACAGCGG 
TQACGTGGGG 
CGGAGAATGC 
GAOGTTGGTC 
CAGTGCTATC 
GGACATGGTT 
ATATATA 



SI 
I 

OCGGTGCCTT 
TCCTCGGCCT 
CGGCACCCGG 
TCCTTTGAOC 
CTTAGAOGGA 
GCGTTGCTGC 
CGCAGGAAGt 
CTGAGCGAGT 
AGCAGGGACG 
COGGGCTCAC 
CGCAGCTTCC 
CGGAGATTCT 
CTTCAGGTTT 
CGAATTAATA 
AQACTTTTGG 
ACCCCOGCTG 
GTGGOOCACT 
TTGCACCAAG 
GATGGAAAAG 
AAACaCCTTA 
TGGAATGACT 
CCTTTrCCTC 
AACTCTGTTA 
TCGATGCTGT 
GTGGAGGGTT 



Seq ID MO: 160 Protein sequence 
Protein Accession ff: NP_001l9i_i 

1 H 21 31 41 SI 

I ! I ) | ] 

MVAGTfiOLR LLLPQVLLGG AAGLVPHLGR HKPAAASSGR PSSQPfiDEVI. SEFEUlIdbSM 
FGL1CQRPTPS fiDAWPPTfM, DLYRKHSGQP GSPAPDHRLB RAAfiEANTVR 6FHHEESUEE 
LPBTSGKTTR RFFFNLSSIP TEEFITSAEL QVFJUSQKQDA IiGNHSSFHHR IMTYEI2KPA 
TAKGKFFVTR LLDT 

seq ZD no s 1G1 DHA sequence 

Nucleic Acid Accession #: nh 001399.2 

Coding sequence: 1- .8736 



1 

I 

ATGGGGAGAA 
CTC1GGOCGC 
CAGGCGCCGC 
CCCGAGTATC 
GACGTGCTCC 
TGGAAGACaC 
GATGGATTTT 
TGTGGATCAA 
QATGACCZACT 
GAAAATC3GAT 
TTCACTGGTC 
AACCAGATGT 
ACCACTGGAC 
C3EACGQGGTT 
GCTATCCCAG 



11 
I 

GACGGAGGCT 
AGGGCAOGGC 
CGCAACAGGT 
GCGAGGAGGG 
GAGGGCCCAA 
TCCCTGGAGG 
GTTCCOGTCC 
AATCAATTCA 
GCCAGTGDCA 



21 
I 

GTGTCTCSCAG 
CGGOCAGGCT 
TCGGTCCGCT 
TGCCGCAGTG 



CACAGTGTGA 
GCCAAGGGCA 
GGGCGTGGGG 
TCATCCCCAA 
GGATATGCCA 



AAACCAGTGC 
TAACATGTGT 
GCAGTGCAOT 
GAAAGGATAT 
TGGAOGTXGC 
AAGAGATTAC 
GCTGACAGGC 
CCATCCCTOT 

catccgcact 

AGGAGGAAAC 



31 
I 

CTCTACTTCC 
CAGCCTCCTC 
ACAGCAGGCT 
GCCAGCCGCG 
TC3CRGATTOC 
ATTGTCCCGA 
ACTTGTTCCA 
GTGAGATGCA 
ATTGGAACXT 
ATOQCCC3U\C 

ATTGTCXGCA 
GAGATGTGTC 
GGAGCTTGCC 
TGTATCAATA 



41 
I 

TGTGGCTGGQ 
CGCCCAAGCC 
CTOAAGGCGG 
TCCGCCGGCG 
ACTCCTACTG 
TTTGTAGAAA 
GTGGGCAAAT 
TGAATGGTGG 
ATTGTGGACA 
CGTGTGCTTG 
OGTGTTTCAC 
COAAGACTCT 
CAGCCCAGCG 
AAGATGTTGA 
CAGTGGGCIC 



51 
I 

CTGTGTGGTG 
GCCC CGGCCC 
GTTTCTAGOG 
AGGACAGCAG 
CTGOCCTGGA 
TAGTTOTOGA 
ATCATCAACC 
GACCIGTGCA 
ACCTGTCTGT 
TGTTTATGGG 
TCAGGTCAAC 
GTGCTGTGCC 
TCAGCCCTGC 
TGAATGCCAG 
TTTTGAATGC 



840 
900 
9G0 
1020 
logo 

1140 
1200 
126Q 



BO 
120 
1Q0 



€0 
120 
ISO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



60 
120 
180 



60 
120 

ieo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



1015 



1 



WO 03/042661 



AGATGCCCTO CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960 
TGCAQCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT CCAACACOGT GGGAAGCTAT 1020 
TTTTGTGTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTOGATG CATCGATCAG 10B0 
AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCOGCT GTGCACAAGA GCTCCCGGGG 114 0 
AGAATGACQA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 120 0 
CCTGAAGCCT GTCCTGTCAG AGGTTCXGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 
CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 
TTTGCCCCAA GTGGCAATGG CAATGGCTAT GGCCCAGGAG GGACAGGCTT CATCCCCATC 13 B0 
CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 
GGACCTATCA TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1500 
GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560 
AACATGGGIT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 1620 
AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 
GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG AT AT TGATGA GTGCATCCAG 1740 
AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 
TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT I860 
ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGQCAG CTTCAAGTGC 1920 
ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGQCGTT ACTGTACTGA TGTTGATGAA 1980 
TGOCAGACCC CAGGAATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040 
TQTQACTGTC OCCCAGGCCT GGCTGTGGGC ATQGATGGAC GTGTGTGTGT TGATACTCAC 2100 
ATGCGCAGTA CCTSCTATOG AQGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 
GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 
CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 
ATCACXGTGG ATGGAAGAGA TATCAATQAA TGTGCTTTGG ATCCTGATAT ATQTGCCAAT 2340 
GOGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 
GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAQ ACTGCTTTGT 2460 
GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 
GTG1TCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 
GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATG-TTC GCCCGGCAGC 2640 
AAACTCAGCT CCACAGOATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 
ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760 
GOCACCCTCG GAGCOGCCTG GGGGAGCCOC TGTGAGOGGT GTGAACTAGA TACAGCTTGC 2820 
GCAAGAGGGC TTGCCAGGAT TAAAGGTGTI ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 
TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATOTTT TGATTGCGAG 2940 
TGCCCTGAAG GCCTTACGTT GGATGQGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3000 
CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 
ATGGATGOCT GCTGCTGTGC TGTCGGGGCG DCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 
AAACGTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGK3CTGGCTT TGCTAACCGA 3180 
GQGGATGTTC TTACTGGGCG GCCATTTTAC RAAGACATCA ATGAATGCAA AGCATTTCCT 3240 
GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GC7TTCAAATG CCGTTGCAAT 3300 
AQTGK3CTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAnOATT 3360 
TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC OGGGCAGCTT TGAGTGCGAG 3420 
TGCTTCGAAG GCTATGAAAG TGGCTTCATG ATQATGAAGA ACTGCATGGA CATTGACGGA 3480 
TGTGAACGTA ACCCTCTOCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 
CAGTGTBACT GCCCACTGGG ACACGAGCTG TCACCATCCC DTGAGGACTG TGTGGATATT 3600 
AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660 
ACCTATCAGT GCTCITGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 
GATATTGATG AATGTATGAT AATGAACGGA GGCTGTCACA CCCAGTQCAC AAATTCAGAG 3780 
GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCOTGT 3840 
GCAGACATTG ATGAATGTQA AAACAATCCT GATATCTGTG ATGGCOGCCA GTGTACCAAC 3900 
ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGQCTTCCAT GGACATGAAA 3960 
ACATGCATTG ATGTCAATGA ATGtGACCTA AATTCAAATA TCTQCATGTT TGGGGAATGT 4020 
GAGAAGACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 
ACCACAGOAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 
GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200 
AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAAOCCACCA GTGTAGCATC 4260 
AATGCXCAGT GTGTAAATAC CCCGGGCICA TACCGCTGTG CCTGCTCOGA AGGTTTCACT 4320 
GGTQATGGCT TTACCTGCTC AGATGTTGAT GAGT3TGC3H3 AAAACATAAA CCTCTGTGAG 4380 
AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGGG AGTGTOAGAT GGGCTTCACT 4440 
CCAOCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4500 
GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 
GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 
TGTGTCAATG GCCTAT6TGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 
TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 
AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4B0O 
AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAAOCCCTG TOAGACATGC 4860 
qCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCEDG GAGGTGAAGG CTTCAGACCT 4920 
AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 
CAGGGTGGAA ACTGCA.TCAA CAGTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 
TACCTCAGCG AGGATAC CCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGOT S100 
GTGTGTGGGC CTGGGACCTO CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 
GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 
CX3AAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 5280 
TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTOAACC ATGCCCAACT 5340 
CCAGGAACavG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5400 
CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 
GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATT CAGTTAC 552 0 
AATGACCTGC TGTTGGTTTG TGAAGATATA GATGAGTGCA GCAATOGTGA TAATCTCTGC 5580 
CAGCGGAATG CAGACTGCAT CAATAjQTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT 5640 
TTCAAACTTT CACOCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTOCTAAC S700 
GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 
GGCTTTAAGG CTTCTCAGGA CCAOACCATG TGCATGGATG TTGATaAGTG OGAGOGGCAC 5820 
CCATGTGGAA ATGGAACTTG TAAAAACACC GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880 
GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAC5 ATGAGTGCAG TTCCXTTTTT 5940 
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GGTCAGGTGT GCAGAAATGG ACGTTGTTTT AATGAAATTG GTTCTXTCAA GTGTCTATGT SOOO 

AACORAGGTT ATGAACTTAC CCChGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC €060 

GCCCITCCCB GCTCTTGCTC TCCTGGTACC TGTCAGAAIT TGGAGGGATC CTTCAGATGC 6120 

ATCTGTCCCC CAGGGTATGA AGTAAAAAGC GAGAACTGCA TTGATATAAA TGAATGTGAT 6180 

GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC 6240 

CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGACGGA GATGCTTTGA TACTCGCCAG 6300 

AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC TTTCAACACC 6360 

ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAQGAGAGG GCTGGGGGGA CCCCTGTGAG 6420 

CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTQT GTCCATATGG CCATGGAACT 6480 

GTCCCTAGTC T T CAT GATAC ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT 6540 

TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TOCAATGGGC 6600 

TACAACCTTG ACTACACTGG AGTACGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660 

CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTOAATGCAA TTGCAATGAA 6720 

GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 6780 

CTGCTGTGTG CTTTACGCTG CATGAACACT TTTOGOTCCT ATGAATGCAC GTGCCCGATT 6840 

GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 

TTACACX3ACT GTGAATCTAG GGGCATGAT3 TGTAAGAATC TAATGGGCAC CITCATGIGC 6960 

ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 

TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 

AGATGTQAGT GTAATQAAGG ATTCCAGTCA AGTTCTXCAG GCACTGAATG CCTTGACAAT 7140 

CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200 

OGCAATCTCG TCACTAAGTC AGAAXGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAQ 72 60 

TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 

GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380 

AATGGTCAGT GCATCAATAC CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC 7440 

ACAGACATCA GTGGAACCTC TTGTATAGAC CTTGATQAAT GCTCCCAGTC CCCGAAACCA 7500 

TGCAACTACA TCTGCAAOAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7560 

GTCCTGGAAG AGGATGGAAA GACATQCAAA GACCTTGATQ AATGTCAAAC AAAGCAGCAT 7620 

AACTQCCAGT TOCTCTGTGT CAACAOCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 76B0 

TTCACACAGC ATCACACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 

TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 

GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 

CACRGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7920 

GGCTACATCC AGCACTACCA GTGGAATCAG TGTOTOGATG AGAATGAATG CTCCAATCCC 7980 

AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 

TOGGGGTTCT CCTTOGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 

TCCAAGAACC CCTGCAATTA OGGCTGCTCT AACACGGAGG GGGQCTACCT CTGTGGCTGC 8160 

CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 

GGGCAGTACC TGTCACTGGA TACAGAGGTC GATGAGGAAA ATGCXCTGTC CCCAGAAGCA. 8280 

TGCTACGAGT GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 

CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 

CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGOCTCTA AGGAGCACAT CCTGGAACTA 8460 

AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 

GACAGGGTCT TCOGCATCCA GCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 

AAGCTCATGC CCGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG 6640 

GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG B700 

GCTCTCAGAA TGAGGCTGCA QATTCAGClC 'lATTAACCGT TCACAGACTT GGGOCCAGGC 8760 

TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8820 

AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 

CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 

GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 

TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT TTTTTTTTTT 9060 

CCXGTf AG CA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAAATATA 912 0 

GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 

AAGGGAAACT CACTTGTXTT TCTTTAGATT TATAAATTTG ASCXATTTTT TTTAGAGGTG 9240 

CTTTTTAAAA ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATOCAG 9300 

CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360 

TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGXAGATC GAAGGCAAAG 9420 

CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AOGCAAGAQA AGTTGAAGCA 9480 

CAACCACTGT AGCAAAATAC CTTGACXGCT TGTGAGACCA TTAGCATTGC AGGOCAAACC 9540 

GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTGATT 9600 

CXXACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC CGCOGTTCCC 9660 

TTOAAAGGGA ACAOCTGGCA TTCTGTGGTG TTTCX3TGCTG TCXTAAATAA TGGTGCATTT 9720 

ATTATGTTCA AGTTATTTCA GGATTGCCAT ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 

GGAATATATG TTGTTGTTGT TGTTTTAAAC OCATTTTTTT TTTAGAATTT TCATTAATAC 9840 

TGTAGTTATA CACCATATGC CXCATTTTAT CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900 

ACAATGAATT GATGTTTAGT TTGCMTAGT CATTTAAAAA GATATTGTAC CAGGATGTGC 9960 
TATTAAGAGC AOGTATCCAT TATTCTTCTC AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020 

CCAAACCTCA TATGTGAAAT GGOCAAAGCA CATGCAGGCT CCXGGTTGTr CCTCTCAAAC 10O8O 

CIGTQCXGAC CAAAGATTAG TAACCAGTTA TAOCCAGTAT TTTGAGGTTT TATTQTTTTT 10140 
TTAATAACTA AAAAAAAACT OGTGCC 

Seq id NO s 162 prctein sequence 
Protein Accteaioa #s JHP_001990.1 
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lilt 

MQaURRSUiCLQ LYFLWiGCVV 1MAQGXAGQP QPPPPKPPRP 
PEYSEBGAAV ASSVRRRGQQ DVLRGPNVCG SRFHSYCCPG 
DGFCSRPNMC TCSSGQI8ST CGSKSIQQCS VRCMMGQTCA 
ENGCONQGRC IAQPCACVYG PTGPQCEROY ETGPCFTQVK 

TTGRAWGHPC emcpaqpqpc rhgpipnirt gacqbvdeoq 

RCPAGHKQSE TTQKCEDIDE C5IIPGICET GBCSJTTVG3Y 
RTGMCPSGLV KGRCAQBLPG RMTKMQCCCE PGRCWGIGTI 



41 51 
1 I 

QPPPQQVRSA TAG3EGGFLA 60 

WKTLPGGNQC 1VPICRNSCG 120 

DDHCQCOKQy IGTYCGQFVC 180 

NQHCQGQLTG IVCnCTLOCA 240 

AIPG1CQGGH CINTVGSPEC 300 

FCVCPRGYVT STDGSRCIDQ 360 

PEACPVRGSB EXHRXtCMDGl* 420 
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PMGGIPGSAG SRPGGTGGNG PAFSC2JGNQY GPGGTGFIPI PGGNGFSPGV GGAGVGAGGQ 480 

GPllTGI/riL NQTIDICKEH AKLCLiJGRCI PTVSSYRCEC NMGYKQDANG DCIDVDECT5 540 

HPCTNGDCVN TPG8YYCKCH AGFQRTPTKQ ACIDIDECIQ NGVLCKNGRC W9DGSFQCI 600 

CHAGFELTTD GKNCVDHDEC TTTNMCLNGM CINEDGSPKC ICKPGFVIiAP NGRYCTDVDE 660 

CQTPGTCMWG RCTNSEGSFR CDCPPGXJWG MDGRVCVDTH MRSTCYGGIK KGVCVRPFPG 720 

AVTKSECCCA NFDYGFGE P-C QPCPAKNSAH FHGLCSSGVG ITVDGRDINE CALDFDICAN 760 

GICEKfliRGSY RCNOffSGYBP DASGRNCIDI DECLVKRLLC EMGIjCRNTPG SYSCTCPPGY 840 

VFRTETETCE DIWECESHFC VNGACRKNI/3 SFWCECSPGS KLSflTGLICI DSLKGTCWEJJ 900 

IQDSRCBVNI NGATLKSECC ATLGAAWGSP CERCEI^JTAC PRGIiARIKGV TCEDVNECEV 360 

FPGVCPNGRC VNSKGSFHCE CFEGLTLDGT GRVCLDIRME QCYLKMDEOE CIHPVPGKFR 1020 

HDACCCAVGA AWGTECKECP KPGTKEYETL CPRGAGFANR GDVLTGRFFY KDHJECKAFP 1080 

GMCTYGKCRM TIGSFKCRCN SGFAUJMBER NCTDIDECRI GPDLCGSGIC VNTPGSFECE 1140 

CFEOYESGFM MMKNOtDIDG CERNFLiiORG GTCVNTEGSP QCDCPIXJHEL SPSREDCVDI 1200 

JJECSLSDNLC RNGKCVHMIG TYQC9CHPGY QATPDRQGCT DIDECMIMNG GCDTQCTNSE 1260 

GSYECSCSBG YALMPDGRSC ADIDECBNNP DICDGGQCTN IPGEYRCLCY DGFMASMDMK 1320 

TCIDVNECDL MfiNICMFGBC ENTKGSFICH CQLGY8VKKG TTGCTDVDEC EIGAHNCDMK 13 B0 

ASCLNIPGSF KCSGREGWIG NGIKCIDLDE CENGTHQCS"! NAQCVNTPGS YRCACSEGFT 1440 

GDGFTCSDVD ECAEaTINLCE KGQCLNVPGA YRCECEMGFT PASDSRSOQD IDECSFQMIC 1500 

VSGTOSHLPG MPHCICDDGY ELDRTGGHCT DIDECADPIK CVNGLOTHTP GRYECNCPPD 1560 

FQENPTGVGC VDEJRVONCYL KFGPRGDGSI. SCHTEIGVGV SRSSCCCSLG KAWGNfPCETC 1620 

PPVHSTEYYT I.CPGGEGFRP NPITIILEDI DBOQEIjPGLC QGGNCHJTFG SFQCECPQGY 1680 

YLSEDTRICE DIDECFAHPB VCGPGTCYNT T^GNYTCICPP EYMQVNGGBU CMOMRKSFCY 1740 

RSYNGTTCKFJ ELPFNVTKRM CCCTYNVGKA GNXFCEPCPT PGTADFXTIC GNIPGFTFDI 1800 

HTGKAVDIDE CKEIPGICAN GVCINQIGSP RCECFTGFSY NDLIjIiVCEDI DECSWGDNLC 1860 

QRHftDClNSF GSYRCECAAG FKLSPNGACV DRNBCLSIFN VCSBGLCVDIi QGSYQClCHN 1920 

GFKASQDQTM CMDVDECEHH POGN3TC3SNT VGSYNCI*CYP GFEIVTHNNDC liDlDECSBFF 1980 

GQVCENGRCF NBIGSFKCLC KEGYELTPDG KNCTDTNECV ALPGfiCSPGI OQKIiEGSFRC 204O 

ICPPGYEVK3 ENCIDIKECD EDPNICIiFGS CTNTPGGFQC LCPPQFVLSD HGRECFDTRQ 2100 

SFCFTNFEUG KCSVPKAFMT TKARCCCSKW PGEGWGDPCE LCPKDDEVAF QDLCPYGHGT 2160 

VPSLHDTRED VNECLE8FGI CSNGQCINTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN 2220 

PCGNGTCTNV IGSFEOJCNE GFEFGPMMNC KDI3JFCAQNP LLCAURCWNT FGSYECTCPI 2280 

GYALREDQjKM CKDLDECAEG LEDCESRGMM CKNLIGTFMC IGPPGMARHP DGEGCVDEHE 2340 

CRTKPGICEN GRCVNIIG9T RCECWEX3FQE SS3GTECLDN RjQGLCFAEVL QTICQMASSS 2400 

HNLVTXSECC CDGGRGWGHQ CELCPLPGTA QYKKJCCPHQP GYTTDQKD1D ECKVMFHIjCT 2460 

NGQCINTKGS FRCFCKVGYT TDISGT3CID DDECSQSPKP CNYICKKTEG SYQCSCPRGY 2520 

VLQEDGETCK DliDECQTKQH NCQFLCWTL GGFTCKC&FG FTQHHTACID HHECG3QP1±I* 2580 

GGGKJGIG^TC PGSFSCBCQR GF3LDATGLN CEBVDECDGN HRGQHGOQNI LGGYROSCPQ 2640 

GY1QHYQHNQ CVDENECSNP NACGSASCYK TLGSYKCACP SOFSFDOFSS ACHDV5JECS9 2700 

SKKPCNYGCS KTEGGYLCGC PPGYYRVGQG HCVSGMGFNK GQYLSLDTEV DEENABSPEA 2760 

CYECKIHGYP KKDSRQKREI HEPDPTAVEQ ISLEBVDMDS PVKMKFNIiSH IX39KEHILEL 2820 

RPAIQPXKWH IRYVJSQGND DSVFRXHQRN GIiSTLHTAKK KLMPGTYTLE ITSIPLYKKK 28 B0 
EUCKLESSHE DDYLLGELGE ALRMRIiQIQL Y 

Seq HJ NO: 163 ONA aequence 

2Tucl«ic Acid Accession #: NM_013372.1 

Coding sequence; 63.. 617 

1 11 21 31 41 ' 51 

1 I I 1 I 1 

GCGGCOGCAC TCAGCGCCAC GCGTCGAAAG GQCAGGCCX3C GAGGACCCGG CGCACTGACA 60 

GTATGAGCOG CACAGC3CTAC ACGGTGGGAG COCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGGCAT CCCCCC33CCA GACAAGGOOC 180 

AGCACAATGA CTCAGAGCAG ACTCA3TCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTOGCACCAT CATCAACOGC TTCTGTTACQ 420 

GCCAGTGCAA C1CTTTCTAJC ATCCCCAGGC ACATCOGGAA GGAGGAAGGT TCCTTTCAGT 48 0 

CCTGCTCCTT CXGCAAGCCC AAGAAATTCA CTACCATGAT GGtCaCaCTC AACTGCSCCTTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACAOGTGT GAAGCAGTGT CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCOC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGGCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAOCCra CTTGTGOGTA GTTOGTGTGC ATGAGTGTGG 7S0 

AtTGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACRC AGTCTCTGCT AGAGAGCACT 840 

OOCTATTTTG TAAACATATC TQCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 

CXCRCATCTA AAGGGGCGGG GCOGTGGTCT GGTTCTGACT TTGTGTTTTT GtGCCCTCCT 960 

GGGGACCAGA ATCTCCTTTC GGAA1GAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GAOCrcGlTTl AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATOCTC 1080 

CTTT0CTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGAXGCAA ATGTTTTTCA 1200 

TTTTGTGAAG ACCGTCCAGA CXCXGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGQGAGGGT GGAGGGTGAG GCCAAATCAG OTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGGTTC TGAGAAAGTC 13 BO 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGOAQA AATQAOATTQ CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTC3VAA CCTAACACCA AACTGAAAAC ATAAATACTG 1S0O 

ACCACTCCTA TGTTOGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCX3VTPCCAC TAITTCCCAT AATGCTTCXG AGAGGCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGXA AGGAAATAAA GGGRATTGOC TCTGGCTAGA GAGEAAGTTA 1920 

GGTGTTAATA CCTGGTAOAA ATGTARGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGAOCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTlCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 
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TCTOATTAAA CTTGGCCTAC TGGCAATGGC TACT TAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACXCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

5AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGGCGAAT TTQTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

« p. CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

IU ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGOAAA GAAGCTGAAA 2700 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTOGTTAA TGGAGATGAC TTAAGTTGGC 2820' 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 28 BO 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

lO TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GA CTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

ryf. TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

Z\) GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAOAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATT A TT ACT T CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

^ TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AAGATTTCTA 360D 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTQGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

~ n AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

JK) AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTQATAGT CTCATAAAAC TAATTTGGCT TCAAOTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 



35 



45 



Seq ID NO: 164 Pirotein sequence 
Protein Accession #; NP 037504.1 



1 11 21 31 41 51 

ACk 1 1 1 I I 1 

*+\J MSHTAYTVGA LLLLLGTT^P AAEGKKKGSQ GAZPPPDKAQ BNDSBQTQSP QQPGSRNRGR 60 
GQGRGTAMPG BEVIiESSQEA LHVTKRKYLX RDWCXTQPLK. DTIHEEGCCTS RTIIKRfCYQ 120 
QCKSPYIPBH IRKEEGSFQS C3FCJCPKKFT TMMVTKHCPB LQPPTKKKRV TEVKQCRCIS 180 



Seq ID HOx 165 DNA sequence 

JSuclelc Acid Accession #c CAT cluster 



1 11 21 31 41 51 

so 1 1 1 1 » I 

JU GAATTOCATC GGACAGAG CTT TCGOCATGGC CGAGTTGOGC GTGGTTGTGG CACTAACACT 60 

GCTACGTTTC CGCCTGAGCG TGGACOGAAC GCGCAAGGTG CGGCGGAAGC CGGAGCTCAT 120 
ACTGCGCAOG GAGAACGGGC TCTGGCTCAA G 

55 Seq id NO: 166 DMA sequence 

HUclelc Acid Accession # ( FGJBWBSH predicted 
Coding sequencer l.,l650 

ATGCCACCTC TGCCATCCAG AATGTCTCGG GGG0CCTOCT TGGTGJACAGG CAGGATGCTG 60 

CCCATCACAG ACCGCCTGCT GCACCTCCTG GGGCTGGAGA AGAOGGCGTT CCGCATATAC 120 

GCGGTGTCCA COCTTCTCCT CTTCCTGCTC TTCTTCCTGT TCCGCCTGCT GCTGCGGTTC 180 

^ CTGAGGCTCT GCAGGAGCTT CTACATCACC TGCCGCCGGC TGCGCTGCTT CCCCCAGCCT 24 D 

OD CCCOGGCGCA ACTGGCTGCT GGGCCACCTG GGCATGTAOC TTCCAAATGA GGCGGGCCTT 300 

CAAGATGAGA AGAAGQTACT GGACAACATG CACCATGTAC TCTTGGTATG GATGGGACCT 360 

GTCCTGCOGC TGTTGGTTCT GGTGCACCCT GATTACATCA AACCCCTTTT GGGAGCCTCA 420 

GCTGOCATCG CCCCCAAGGA TGACCTCTTC 7ATGGCTTCC TAAAACCTTO GCTAGGGGAT 4B0 

TOGGGCTGCTGC TCAGCAAAGG TGACAAGTGG AGCCGGCACC GTCGCCTGCT GACACCCGCC 540 

TTCCACTTTG ACATCCTGAA GCCTTACATG AAGATCTTCA ACCAOAGCGC TGACATTATG 600 

CATGCTAAAT GGCGGCATCT GGCAGAGGGC TCAGCGGTCT CCCTTGATAT GTTTGAGCAT 660 

ATCAGCCTCA TGACCCTGGA CAGTCTTCAG AAATGTGTCT TCAGCTACAA CAGCAACTGC 720 

CAAGAGAAGA TGAGTGATTA TATCTCCGCT ATCATTGAAC TGAGCGCTCT GTCTGTCCGG 780 

_ . CGCCAGTATC GCTTGCACCA CTACCTCGAC TTCATTTACT ACCGCTCGGC GGATGGGCGG 840 

fO AGGTTCCGGC AGGCCTGTQA CATGGTGCAC CACTTCACCA CTGAAGTCAT CCAGGAACGG 900 

CGGCGGGCAC TGOGTCAGCA GGGGGCCGAG GCCTGGCTTA AGQCCAAGCA GGGGAAGACC 960 

TTGGACTTTA TTGATGTGCT GCTCCTGGCC AGGGATGAAG ATGGAAAGGA ACTGTCAGAC 1020 

GAGGATATCC GAGCCGAAGC AGACACCTTC ATQTTTGAGG GTCACGACAC AACATCCAGT 1080 

OA GGOATCTCTT GGATGCTGTT CAATTTGGCA AAGTATCCGG AATACCAGGA GAAATGCOGA 1140 

OU GAAGAGATTC AGGAAGTCAT GAAAGGCCGG GAGCTGGAGG AGCTGGAGTG GGACGATCTG 1200 

ACTCAGCTGC CCITTACAAC TATGTGCATT AAGGAGAGCC TGCGCCAGTA CCCACCTGTC 1260 

ACTCTTGTCT CTCGCCAATG CACGGAGGAC ATCAAGCTCC CAGATGGGCG CATCATCCCC 1320 

AAAGGAATCA TCTGCTTGGT CAG CATC TAT GGAACCCACC ACAACCCCAC AGTGTGGCCT 1380 

GACTCCAAGG TGTACAACCC CTACCGCTTT GACCGGGACA ACCCACAGCA GC33CTCTCCA 1440 
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CTGGCCTATG TGCCCITCTC TGCAGQACCC AGGAATTGCA TCGGACAGAG CTTCGCCATG 
GCCGAGTXGC GCGTGGTTGT GGCACTAACA CTGCTACGTT TCCGCCTGAG CGTGQACCGA 
ACGOGCAAGG TGCGGCGGAA GCCGGAGCTC ATACTGCGCA CGGAGAACGG GCTCTGGCTC 
AAOGTGGAGC CGCTGCCTCC G06GGCCTGA 

Seq ID MOi 167 Protein sequence 

Protein Accession #t FGENESH predicted 



15O0 
1560 
1620 



1 

! 

MPPIiPSRMSR 
IjRLCRSFVIT 
V^PLLVLVHP 
FEFDIL.KPYM 
QEKMSDYISA 
RRAliRQQGAB 
GISHMLFNIA 
TLVSRQCTED 
LAYVPFSAGP 
KVEPE»PFRA 



11 
I 

GPSI»VTGRMIj 
CRRI.RCFPQP 
DYIKPLLOAS 
KlFNQSADIM 
riELSALSVE 
AWI/KAKQGKT 
XYPEYQSKCH. 
IKkPDGRIIP 
RNCIGQSFAM 



21 
I 

PITDRIjLHLJj 
PRKNHItLGHL 
AAIAPKDDIiF 
HAKWRHLAEG 
RQYKLHHYLD 
LDFIDVLL&A 
EEIQEVMKGR 
KGIICIiVSIY 
AELHVWALT 



31 
1 

GIiEKTAFRlY 
GMYfcPNEAGI* 
YGFLKPWLGE 
SAVSLDMFEH 
PIYYRSAEGR 



ELEELEWDDL 
GTHHNPTVWP 
I^RFRLSVDR 



Seq ID NO : 16B DMA sequence 
Nucleic Acid Accession #: AKOSSOaa.l 
Cooing sequence i 252,-1772 



AGGAAAOCAA 
GCCCCAGATG 
TAACTCAGAG 
CCTGTGQTGG 
CACTTTCATT 
GCTCAACCAA 



TATGCA.TTTC 
GTGGGGAAGC 
GGCTTTTTCA 
CACACCCCCT 
AAAAATCCCC 
AAGTCGCCTT 
AAGACCTACT 
GGTGCGTGGA 
CCGOCTGCAA 

acttggttcc 
acaaagtgct 
tgcttttgtg 

CCGTGATGAC 
GACGGAGCTT 
ATCTGGAGGT 
TCAACCATTA 
CCCAGGCTGT 
TTGCTCTATA 
GTTTTTACAT 
ACAACCACGT 
TGGTTGACGG 
CCACGTCATC 
TAAGTTTTGC 
CCCCOCACTG 



11 
I 

GGCAAGCTCC 
TGATGAGCGC 
GCCAGTGTUA 
GCAAAAGGGC 
CATGATAATG 
COGQTGGOGT 
CAAGGCxCTG 
ACAAATCCAG 
AGATGTGATT 
GTCTOAGACC 
GAGGGAGCTS 
TGCAAAGAGG 
GGCCAGtSGCC 
GGGAGTGCTG 
TGTGATGATA 
GTACAAGGAA 
TCTAGGGGGG 
GAAGTCCCCC 
GGTCTTCTTG 
ATTTTTTAAG 
GAGGCCGCTC 
GCTGCGGCAC 
CCACGCACTG 
GAGATTTGGG 
TGGATTCTTC 
GCAGAGAATA 
CAGCCTGOSA 
CAAGTOOCAG 
CTGCAAAAGC 
ATTCATCTTC 
GTCTGCATAA 



21 
I 

CCCTGTCAAA 
TTCCAGGdT 
TGGGAGTTCC 
TlTQSAACGG 
TCGAACACGC 
tTTQCCCAAAC 
AGCATAGACT 
AAGTTTTTCT 
CTCGAG1G0C 
TTGGCCAAGC 
GAGGAGCTTC 
AUCATCATTT 
CTGAAGAACC 
GCTTCCGCCC 
GCCAGACTCA 
GAGCAGCTCA 
ACGCAGATCC 
AGGTTATTTA 
CAACXGAACT 
ASCTTTCCTG 
TTCCTCTGCT 
CTTAACTTCT 
GAGAATGGGG 
CTGCTCTTTA 
TTTAAGATAA 
AAGCACACAG 
GCGGCACGCC 
GAGITCAGGA 
CATACCTTGA 
CCAGCATCTT 
AAQAAAATAA 



Seq ID MOs 169 Protein sequgnce 
Protein Accession #: BAB71658.1 



QIO.KFFFEHF 
RELEEUJtAQ 
GVIiABAHIDQ 
LGGTQIHLHK 
FFKSFPENCC 
HALENGGDHV 
QRIKHTDLES 
CKSHTLKIQT 



11 
I 

LERRVTGSTO 
KNKDIOSGEA 



FSGiFQHCVD 
IPQDLLHKVI* 
PLDRDIGRSL 
HLKDLNTQAV 
PSAVYEHKHV 
VGIPIYVSFA 



21 
I 

RWRLPKQPFS 
DVILECLGPK 
AKRIIISLKI 
VMIARLKPST 



RFLF&CUILH 
RFGLLFNQEN 
SURAARLVKY 
FIFPAS 



31 
I 

GCACCTTGGC 
CAGGCTCAGA 
TCCACTCAGC 
TXGCTTGTCT 
ACAAAGCTCQ 
AGCCtTTCTC 
TTGAGGAAGC 
TTGAGAATTT 
TGGGCTTCAA 
TCTACCTGAA 
TCCGAGCTCA 
CCTTGAAGAT 
TCTACATGAQ 
ACATCCTCCA 
AGCCAAGCAC 
CCACCGGCTG 
ACCTCCACAA 
CCTTTAGTGA 
ACAAGATTCA 
AGAACXGTTG 
TGCGTCTGCA 
TOGCAGAGTC 
GCGACATQQT 
ACCAGGAGAA 
AGGGACTCAA 
ACCTGGAATC 
TGGTGAAGTA 
CAAACCAGAT 
AAATCCAAAC 
GACAGTTTCC 
AATGACATAA 



31 
I 

GDIiLSLSQMC 
WBLHQPRLPQ 
NDPtiVTKVAF 
ZKKPYBAQCK 
FHIxLKTHLIiW 
GITKGKDLEV 
TTYSJCTlALY 
BIRAEAI*VDG 



41 
1 

AVSTLUJFLL 
QDEKKVliDNM 
GMiLSKGDKW 
ISLMTLDSLQ 
RPHQACDMVH 
EDIRAEADTP 
TQLPFTTMCI 
DSKVYNPYRF 
TRKVRRKPSL 



41 
I 

GCATAAGAAS 
AGGGQCCCCC 
ACACTTCCCC 
TTTCTCTOCT 
GCTGGAACGC 
TGGOOACCTG 
TTTGAGGAAC 
CAAGAACAAG 
ATGGGAGCTC 
AGCCCTGGCG 
ATCACCTAAG 
CAATGACCCA 
TGAGGTGGAG 
GT1CRGTGGC 
CATCAAGAAA 
CGAGAAGTGG 
AATCC CACAO 
ATTCCATCTT 
GGCAATTCCG 
CTTTCTGGAC 
CGGCATCACC 
ATGQCTCGAC 
CCACCTGAAA 
TACAACTTAT 
ACATGATACT 
TCCCTCTQCG 
TGAGATCAGA 
CAAGCAGAAG 
TQTGGGCATC 
AGAAGAATCT 
AAGGGAGC 



41 
I 

XALSXDFEEA 
SETLAKLYIiK 
ATAIiKNLYMS 
YKES&LTTGC 
VFLQLNYKIQ 
liRiELNFFPES 
GFFFKIKGLK 



SI 
I 

FFLFRIiLiRF 
HHVUiVWHGP 
SRHRRLLTPA 
KCVFSYNSNC 
EFTFTEVIQER 



KE 9 KRQYPPV 
DPDHPQQRSP 
llaRTENGLWL 



51 
I 

AAAAGGGGGA 
AGCTCTCCTG 
TQTAAACACG 
GCGTAATTTC 
GGGGTCACTG 
CTCTCACTJT 
CCAGACAGGT 
GACATCCAAA 
CATCAGCCCC 
CAGGGCACCA 
AAGACCAAAG 
CTGGTCACTA 
ATTAACTTGG 
CTGTTTCAAA 
TTCTACGAGG 
CTGGAAATGA 
GACCTGCTC3C 
CTGAAAACAA 
ACTTATGAAA 
CGGGACATAG 
AAAGGCAAGG 
CAGGTTACAG 
GATCTTAACA 
TCGAAAACGA 
ACCTCTTATA 
GTGTA03AGC 
GCAGAGGCCC 
TTTGGGTTQA 
CCAATCTATG 
ATGGGATTTT 



51 
\ 

LRNPDRLCIS 
ALAQGTTHFL 
KVEINLEDLL 
SKWLQlHLiVP 
AIPTYETVMT 
WLDQfVTVHHY 
HDTTSYSPYM 
KQKFGLTTSS 



Seq ED NO: 170 DNA agpuence 

Nucleic Acid Accession #? NM 00700 0.1 

Coding sequences 1...777 

\ ™ ?! 31 41 51 

I I I I I | 

ATGGCGTCTG CGGCAQCAGC GGAGGCCQAQ AAGGGATCTG CAGTTGTGGT GOG-CCTGCTA 
GTTOTGGGCA ATAT CATTAT TCTGCTGTCA GKSCCTGTCCC TGTTTGCTGA GACCATATGG 
GTGACAGCCG AGCAGTACCG TGTATACCCA CTGATGGGAG TCTCAGGCAA GGATGACGTC 
TTCGCTGGTG CCTGGATTGC CATCTTCTGC GGCTTCTCCT TCXTCATGGT AGCCAGTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10EO 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



GO 
120 
180 
240 
300 
360 
420 
480 



60 
120 
1B0 
240 



1020 
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GGTGTGGGTG 
C T CAT CGTCT 
ATGGTGTCCA 
GACCAGGGCC 
GGCACATCTG 
GAGGTGGTGT 
AACOAGGAGG 
CACATCGGCC 
CTGATGTGGA 



CCQCACTCTG 
ACATCTTCGA 
ACCCATCCCT 
AGGAGCTGAC 
GTCCCATGGA 
TCCCCTGGCC 
GCTGCCGCCT 
ACGGCATCGA 
CGCTCCCGGT 



CCGCCGCCGG 
GTGCGCCTCC 
GATCACCAAG 
OOGCCTCTGG 
CTGGGTGAAC 
CCCACTGTGC 
GGGGCACATG 
CAGCTACACG 
CATGCTGATA 



TCCATGGTCC 
TGCATCACGT 
CAOATGCTGA 
GACCQCQTCA 
TTCACGTCAG 
TGTCGCCGGA 
GACTACCTGT 
TGGGGTATCT 
GCCATGTATT 



TCACGTACCT 
CCTACACCCA 
CCTTCTACAG 
TGATTGAGCA 
CCTTCCGGGC 
CGGGAAACTT 
TCACCAAGGG 
CGTGGTTTGG 
TCTACACCAT 



GGTGCTCATG 
CCGTGACTAC 
CGCGGACACC 
AGAATGCTGT 
GGCCACTOCG 
CATCCCCCTC 
CTGCTTCGAA 
GTTTQCCATC 
GCTCTGA 



Seq ID NO i 171 Protein sequence 
Protein Accession #z NP_00fi 931.1 

* ? L ?1 31 41 si 

MASAAAAEAE KQSPVWGLL WGNIIII^S GLSItFAETIW VTADQYRVYP LH3VSGKDDV 
FAGAWIAIFC GPSFFMVASF GVGAAJUCRRR SHVLTYLVIM LIVYrFECAg CITSYTHHBY 
MV&NP5LITK QMLTFYSADT DCGQKLTRIW DHVMIEQKCC GTSQPHDWVN FTSAFRAATP 
EWFPWPPLC CRRTGNFIPL NBEGCRLGHM DYLFTKGGFS HIGHAIDSYT KGISWFGFAI 
IiMWTLPVMLI AMYFYTPHj 

fieq ID NO; 172 JPNA sequence 

Nucleic Acid Accession #* NM_0067G0-1 

Coding sequence: 39. .593 



a 

1 

GAAAGCCTGC 
GGACCTTGCC 
AC3VTCTCAAG 
TGCCCCCCTC 
GCAAAGTGOT 
GTGTGGTGGA 
ACCTCGTGCC 
AGTCCAGCAG 
TGGGTATQGC 
TCCTGCTGGT 
TGCCCGGAGC 
GCTCCCAGGC 
CTCCCCTAGA 
ACACTCCATT 
CTTTAAACAC 
GCTGAGGCTG 



11 
1 

CAGCACCTAT 
CTTGATCCTG 
CCTCrCTGGT 
TCACCTCACA 
GACGTCCAGC 
CAGTGGTGCT 
AGGAACCAAA 
AOAGATCCCA 
CCGCACAGGG 
GCTGGGCTTC 
AGCAGCTTCT 
CCCAGGCCTG 
GCCCTCTCCT 
ATTCCTCTGA 
CCCCATAACA 
CATCTGCCAA 



21 
I 

TCCACCTCCC 
ATTCTGCTGG 
CTGCTGTCOC 
GGAGGCAATG 
TTTGTGGTGC 
GGCT1CACAG 
TTCTACATTT 
ATGTCCACAC 
GGCATGGTGG 
ATCATTGCCC 
CCAGGAAGCC 
TGGCTCOCTT 
CCCTCTGTCC 
CCCCACTCCT 
ATTCCCOCAT 
AAAAAAAAAA 



31 
I 

AGCCCAGCAT 
CTCTGCTGTC 
CGGCGCTAAC 
CCACACTGAT 
CTCCGTGCCG 
TCACTCGGCT 
CCTACCTAGT 
TCCCTCGAAG 
TCATCACGGT 
TGGCACTGGG 
CAGGGCACCA 
GGTQCCCTCG 
CTCTCCTTGC 
GTCAS3AGTTG 
CCTTCAGTGA 
AA 



41 
I 

GGCACCCCTG 
CCCAGGGGCT 
GGAGAGGCTG 



TGGGCGCAGG 
CAGTGCATAC 
GAAGAAGOGG 
GAACATGGAA 
GCXGCXCTCT 
CTCCCGCAAG 
TCCAGCTCCC 
CCTCCTOCTC 
CCCCAGTGCC 
ACTTTCCTOC 
ACTAAGTOCC 



SL 
I 

CTGCCCATCC 
GCAGACTTCA 
CTGGTTGOCT 
GCCAATGACA 
GAACTGGTGA 
CAGGTGACAA 
ACAGOCACTG 
TCCATTGGGC 
GTCGOCATGT 
TAAGGAGGTC 
CAGCCCACCT 
CTGCCCTCCr 
TCACCTTCCA 
CATTTTACCA 
TATAATAAAG 



Seq id NO i 173 Protein sequence 
Protein Accession #; NP_0067S1.1 

1 11 21 31 41 

MAPLLPIRTL PLIill^ALL SFGAADFNlfl SLSGLL3PAI, TESLlxVAJbPP CHLTGGNATL 
MVHRfiHDSKV VTSSFWPPC RGRRELVSW DSOAGFTVTE LSAYQVTNLV PGTKPYIfiYL 
VKKGTATES3 RBIPMSTLPR RNME9XGLGM ARTGGMWIT VL&SVAHFI*. VLGFIIAIiAI* 



51 



Seq ID NO: 174 DMA sequence 

Nucleic Acid Accession #: Bos sequence 

Coding sequence: 1..2733 



1 
1 

ATGAAAGTTG 
TTCCTGGGGA 
CATCTAGGCX: 
GAGAAAAGAG 
GGGCTAATTA 
CTGCAGTGTA 
AACTGCTACC 
AGCCAGAOTG 
AGGTTTACAA 
ATTGAAATTC 
ACCCAATTTC 
TCTGAACTGC 
CTGTTTCCAT 
GTCTTTGGAT 
GGAAACATCA 
CTCTCTCTGC 
GCAGCTGTOT 
ACAGTGGGGA 
GCCAGCCATT 
ATCCTTAATT 
AGCTCACGGT 
CCTCTGAATT 
CTCAAAAGGG 
AGAGGCCGTG 



U 
I 

GAGTGCTCTG 
AAAATGATGG 
CAGTCGAAGA 
ATTTGAGAAA 
GAATTATCAG 
CCTGTGAAGA 
TTCACACGGC 
TCAATTTCTG 
ATGACCTTTT 
AACTTAAAAA 
GAAATGGAAG 
TGTCAGCCAT 
TAGAAGACGG 
TTGGGTCCAA 
CAGCCAAGTG 
TTGAAGAACT 
CATCCTTCGT 
ATCTGGCTTC 
TCAGGGTGTC 
CAGCCTCAGT 
TACTAGAGAC 
TTTCTCGGAA 
GTTACAGCTA 
TGTTAATTGG 



21 
I 

GCTCATTXCT 
CATCAAAACA 
ATATCAGCIG 
TTTTCTGAAG 
AGCAAAGGCT 
CAGCTACACC 
TGGAQCACTC 
TQAGAGAACA 
GAAXTCATCT 
AGCATATGAA 
CATCGTTGCr 
TGAACATGTT 
CTdTTCAGA 
GGATGATGAA 
TGAGTCCTCT 
GAACAAGAAT 
GCAAAATCTT 
GGTGGTGTCG 
CAATTCAACA 
AACCAACTGG 
ATTAQAAAAC 
AXTCATTGAC 
TCAGATTAAA 
GTCAGACCAA 



31 
1 

TTCTTCACCT 
AAAAAAGAAC 
CTGCTTCAGQ 
CTCTTGAAGC 
ACCACAGACT 
TGGXTTCCTC 
CCAAGCTGTG 
AAGATTTGGG 
TCTGCTATAT 
AGAATTCAAG 
GGGTATGAAG 
GCCGAGAAGG 
GTGTTCGGAA 
TATACCCTGC 
GGGTGGCAGG 
TTCAGTATGA 
TCTGTCATCA 
ATTCTGAGCA 
ATGGAGGATG 
ACAGTCTTAC 
ATCAGCACTC 
TGGAAAGGGA 
ATGTGTCCCC 
TTCCAGAGAT 



41 
I 

TCACTGACGG 
TCATTGTGAR 
TGACCTATAG 
CTCCATTATT 
GCAACAGCCT 
CCTCATGOCT 
AATGTCAXCT 
GCACTTTCAA 
ACTCCAAATA 
GTTTTGAGTC 
TTGXTQaCTC 
CTAAGACAGC 
AAGCCCAGTG 
CCTGCAGCAG 
TCATCAOGGA 
TTGTAGGCAA 
TTCGGCAAAA 
ATATTTCATC 
TCATCAGTAT 
TGCGGGAAGA 
XGGTGCCTCC 
TTCCABTGAA 
AAAATACATC 
CCCTTCCAGA 



51 
I 

CCACGGTGGC 
TAAGAAAAAA 
AGATTCCAAG 
ATGGTCACAT 
GAATGGAGTC 
TGATCCCCAG 
CAACAACCTC 
AATTAATGAA 
TGCAAATGGA 
GGTTCAGGTC 
CAGCAGTGCA 
CCTTCACAAG 
TAATGACATT 
TGGCTACAGG 
GACTTGTGTG 
TGCCACTGAG 
OCCATCAACC 
TCTGTCACTG 
AGCTGACAAT 
AAAGTATGCC 
GACAGCTCTT 
CRAAAGCCAA 
TATTCCCATC 
AACTATTAXC 



300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 



60 
120 
180 
240 
3 00 
3 60 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 



60 
120 
1B0 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



1021 



WO 03/042661 



PCT7US02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



AGCATGGCCT 
GTCAATGGAC 
TTTTTTTCCA 
CAT T TG CAGT 
TGCCAATGTA 
ATCTTCCCCG 
ATTTTATGCC 
CACACACGTC 
TTTATTGTTG 
GTGTTCTTTA 
CTGCTGGCTT 
GTTGGATTTT 
ACGCAACCTA 
AGCAAACCAC 
GTGGTGCTGC 
GATGACAAGG 
GGGCTCACGT 
GTrATTTTTG 
TTGGACAGTA 
CAAACAGAAA 
AACCCACTGC 
ATCA.TGCTAA 



C3TTGACTCT 
CTGTQATATC 
AGATAGAGTC 
GGAACQATGC 
CTCACTTGAC 
TTGTAAAATG 
TGATCATCGA 
GTATTTGCAT 
GTGCCACAGT 
CACACTTCTT 
ACCGGATCAT 
GCCTGGGTTA 
QCAATACCTA 
TCCTGGCT2T 
TAGTTCTCAC 
CCACCATCAT 
GGGGCTTTEG 
CTT TACTCAA 
AGCTGCGACA 
AGCAAAACTC 
AAAACAAAGG 
CTCAGTTTGT 



GGGGAACATT 
CACGGTTATT 
AAACCTGAGC 
AGGCTGCZCAC 
CTCCTTCTCC 
GATCACCTAT 
GGCTTTGTTT 
GGTGAACATA 
GGACACCACG 
CTACCTCTCT 
CCTCGTQTTC 
TGGGTGCCCX 
CAAAAGGAAA 



AAAGCTCTGG 
CCGCGTGGGG 
AATAGGAACA 
TGCATTCCAG 
ACTTCTGTTC 
ATCAGATTTA 
CCATTATGCA 
CTCAAATGAA 



CTACCOGTTT 
CAAAACTATT 
CAGCCTCATT 
CTAGTGAATG 
ATATTGATGT 
GTGGGACTGG 
TGGAAGCAGA 
GCCCTGTCCC 
GTGAACCCTT 
TTGTTCTTCT 
CATCACATGG 
CTCATTATAT 
GATGTGTGTT 
GCACTGGCTA 
AGGCOGACTG 
AAGAGCCTCC 
ATA31X3GACA 
GGATTl-riTA 
AACAAGTTGT 
TCTGCCAAAC 
TTTTCTCATA 
TAA 



CCAAAAATGG 
CCATAAATGA 
GTGTGTTTTG 
AAACTCAAGA 
CACCTTTTGT 
GTATCTCCAT 
TTAAAAAAAG 
TCTTGATTGC 
CTGGAGTCTG 
GGATGCTCAT 
CCCAGCATIT 
CTGTCATTAC 
GGCTTAACXG 
TTGTG3CTGT 
TTGGGGAAAG 
TCATTCTGAC 
GCCAGAATCT 
TCTTATQCTT 
CTGCCTTAAG 
CCAAATTCTC 
CTGGAGATTC 



AAATGCTCAG 
AGTTTTOCXA 
GGATTTCAGT 
CATCGTGACG 
CCCCTCTACA 
TGGAAGTCTC 
CCAAACCTCT 
TGATGTCTGG 
CACAGCTGCT 
GCTIGGCATC 
GATGATGGCT 
CATTGCTGTC 

gtccaatgga 
gaacttcgtt 

ACTGAGTCGG 
CCCTCTGCTA 
GGCTTGGCAT 
TGGAATACTC 
TTCTTGGAAB 
AAAQCCTTTC 
CTCCGACAAC 



Seg ID no i 175 Protein sequence 
Protein Accesaion $s Eos sequence 



1 
I 

MKVGVLWLIS 
EKRDURNFLK 
NCYLiHT AOA I, 
IEIQLKKAYE 
JL,FPI*EDG3PR 



11 
I 



ASHPKVSNST 
PUJPSRKPID 
SMASLTLGNI 
HLQWHDAGCH 
XLCIiIXBAZiF 
VPPTHPFYLS 
TQPSBTYKRK 
XM3KATIIRVG 
LDSKL&QliU 
IMKTQFVSiJE 



LLKPPLLWSH 
PSCECHIiNNL 
RIQGFESVOV 
VFGKAQCNDI 
PSMIVGHATE 
MEDVISIADN 
WGIPVHKSQ 
LPVSKNGNAQ 
LVNETQDIVT 
WKQIXKSQTS 
IjFFWMLMI/31 
DVCWLNWSNG 
KSLItlLTPIiIr 



21 
I 

FLGKNDGIKT 
GLIRIIRAKA 
fiQSVNPCERT 
TQFRNGSIVA 



AAVeSFVQHX, 
ILNSABVTNW 
LKRGY9YQIK 
VNGPVISTVI 
CQCTHLTSFS 
HTRRICMVNt 
LIiAYRlIIiVF 
SKPIOiAFWP 
GLTHGFGIGT 



31 
I 

KKELTVNKKK 
TTDCWaXiNGV 
K2WGTFKXNE 
GYEWGSSSA 
YTLPCSSGYE 
SVIIRQNPST 
TVIVLREBKYA 
MCPQHTSIPI 
QKYSINEVFI, 
ILM9PPVPST 
ALSLLIADVK 
HHMAQHLMMA 
ALAIVAVNFV 
IVDSQNLAKH 
SAKPKFSKPF 



41 
I 

HLGFVEEYQI, 
liQCTCEDSYT 
RFTNDLIiNgS 
SET.T^SAIEEV 
GNITAKCESS 
TVGMLASWS 
SSRLLETLEN 
RGKVLIG6DQ 



IFPWKSf ITY 
FIVGATVDTT 
VQFCLGYGCP 
WLLVUTKIiW 
VTFAIxLNAPQ 
NPLQNKGffYA 



SI 
I 

LLQVTYKDSK 
WFPPSCLDPQ 
SAIYSKYAKG 
AEKAKTALHK 
GttOVIRBTCV 
ILSUIBSL3L 
IBTLVPFTAL 
FQRSLPETII 
QPHCVPWOFS 
VGLGISIGSL 
VHPfiGVCTAA 
LIISVITIAV 
RPTVGERliSR 



fieq ID iro i 176 DMA sequence 

Nucleic Acid Accession AB035089.1 

Coding sequence? 9845. .10215 



GGGCATGCAQ 
CAGTTCTAGT 
CCAAGAGGAA 
TTGGTTTGAA 
AGGAGGAAGG 
GAGACTATTT 
AAGAAAGGAA 
GTAGACAGAA 
AATCTCCTCC 
AATAAAATGT 
AAGGAAGTGA 
GTAATCACAG 
QGATTCCTOG 
CAAGTGTTCA 
TCAGGATATO 
TAGGAGAACT 
CAAGCTT^CT 
TTCAACCTTC 
OCATAGA.TTG 
CCTOCATTCC 
TGGTACCOGA 
CTGGATGCAG 
TTTCCTCTTC 
CTATTCCCCG 
ATATATATTG 
GTGAGAAATT 
CCTATGTGTT 
AAAACAAACT 
1X3TGGAGAAA 
TGQAGAAGAG 
TTTCATTCAT 
ATQAAAGGTC 
ATCAACTTAT 



11 
I 

CCATOGGOGA 
AAAAGGGAGA 
TTAGGGAGAG 
AGCATACAGT 
GAGGCAAGAG 
CCGTCTCTGC 
AGCTAGrXAG 
TCCT1GGGAA 
ACTAACCAGT 
TCTCTTGACT 
GGTCCAGGAA 
GATATTT CCT 
AGOCAATGAA 
TATGCAAAAA 
TCCAGTCTCA 
ATTTAGGAAC 
CTAAATTTAA 
AGGGCAAACC 
GTCCCCTGTA 
caOGATGAQC 
GTAAATCCAT 
ACTCZAGCTGA 
TTTCXTTTTC 
ACTTCAATCA 
ATACAGGAGA 
TCCAGATTGG 
TCTGGCACCT 



21 
I 

AAATCCATAG 
ACATCAATAT 
AGTTATAAGA 
AAATATGATG 
ATAATAT C3VT 
•TTTTCAAACC 
TCTTGTTCTG 
TACAGTAATT 



TTGTTACTTA 
AATCTAGGAG 
GAAAGXGTAA 



TCAGAACXCT 
TCTGGCATTT 
AGGTTTATAC 
ACAACATCAT 
GATTAAAGAA 



CTTCITGGAA 
CACACCAGGA 
AGAAAAAAAT 
GCAAACTCCr 
TCGGTGCCTC 
AOCCOGGITT 
TTGTTGCTTC 
GCTACTCCAA 
GAAGACCATT 
TTCTATCTTT 
GGGAACTTAT 
CCTAAGAAGA 
AAACACAGCT 
TGTTGTAGAT 
GTTAAAAAGG 
ATTCACGGTC 
CCTCAAAATG 
TCAAAAGAAA 
TATTCATAAT 
AATCTGGTCT 



31 
I 

TGCAGATAAA 
AGGATGTTTC 
GATCA3CAAG 
TCTGTCCCTG 
TTTCTCTGTG 
TTACTGGAGT 
AGGTTGTTCA 
GACATATA1T 
ATTGCCACAA 
ACAATGCTGA 
ATATTTCTTA 
TTTCCCATTG 
GTTTATGAAA 
TTTCTGAGTT 
TATGTOCTTT 
GCCTGAAATG 
GGTCATTTTC 
AGACGTTTAG 
GTCTCAGCTT 
TGTCCTATGA 
TAGAGGAAGG 
ATTCATTTTT 
GGATTTTTAG 
ACCTCTTAAA 
GCATGTCTTG 
TCCTTTCTCC 
AAATCTCCCT 
GCCCATGACA 
GGTTGGAATG 
TTAACCTGQA 
TGAAGAAATA 
AGTAAAAGGA 
ATTCATAGAA 



41 
\ 

GCAAGGAGGA 
TTAGCAATAG 
GGGACAGGGT 
GCAGTGTTGG 
CICCAACTGT 
TGTTTTCCCr 
ATGTATACAT 
CTGTEATTTG 
GCACATAATA 
GAAAACTTTA 
ACCAATCTAT 
AGGATTTGTT 
TATCAAGAGA 
CTCTGTGGCA 
CTAGCCTGTC 
ATTTCTCATT 
AGTTAGTACC 
CCATAGTCTG 
GTIATCCTGT 
GACATTAGAT 
TCCATTTTTG 
GGAATTCTTT 
TCCATCAAOG 
CTCATTCAGA 
GGGGTTGAGG 
CATCCAGCCC 
TGACITTGTG 
ATACCAAGTG 
CAGACTTGTG 
TTTACCATAT 
TGCCATGCAA 
TGGAAACAAC 
TGGAATATTA 



51 
I 

AGAAGAAGGA 
AAAAAQAAGG 
TAGATTTGGT 
CAGAGTAGGA 
ACTTACATAT 
CATGAAAACC 
ATCTATATCT 
ATGCTTGAAA 
AGAAACAATA 
CAGCCTTCAT 
AAAGGCATTA 
TTTAATTTCT 
CATAAGTTGG 
ATATATGACA 
TATC!ACATGC 

toaactcatc 
tttccttaag 
aaattctctt 

T ITTJTCTTC 
TCCTTTTCTT 
TCTTATAGCG 
ATCTCAGATA 
COGCATTAGr 
GACTCAAAAC 
AAACAGGCAG 
CTACTTTCAG 
ATGTGCTGAG 
TTGGGGAGAA 
CAGAATTCTA 
□ACCCAGCGA 
AAAAATGTAC 
ACAAATGTCC 
TTGGACCACA 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
19B0 
2040 
210O 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2530 
2640 
2700 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
90O 
960 
102O 
1060 
1140 
2200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
19S0 
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AAAAGGAATG ATGTACTGAT CCATQCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2040 
TTAAAGAAQC CAGTCACAAA AGGACT TACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2X00 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 
AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AG ATCA TGAT 2220 
5 GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGQ 22 BO 

TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340 
TGTCAGGGAG ATATTGG AT T AAATGGCCTT GGACAACAAC CCCTCTOCCT GGCCACAGAC 2400 
ATTCTTCAGA TTACAAGATA TTCCAQGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 
AAACAGAAGG ACCATTGAGA AATGTTGTOA TCCTGACAGG TCAAGCAATT TATTTTTCGG 2520 
10 CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCGATTTAAA ATGGCCCGTC TGTTTCAATT 25B0 
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTG CTGATAA 2640 
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACQTGATT 2700 
GGGGAGGOCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 2760 
AAGATGAGGA AQAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTOAGGCTT 2820 
15 ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATOATTCAAT TACCTCTACC 2 880 

TGGGTCCCTC CAATAACATG TGGAAATCTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940 
GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGGCTATT3C 3000 
AACAGAAAAC CATCTGGOAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 3060 
CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT G CT AT AAQAT 3120 
20 ACTGAAGTAC ACTTCTTCAC TAT CTCTTTG GACTTAGAAT TAGCACTACA TTOCTTGTTA 3180 
TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAB 3240 
GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 
CAAGAAAAGA ATGQTGGGGT TTTTGTTTOT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 3360 
AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 3420 
25 CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 34B0 

TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 354 0 
TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTQA AATCTQAAGT 3600 
TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA ASAGGAGAGG CATAAATTTA 3660 
GGATCTCACC COTCATTCCA CAGACACACA CAOCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 
30 GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 

AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 
TATTCTGCGA GAOCAAAATT AAAAT AC CTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 3900 
CTTAGTTCCA AGTAAACCAA GG CACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 
GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 4020 
35 AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAQTGT AACCAGCAAT 4080 

GAAAGAAAGA TAAAGAGGGT CQATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 
CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 
40 TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 4380 
CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 4440 
CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAQA 4500 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560 
TTTTTCTCTG GTTCCGTCGG CTAGCAJCGCA GATGGTAATA GATGTGGTGG TCIGATGGST 4620 
45 AGCAjCAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATGTT 4680 
TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 
CACAGAGAAC ACCAjCAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 4800 
CAGCTTTAGA TCCCTGAACA GGTCA.TAGTT TAAACCTQGA ACTTCACAAA AACTAAGAAA 4860 
AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 
50 TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980 
CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTOCAQCTT TGCTCTTCAC 5040 
AAATGATGAA TAAOAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 51 OO 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT S160 
GCCTTBCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 
55 TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280 
ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340 
CCTTAATAQT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAQGCTTAO 5400 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 
TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 
60 TTAQCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5560 
AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAG GGCTT 564 Q 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAO TOGTATCCTT GATOACTGTT 5700 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 
AACOCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTQCATC TTTAGTTCAG 5820 
65 AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880 

GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 5940 
□TCCTCAATG AGACTACCAQ CATTTAGGGA CTGATCTAAC AGACTTAGCA. TGGGTTTAGT 6000 
ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 
AGQAAATGTT CATCACCAQT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 
70 ATATGAGCTG AAGATCGCCA ACAAQCTCTT CGGAGAAAAG AGSTATCAAT TTTTAGAGGT 61B0 
AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 
AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 6300 
GTGCATTAGC TCCATTTCCA CAACTCTCCX2 CCACTGGAGT GTCCCAGACC CCAACGATAC 6360 
ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGOTTG TGTAATAGAG 6420 
75 TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 64 B0 

CGATACATCT TGGTGGGAAA TGTATQAjCTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 
GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAOA 6600 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 
GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 6720 
80 CTTTTGCCAA GATOATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780 
TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAOGftGA TACTGCATCT 6840 
ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 
TACATAAATT ACTCCTAATT CCTACTTCT T CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 
GATCAAOAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATQ CTCCAGAAGA 7020 
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AAGTCSAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 70 80 
TTATAGAAAC ACCTTTGAGA AAGCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGQGG 71 4 D 
GAACAGGTGT G3GGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGOGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TOAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 72 60 
5 GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCft. TGACAATAGC 7320 

TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 
AQACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCOTC TCTOGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTQTCTC TGAGCTGGAG 7500 
CTGOCTGGRT CACATCTGTA GCCAATOTGT TCTGCAGGGA TTATCACAGC TCtCTTCCCC 7560 
10 AICAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCTCRTTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAARTGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
15 GTTTATOM3T ATTCCAAATC AGATQATTGG AGACATTCAT ACACAGAGAA COTGAACTCC 7 920 

TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACQACT GGRGCCTTGA 7980 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATXCAAAG ATCAAAGTCA 8040 
GTGAGXCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
20 GAAAAAATTA AAAACC TATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT B220 

GTGAACGCAA TCTATTTCAA AGGGCAQTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TC T AT ATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATQG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 84 00 
TTGTTCATGT C TOT T ATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
25 GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520 
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8SB0 
ACATTOAUTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAOG CCAAGGTCCT aOAAATACCA TACAAAQGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCRG AAGGTAAGAA B760 
30 CTTGCATCTA CAACTCTTOC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 

CAAGGTAAAA GCTTATOACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACFCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTTTTTGTTT GTTTGTTTOT TTT TAT TATT ATTATTATAA TACT TTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
35 CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTOCOCC 9120 
CACCCCACAA CAGTCCTCAG AGTGTQATGT TACCTTCCTO TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTQA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 
40 ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATXGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTAGA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACAIATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCRTA 9660 
45 TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 

OCAAAATAAC GAATCTOCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780 
TABTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTOAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 
TOGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACAOG TTGAGAACCA TGGGAATGGT 9960 
50 GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TQGAGCCACG GTCTCTCRGT 10020 
ATCTAAAGTC CTACACAAQQ CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 
TGCCACCGCr GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TOCCCATAGA TGCAATTAGT CTGTGACTCC ATTTAGAAAA TGTTCACCTA 10260 
55 GAGGTGTTCT GGTAAACIGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 103 00 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTOC 10440 
CTTGAGGAAA AATGTOCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 
. ATATAATTCT GTTTCTGACC TGTTTTAAAT GAAGCAAAGC AAATCATACT TTCTCTTCAA 10560 

60 ATTTAGCAAC CTAGAAACAC ACAITTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAARACAC ATCATCAATA AAATAATGAC ATAAAATGAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATAT OCCAGAACAQ GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
65 CTATTTTATG CTAAAGGGAT AAGAAACTGA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 
GGGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTAOG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 1104O 
GGGAGIGGAA ATOGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAr CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGTTTCT 11160 
70 GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGA1ATTCA ATATAAATAC TTTAGAC3TCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
75 CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 * 
TCCC 

Getj ID UOs 177 Protein sequence 
Protein Accession *h BAB21525.1 

oO 

1 11 21 31 41 51 

I 1 I I I I 

MKTS&SEAHTK PHFDLFQQFR KSKEMNIFYS PISlTSALGM VLLGAKDHTA QQI8KVI.HFD 60 
QVTEJTTEKA ATYHVDRSGN VHHQPQKLIjT EPNKSTDAYB LKIANKIiFGK KTYQFIiQEYL. 120 
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DAIKKFYQTS VESTDRMJAP EESRKKnJStJ VESQTOBKIK NIiFPDGTIGN DTTT.VIiVHAI 
YFKGQWENKP KKEHTKEEKF HPNKNTYXSV QKMKQYKSFN FALLEDVQAK VXjEIPYKGKD 
LSMIVIiPHS IDGLQKLEEK LTAEKLMEWT SLOttMSBTCV DLHLPHFKHE ESYDLTETLR 
TMOMVNIFNG DADLSQMTWS HGLSVSKVLH KAFVEVTEBG VEAAAATAW WELSSPSHT 
EEPCCNHPFL FFIRQKKTNS IIiFXGRFSSP 

Seq ID MO: 178 DWA sequence 

Mudeic Acid Accession #; NH_O01910.1 

Coding sequence : SO .. 12 40 



180 
a*o 
300 
360 



1 
I 

BOAGAQAAGA 
CCTTCTTTTG 
GCCCCTCAGG 
GTTCTGGAAA 
GABTGCCAAG 
TGGCTCCCCA 
CCCCTCTGTB 
GTCCAGCACA 
GTCCGGGATC 
GTTTGGAGAA 
TCTG33CCTG 
GATGGCTCAG 
AGGTGGTGCG 
OCTGAATTGG 
GG'IGGGAGGC 
TTCCCTCATC 
CCCCGTGGAT 
CTTCACCATT 
CGTGGATGGA 



TGGGAATAAC 
CCTQCCTGTC 
GTTATTTTCC 
ACATQAGAAT 
ACTCCCACCA 
TGATTATGAA 
GCAGAGATCA 
CACACGGCCA 
GTACCTGGAT 
TAACATCCTT 
CTGTTTTTGT 
TTTGAAATGT 
TXGAAATCCC 
AGTTAGATCT 
CTTGTTGCAT 
CTQCTGGGTG 



11 

I 

AAGGAGGGGG 
CTGCTGGTQC 
AGGCATCCGT 
TCCCATAATT 
GAACCCCTCA 
CCACAGAACT 
TACTGCACTA 
TACAGCCACC 
ATTGGAGCCG 
AGTGTCACAG 
GGATACCCCT 
AACCTGGTGO 
GGGAGCGAGC 
GTOCCAGTCA 
ACTGTTATGT 



GGAGAATATQ 
AACGGAGTCC 
ATGCAGTTCT 
TGGATCCTGG 
CGTGTGGOAC 
TGACAGACCT 
AGAQAATBTA 
ACACACACAC 
CCGTCATGAT 
AATCAAAAAT 
TGGTATAATA 
GGCCTGTTTA 
CATTCTGAAG 
AAATATACAA 
AGTTGQATTQ 

ctgtaagtct 
gaggtgtcat 
ggaggtctga 

CCTGTCAGGA 
GGTTTGGAGT 



21 
I 

CAAGGGAGAA 
TCCTQGAGCT 
CCCTCAAGAA 
TGGACATGAT 
TCAACTACTT 
TCAC TGI CAT 
GCCCAGCCTG 
CAGGTCAATC 
ACCAAGTCTC 
AGCCAGGCCA 

ACTTSCCOAT 
TGATTTTTGG 
CCAAGCAAGC 
TCTGCTCCGA 
CCGACAAGAT 
CTBTGQAGTG 
C5CTATACCCT 
GCAGCAGTGG 
GGGATGTCTT 
TGGCCCCAGC 
TGAATATGTT 
GCTGTTTCCA 
ACACACATAT 
GGAGGAATTA 
TTTCACATTT 
AATCCCTTTG 
TCTACACTGC 
CAAATTCCGA 
TCGGAATTCA 
TTTGTAT TAG 
CTTTCCATCT 
TTGACATGGT 
TTTTGTGGCA 
GGCAGATAAT 
TCTTGGCTTT 



31 
I 

GCTGCTGGTC 
GGGAGAGGGC 
GAAGCTGCGG 
CCAGTTCACC 
GGATATGGAA 
CTTCGACACT 
CAAGACGCAC 
TTTCTCCATT 
TGTGGAAGGA 
GACCTTTGTG 
GGGAGGAGTG 

AGGCTACGAC 
TTACTGGCAG 
GGGCTGCCAG 
TAAGCAGCTG 
TGCCAACCTT 
CAGCCCAACT 



CATTCGACAG 
AGTCCCCTAA 
AGGCTGGGGC 
GGGTTGCAAC 
ACACACACAC 
GSTTATACAT 
GATTATGAAA 
CAACTCCACT 
TGCCCACTCC 
GCATTACATC 
AGCATCTCCC 
GATTCAAGCA 
ACAGAGTTTA 
TCTCTGAACT 
AAAATACTTC 

AATCATTCAT 



41 

i 

GGACTCACAA 
CAAGGATCCC 
GCRCOGAGCC 
GAGTCCTGCT 
TACTTCGGCA 
GGCTCCTCCA 
AQCAGGTTCC 
CAGTATGGAA 
CTAACCGTGG 
GATGCAGAGT 
ACTCCAGTAT 
TACATGAGCA 
CACTCCCATT 
ATTC3CACTGG 
GCCATTGTGG 
CAAAACGQOV 
AAOGTCATGC 
GCCTACACCC 
CTTGACATCC 
TTTTACTCAG 
GGAGGGGCCT 
ATT CTTTAtZA 
TTGAATTAAG 

Hf*& f*'T l, l , f^ll f ft 

TCATATTTTG 
ATCTCCAAAC 
CAGOCCTGAC 
TCTCTCOIGC 
ATTTTGTCCA 
ATTGTCCCAC 
AGGCGCATAT 
GCACATTTGA 
TATCTTTCCT 
CTAGGTGGTG 
CTCTATTGGT 
TACAAAQTTC 



51 
I 

TGAAAACGCT 
TTCACAGGGT 
AGCTCTCTGA 
CAATGGACCA 
CTATCTCCAT 
ACCTCTGGGT 
AQCCTTCCCA 
CCGGGAGCTT 
TTGGCCAGGA 
TTGATGGAAT 
TTQACAACAT 
GTAACCCAGA 
TCTCTGGGAG 
ATAACATCCA 
ACACAGGGAC 
TTGGGGCAGC 
CGGATGTCAC 
TACTGGACTT 
ACCCTCCAGC 
TCTTTGACCG 
TGTGTCTGTG 
CCTACAAAAA 
ACCAAACAGA 
GATACACACC 
TATTGATTTT 
ATATGCACAA 
AACCCATCCA 
TCCBCATGCT 
TAAATATTTC 
AAATGTTTGG 
AT TG CATTTA 
ACGTTGCTGG 
ATAAAATGGT 
CTGGGTACTT 
AATGTTAAGA 
AGCATTTT 



51 



Seq id NOs 179 Protein sequence 
Protein Accession ft* np_oai90l.l 

1 11 21 31 41 

I 1 ] I I [ 

MKTLUbLLLV LLELGEAQGS LHRYPLRRHP SLKKKLRARS QL>SEFWKSHN LDMIQPTESC 
8MDQSAKEPL INTTLDKEYPG TISIGSPPQN FTV1FDTG6S HLWVPSVYCT QPACKTHSHF 
QPSQSSTYSQ PGQeFBIQYG TBSLSBIIQA DQVSVBGLTV VGQQFGESVT EPGQTFVDAE 
FDG1LGLGYP 3 1 ■ AVGGVTPV FDNTOIAQTCLV DLPMFSVYMS SNPEGGAGSE UFGGYDHSH 
FSGSLNWVPV TKQAYKQIAIj DNIQVGGTVM FCSSGOQAIV DTGTSI*ITGP SDKJKQLQNA 
TGAAPVDGEY AVECAKIiKVM PDVTPTINGV PYTJjSPTAYT LLDFVEGMQF CSSGPQSliDI 
HPPAGPLWH, GDVFXRQFYS VPDRSNS&VG LAPAVP 

Seq ID HO: 180 TOJA g e queue e 

wucleic Acid Accession #: «H_Dl&05a.i 

Coding aequence* 319. .1575 



1 
I 

TACQCGCTGC 
GACGGCCGGG 
TACACCBACA 
GTCAACGTGG 
AGAAAGGGCT 
CCTCATGCCC 
CTCAGAGATG 
GTGOacCCCA 
AACTTCCTTT 
GTGGACOACC 
AAAGTGGACA 
ACCCATGGGA 



AACATTOOCT 
GGAGACCCCC 
ACAGGGGGTG 



11 
I 

GGGACCGGCA 
AGGAGATCTA 
AGTTGTTCAA 
CCCGTGGTGT 
CTGGACGCTA 
TCATTGAAAT 
TGGCTGCTGA 
TCCTCAGCAG 
TCCACAACCG 
CCCAOCAGCA 
TCGTGTATGG 
AGGTOGGCTT 
TCATCACCGC 
ACCGCAGCTC 
TCATCGAGGA 
TGGTGACCQA 



21 
I 

GGGGAACGCC 
CTTCCTCAAC 
GTTCCGCAAT 
GGCCAGCCTC 
CTCTATCTAC 
GGACCGTGAG 
GGCTGGGGTC 
CAGTGCCTO? 
GGGCGATGGC 
TGGGCGAGGT 
CAACTGGAAT 
CCGGGACATC 
CGACTTTGAC 
CrCAOCCAAC 
GCTCAATOOC 



31 
i 

ATCGGGGTCA 
ACCAATAATG 
AACCGGTGGG 
TXTGCCGGAC 
ATTGCCAATT 
GCCAGTGACC 
AGCAAATATA 
GATATCTTCT 
ACCTTTGTGG 
GTCGCCCXGQ 
GGGCCCCACC 
GOCTCACCC3V 
AATGACCAGG 
CGCCTCTTCC 
GGC3QAOGCCT 
GACGGGATGC 



41 
I 

CAGCCTGOQA 
CCTTCTOSGG 



ACQCCTACGG 
TCTCCQGGGG 
CAGGGGGCXX3 
GCGACAATGA 
ACGCTGCGOC 
CTGACTTCAA 
GCCTCTATCT 
AQTTCTCCAT 
AGCTGGAGAT 
GCGTCATCCG 
TGGAGCC1GA 
TGGACCTCAT 



€0 
120 
180 
240' 
300 
360 
42 0 
480 
540 
€00 
660 
V20 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



60 
120 
180 
240 
300 
360 



SI 
1 

CATCGACGGG 
GGTGGCCACG 
GAGCGATGAG 
CTGTGTGGAC 
TAATGTGGGC 
CATTCTGGCG 
AGGCGTCAGC 
GAATGGGCCT 
CAOTQCTGGT 
CCGTGATGGC 

gcaaatgagc 
CttCttcaag 

TAGAGAGCAC 
GGGOCGGGGC 

cttgtcccat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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GGAGAJSTCCA 
TGGCTGOGAG 
CTCTACACCA 
TGTGAGATGG 
GTGACGTGGC 
GTGCTGGAGA 
ACACCAATGA 
ACAC CTATGG 
ACGAGGATGG 
CCCCCACCGC 
CACCGGTCCT 
CCAGCTGCTG 
AAGTGGGCTT 
CCCAAGCCCA 

ATTCCAGTGG 
CTGCACAGGA 
AAAGCTATGT 
AAATGGGGAT 
GACACTTGGC 
GGGCTTTGTC 



TGGCTCAGCC 
TGGTGCCACG 
AGAAGAGTGG 
AGCCCGTGGC 
CAGATQGCAA 
TCCTCTACCC 
ATOCATCCAG 
AAGCTACAGG 
CACAGCCTGC 
TGCTGCTGCC 
CGTAGATGGA 
AGCAGGGGTG 
GTGCTGCTGC 
TCCATGCACA 
CACATAGCTQ 
GTCTAATGAC 
AGTATGAGGA 
GACCTTACAC 
TAAGAATAGA 
ACAAAACCTG 
AACACGTG 



GCTGTCCGTC 
CACCCGGGTT 
GGCCCACCTG 
ACACTTTGGC 
GATGGTCAGC 
CCGGGATGAG 
TTOCCATTCG 
TGCCGGACCA 
GTGGGGACTC 
ACTGCCGCTG 
GATCTCAATC 
GGACATGAAC 
CTAGACAGTA 
TTACTTAGCT 
TGATCACAGC 
CATATCTTAG 
CTTTAGTGXC 
CAGTCACTTA 
ATCTTGGGGT 
GCACATAGTA 



TTCCGGGGCA 
GGGGCCTTTG 
AGGATCATCG 
CTGGGGAAGG 
CGXSAACGTGG 
GACACACTTC 
TGTGCCCTCG 
ACAAGAAGTG 
TOGGCCAGTC 
CTGCTGCCGC 
TGGGGTCGGT 
CAGCGGATGG 
GGGATGTAAA 
AACAATTAGG 
AGACAGGGTC 
OACACAGATG 
CTGAGTTCAA 
ACTTGTTAGC 
TAGTGTGGAG 
AAGGCTCAAT 



ATCAQGGCTT 
CCAGGGGAGC 
ACGGGGGCTC 
ATGAAGCCAG 
CCAGCGGGGA 
AGGACCCAGC 
AG ACAAGCC C 
CAGTCGGGGC 
ACCGGGCCCC 
TGCTGQAGCT 
GGTTAAGGAG 
AGTCCAGCAG 
GGCCTGGGAG 
GAGACTCGTA 
□CTGGCCTGA 
TGCCCAGGGA 
ATCCTGATXC 
CAT CCATTAT 
ATTAGATTAA 
AAAAACAAGT 



CAACAACAAC 
TAAGGTCGTG 
AGGCTACCTG 
CAGTGTGGAG 
GATGAACTCA 
CCCACTGGAG 
GTATGTGTCA 
TACGAGCCCA 
CGCCCCACCA 
GCCACTGCTG 
AGCTGCGAGC 
GGGAGTGGGA 
CTAGACCCTC 
AGGCCAGGCC 
TGGCGCTTAC 
GGTGGTGTCA 
AGGAACTCAC 
CGCATCTGCA 
ATGTATGTAA 
GCCTCTCACT 



L020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
19B0 
2040 
2100 
2160 



Sag ID NOi 181 Protein sequence 
Protein Accession #: NP_060528.1 



1 11 21 

1 I 1 

MDPEASDLSR GILALRDVAA EAGVSKYTGG 
RGDGTFVDAA ASAGVDDPKQ HGRGVAIiADF 
FRDIASPKFS HPSPVRTVIT ADFDNDQELE 
KLNPGBALEP EGRGTGGWT 0FDGD3MLDL 
RTKVGAFARG AKVVI/YTKKS OAHLRIIDGG 
KMV9HNVASO EMNSVIiEILrY PRDEDTLQDP 
GAGPTRSAVG ATSPTRMAQP AWGLSABHRA 



31 
I 

RGVSVGPIfcS 
WRDGKVDrVY 
IFENWIAYRS 
ILSUGESMAQ 
SGYLCEMKPV 



41 

I 

SSASDIFCDM 
GNWNGPHRIiY 
SSANRLFHVI 
PLSVFRGNQG 
AKFGLGKDEA 



51 



ENGPNFLFBW 
LQMSTHGKVR 
KREHGDPIjXE 
FNNNWLRWP 
SSVEVTWPDG 
FYVSTPMEAT 
PAPPPPPlOJi PIjPI iT it iPUjE LPLLHR3S 



Saq ID NO: 182 DMA sequence 
Nucleic Acid Accession ft: AJ279016 
Coding sequence: 1 . .1962 

1 11 21 31 41 51 

I 1 1 I 1 I 

ATGTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 

cagcgogctg aacccatgtt cactgcagtc accaactcag ttctgcctcc ioactatgac 

AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 

tttgagatcg tcgtggcggg gtacaatgga cccaacctgg ttctgaagta tgaccgggcc 
cagaagcggc tggtgaacat cgcggtcgat gagogcagct caccctacta cgcgctgcgg 

GACCGGCAGG GGAAOGCCAT CGGGGTCACA gcctgcgaca TCGACGGGGA CGGCOGGGAG 
GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 
TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 
CGTGGTGTGG CCAGCXri'C.TT TGCCGGACGC TCTGTGGOCT GTGTGGACAG AAAGGGCTCT 
GGAC3SCTACT CTATCTACAT TGCCAATTAC BCGTACGGTA ATGTGGGCCC TQATGCCCXC 
ATTGAAATGG ACCCTGAGGC CAfftGACCTC TCCCGGGGCA TTCTGGCGCT CAGAGATGTG 
GCTGCXGAGG CTGGGGTCAG CAAATATAC3V GGGGGCGGAG GCGTCAGCGT GGGCCCCATC 
CTCAGCAGCA GTQCCTCGGA TATCTTCTGC GACAATGA0A ATGGGCCTAA CTTCCTTTTC 
CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGAGCCC 
CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAAGG GTGATGGCAA AGTGGACATC 
GTCTATGGCA AGTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC CCATGGGAAG 
GTOOGCTTCC GGOACATCGC CTCACCCAAG TTCTOCATGC OCl'CCCCTGT CCGCAOGGTC 
ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATSGCCTAC 

CQCAGCTCCT cagccaaccg cctcttccgc gtcatccgta gagagcacgg agaccccctc 

AT CG AG GAG C TCAATCCCGG CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 

GTGACCGACT tcgacggaga ogggatgctg gacctcatct tgtcccatgg AGAGTCCATG 

GCTCAGCCGC TGTCCGTCTT CGGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 
GlGCCACGCA CCCGGTTTGG GGGCrrTGCC AGGGGAGCTA AGGTCGTOCT CTACACCAAG 
AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGQCTCAG GCTACCTGTG TGAGATGGAG 
CCCGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTOGAGGT GACGTGGCCA 
GATGGCAAI3A TGGTGAGCCG GAACGTGGCC AGCGGGGAGA tgaactcagt GCTGGAGATC 
CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 
TTCTCCCAGC AGGAAAATGG CCAT TGGATG GACACCAATG AATGCATCCA GTTCCCATTC 
GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 
AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGOCTG CGTGGGGACT 
CTCGGCCAGT CACCGGGOCC OCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCGCT 
GCTGCTBCCG CTGCTGGAGC TGCCACTGCT GCAOOGGTCC TDGTAGATGG AGATCTCAAT 
CTGGGGTCGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAGCAGGGGT GGGACATGAA 
CGAGOGGATG G AGT CCAGCA GGQGAGTGGG AAAGTGGGCT TBTGCTGCTG CCTAGACAGT 
AGGGATGTAA AGGCCTGGGA GCXAGACCCT CCCCAAGCCC ATCCATGCAC ATTACTTAGC 
TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 
CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATOA CCATATCTTA 
GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 
CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCAGTCACTT 
AACTTGTTAG CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 
TTAGTGTGGA GATTAGATTA AATGTATGTA AGACACTTGG CACAAAACCT GGCACATAGT 
AAAGGCTCAA TAAAAACAAG TGCCTCTCAC TGGGCTTTGT CAACACG 



60 
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Seq ID NO; 183 Protein sequence 
Prohein Accession ft: CAC0B4S1 
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i 
I 

MSHMIiPFLLL 
FEIWAGYNG 
BIYFLNTNNA 
GRY5IYIANY 
LSS6ASDIFC 
VYGNWNGPHR 
RSSSANRLFR 
AQPEiSVFRGN 
PVAHFGLGKD 



I/3QSPGPRPT 



11 
1 

LWPLPITEGS 
PNL>VLKYDRA 
FSGVATYTDK 
AYGNVGPOAL 
DNENGPHPIjF 
liYLQMSTHOK 
VIRRKHGDPL 
Q3F5JNNWIi&V 
EASSVEVTWP 
DTNECIQFPF 
TPTAAAATAA 



21 
I 

QRAEPMFTAV 
C4CRLVNIAVD 
LFKFKNNRWE 
IEMDPSASDL 
HNRGDGTFVD 
VRFHDIASPK 
lEEDNPGDAIi 
VPRTRFGAFA 
DGKMVSRNVA 



31 
I 

TNSVLPPDYD 
ERSSPYYALR 



SRGILALRDV 
AAASAGVEDP 
FSMPSPVRTV 



RGAKWLYTK 



AAAAAGAAXA 



NTYGSYRCRT 
APVLVDQDIiN 



41 
I 

SNPTQLNYGV 
DRQGNAIGVT 
RGVASLFAGR 
AAEAQVSKYT 
HQHGRGVALA 
ITADFDNDQE 
VTDFDGDGML 
KSGAHLRIID 
ZjYFRDEDTLQ 
NKKC8RGYEP 
IjGSWKESCE 



SI 
I 

AVTDVDHDGD 
ACDIDGDGRE 
SVACVDRKGS 
GGRGV9VGPI 
DFNRDGKVDI 
I,EIFFKHIAY 
DLIIfSHGBSM 
GGSGYLCEME 
DPAPLECGQG 
JJEDGTACVGT 
PSC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
€00 



Seq ID NO« 184 DNA sequence 
Nucleic Acid Accession # 
Coding sequence: 1 . .4794 

1 11 21 31 41 SI 

1 1 1 ! I 1 

ATGGCGTGTC CGGGAGGACT CCCAGCCCGT TGCTCTGGTT GGATGGGACT 
AGCGGCTCCT CCCCAGCATC CCCTCCCCAT TCCTCCTCCA GGTACAATGG ACCCAACCTG 
GTTCTGAAGT ATGACCQGGC CCAGAAGCGG CrGGTGAACA TCGCGGTCGA TGAGCGCAGC 
TCACCCTACT ACGCGCTGOG GGACCGGCAG GGGAACGCCA TCGGGGTCAC AGCCTGCGAC 
ATCGACGGGG ACGGCqHGGA GGAGATCTAC TTCCTCAACA CCAATAATGC CTTCTOGGGC 
CACAGCAGCT CAGCGCAGGT CCCTTCTGGG CTCCACAGAA ACAQGCGTGT GCTGAAGCCT 
CCACCTACAA CCCCTaCAGG CCTCCTGGGT CTGOCTCCAC TCAGCGGAAG GGACTTTTCC 
TCCTOCCTGG GTCAGGCTTC TCCGGACAGC A3GCAGGGAG AGAGGGTQCC GGTTCCCTGC 
TGTCGGGGTG GACTGAGACC TACCCATGAA CCAGAACCAT TTCTTCTGAG ACCCAAATCA 
GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA ATAACCGGTG GGAAGACATC 
CTGAGOGATG AGGTCAACGT OGCCOGTGGT GTGGCCAGCC TCTTTGCCGG ACGCTCTGTG 
GCCTGTGTGG ACAGAAAGGG CTCTGGAOGC TACTCTATCT ACAXTGCCAA TTACGCCTAC 
GGTAATGTGG GCCCTGATGC CCTCATTGAA ATGGACCCTG AGGCCRGTGA CCTCTCCCGG 
GGCATTCTGG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 
TTCTCCCACA CTGCCTCTCC AAGCATTGGT GAGATATCTG GCAGAACOGA GGAGCGGGAA 
GGAGGAGACC CAGAGGAGGC AGATGAGGAG CACAGTGGGG ATGGAAGCAC CAGOCAACTG 
TGCCGGCTGG GCTGGAAGGA CGGGCAGTTC AAGGAAGAAG CAGCAGCTTT GGTGGAGGAA 
CAGAGGGAGG CTGGGGCAGC TGGCGTGCCC AGAGGACGTQ TTCGAACAGC XCTGCAGACT 
TCCAAAAGCC ATTTGGCTGA CAAGAACCTA TTTGGCCCAC CATGTTACTA TTCTOTCTGC 
GCGCCTTCTC CAGCCCACCC TTTCCCTGCC CGCCAAGCCC CCCAACACTA CCCTGTAGCC 
CCGCTTGTCA CT CAG CT ART GACACATGGA CGTCTGGGTG GAAAACTAGC CCGGAGTGTC 
CCCCACCCCC GAGCCCCAGG AATGGACCCC AAATGTAAGG GCCGCCATGC TOAGCCCGGC 
CTGATGGCTG AGGCTTTGGG CGOGTGGCCA GCGCTCAGCA CCACTGTGGT GCCAGGGGGC 
CTGAGAAGCT GGGAGGAAAG CAGGCAGAAG GGGC&GGCCA TGTCCA.GATT3 TQCACTCAGG 
GAGCTGGGAG GTCCCTGGAG CCAAGCCACA CaGGACCtGC CTGCTAGAGA GCTGTATGAC 
CTGGGAGAAC CTCCCATTTT ACAAAGAACA GACGGAGATC CAGGGAGQAG AAGGGACTCG 
CCCAAGGTCA CACAGGAGTG CCATCTAGTG GCCACCATGC CAGCTCTCGG GGGACTCGAG 
GGCCCCGGGA GGGTGGCCAA GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA 
CTCTCCCATC CCCTGGTCCC CAACTTCCCC AGCTGCTTGA GGOCTCTTGA AGCCGOGACA 
GTGCCGGGAG CTGOCCTGCC TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCASGGCC 
CTGGCGTGGA ACCAGATGGA AAAAGAGGAG GGGAAGATTC ATGGAGACCA TGAGCCCAGA 
TTTAGGCTCA GGAAAGCACG GGAAGCAGAA TIXXCCCCAG GCTCCTCIGA GGAGCCTCTG 
CTGCAGTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCT CC AGGTGGGCCT GGGGCTTGCT 
TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GAGGDGTCAG CGTGGGCCCC 
AlCCTCAGCA GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGGC TAACTTCCTT 
TTCCACAACC GGGGCGATGG CACCTTXJ3TO GACGCTGCGG CCAGTGCTGA ACGTCGTTTA 
GOCTTCATCG TTCACCTCAA ATATCACCTC TGCAGAGATT TTCXTCACTC CCTGTGCCAC 
CTAGCAGAAA CTGOTCCTTC CTCCTCCTGC TGCCCGTGGC ATGCACGTCT TCTTCAGGCT 
CCACATTGCC ATCATGGTTT GTCTATGAGC TTTACAAGGA CCGGGTCACG GITCTATTCA 
MCTTGACGC AAGGCTTGGC CTCCAGTGCC CACCGGAGGA CACTCAGCCT OCAGGGTTCT 
CAGGGGGCCC CAOCCTGCCT TCTGGCAAGA GCTCCCTGTG TCCTGGGGTC TCTGATCCOC 
ACXGCCEAIT ACATTGTCCT GTGGTCTGCC ATOCCAGAGA GCCTGATGAC CCACAGCTAT 
TTGTCCTCTG AAAGAGTCAA CGTGGGTGTG GAGGACCCCC ACCAGCATGG GGGAGGTGTC 
GCGCIGGC1G ACXTCAAOGG TGATGGCAAA GTGGACATCG TCTATGGCAA CTQGAATGGC 
CCCCACOGCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCGG GGACATCGCC 

xcacccaagr tctccatgcc ctcccctgtc cgcacggtca tcaccgccga ctttgac3uvt 
gacgaggagc tggagatctt cttcaacaac attgcctacc qcaqctcctc agccaaccgc 
ctcttccgat gctccatoct ggctcgtggc tcttcatcct tgacagctgg tgggaggaac 
ggtcagggag aaggtttaag aatcagaagg ggagggttoc cagggccagg gggtcaggcc 
aaggtcaaca caqqtcccct gatgaagaaa cagaaaggaa ggaaggacga ggactgggga. 

AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTIGCAGGG 
AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCG AAGCGCCACA AGATACAAAG 
CCACACTRCC ACAAAAAGGG GCTACAGGGT CGAATCACTA CGAGGAAAAG GGGCTACGGG 
GTCC31ATCAC TACCAGGAAA AGOOGCXACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 
CGGGGTCCAA TCACT AC CAG GAAAAGGGOC TACGGGGTCC AATCACTACC AGGAAAAGGG 
GCTACQGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 
AGGGGCTAGQ GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGXCCAA TCACTACCAC 
AGAAAGGGGC TACGGGCXCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 
CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 
ACTACCAGGA AAAGGGGCTA CGGGGTCCAA TCACTACCAG GAAAAGQG3C TACGGGGTCC 
AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 
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GTCCAATCAC 
CGGGGTCCAA 
GCTATGGGGT 
AGGGG CT ATG 
AGAGAGCACQ 
GGCCGGGGCA 
TTGTCCCATG 
AACAACAACT 
AAGGTCGTGC 
GGCTACCTGT 
AGTGTGGAGG 
ATGAACTCAG 
CCACTGGAGT 
GAATG CAT CC 
GGAAGCTACA 
GGCACAGCCT 
CCCAAAAAGG 
CCGGGTTGCC 
CAGAAAGCTC 



TACCACAGAA 
TCACTACCAG 
CCAATCACTA 
GGGTOCAATC 
OAOACCCCCT 
CAGGGGGTGT 
GAGAGTCCAT 
GGCTGCGAGr 
TCTACACCAA 
GTGAGATGGA 
TGAGGTGGCC 
TGCTGGAGAT 
GTGGCCAAGG 
AGTTCCCATT 
GGTGCCGGAC 
GCGTGGGTAC 
AGCTGCAACT 
GGCTGCTCCT 
CAGGTATTCC 



AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCACAG 
CATCGAGGAG 
GGTGACCGAC 
GGCTCAGCCG 
GGTGCCACGC 
GAAGAGTGGG 
GCCCGTGGCA 
AGATGGCAAG 
CCTCTACCCC 
ATTCTCCCAG 
CGTGTGCCCT 
CAACAAGAAG 
TGAGCTAGGC 
TTCCCAAGGC 
CAAAAGAGCT 
AGAAGCCCAA 



GGCTCCAATC 
TACGGGCTCC 
GGGCTACGGG 
AAAGGGGCTA 
CTCAATCCCG 
TTCGACGGAG 
CTGTCCGTCT 
ACOCGGTTTG 
GCOCACCTGA 
CACTTTGGCC 
ATGGTGAGCC 
CGGGATGAGG 
CAGGAAAATG 
CGAGACAAGC 
TGCAGTCGGG 
TCTAGGCATA 
ATCTGCACCC 
CAGCTOCRGG 
GTGTATGAAC 



ACTACCAGGA 
AATCACTACC 
CTCCAATCAC 
CGGGGTCCAA 
GCGACGCCTT 
ACGGGATGCT 
TCCGGGGCAA 
GGGCCTTTGC 
GGATCATCGA 
TGGGGAAGGA 
GGAAOGTGGC 
ACACACTTCA 
CJC CAT TG CAT 
OCGTATGTGT 
GCTACGAGCC 
CAATGACGTG 
CCGTCTGGTC 
CTGCTCCCAG 



AAAGGGGCTA 
AGGAAAAGAG 
TACCAGGAAA 
CGTOVTCCGT 
GGAGCCTGAG 
GGACCTCATC 
TCAGGGCTTC 
CAGGGGAGCT 
CGGGGGCTCA 
TGAAGCCAGC 
CAGCGGGGAG 
GGACCCAGCC 
GGACACCAAT 
CAACACCTAT 
CAACGAGGAT 
GAAAOCAAGG 
CTTTTTCCTG 
CACCCTTCTC 
ATAA 



3720 
3780 
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Sett. ID NO t 185 Protein sequence 
Protein Accession. 



1 
I 

MACPGGLPAR 
SPYYALRDRQ 
PPTTPAGTiLG 
GVATYTDKLF 
GNVGPDAJjIE 
GGDPEEADEE 
SKSHIiADKNXi 
PHPRAPGMDP 
HLGGPWSQAT 
GPGRVAKREI 
LAHHQMBKEB 



11 
I 

CSGWMGTiGGP 
GNAXQVTACD 
LPPIiSGRDFS 



21 
\ 



IDGDGREEIY 



MDPEASDIiSR 
KSGDQSTSQli 
FGPPCYYSVC 
KCKGRHAEPG 
QHLPAHKT-iYD 
GRETGAVGRP 
GKIHGDHEPR 
LGGSGVSVGP 



AFIVHLKYHL 
FLTQGLA9SA 

i.eefiaviiVGV 

3PKPSMP3FV 
GQGEQLRIRR 
KGKGNVAQSV 

RXGLRAPITT 
NHYQEKGDOG 
AMGSNHYQEK 
GRGTGGWTD 
KWIiYTKKSG 
MNSVLEILYP 
GSYRCRTKKK 
PGCRLLLKRA 



HRRTLSLQGS 
DDPHQHGRGV 
RTVITADfDH 
GGFPGPGGCjA 
PRTQAPQDTK 
YGVQSLPGKG 
RKRGYBVQSIi 
PITTRKRGYR 
GIiRAPITTRK 
FDGDGMLDLI 
AHLRIIDGGS 
RDEDTLQDPA 
CSRGYEPMED 
QLQAAPSTIili 



LSDEVNVARG 
aiEAIiRDVAA 
CRLGWKDGQF 
APSPAHFFPA 
LMAEALGAWP 
LGEPPI1QRT 
L8EPL.VPNFP 
FRIiRKAHSAS 
1LSSSASDIF 
LAETGPfifiSC 
QGAPPCUjAR 
AIiADFKRDGK 
DQBLBIFEHBT 
KVNTGPLMKK 
PHYHKKGLQG 
ATG 3J3HYQE K 
PGXGATGSNH 
VQSLPOKGAT 
RGYGVQSLPQ 



31 
] 

SSSHYNGPNL 
FU3TMHAFSG 
RQGERVPVPC 
VASbFAGRSV 
EAGVBKYTEG 
KEEAAAttVEE 
RQAPQHYPVA 
AbSTTWPGG 



41 
I 

VLKYDSAQKR 
HSSSAQVPSG 



ACVDRKG9GR 
FSHTAfiPfilG 
OREAGAAGVP 
PLVTQLMTHG 



51 
I 

LVNIA.VDERS 
IiHRNRFVLXP 
PEPFLLRPKS 
YSIYIAMYAY 



SCLRPIiEAGT 
FPPGS9EEPL 



PKVTQECHLV 
VPGAAU?GNP 
I/QFPSGLRGS 
FHNRGDGTFV 



GYLCBMEPVA 
PLECGQGFSQ 
GTACVGTELG 
QKAPGIPEAQ 



CPHHARUjQA 
APCWLGSIilP 
VDIVYGNWWG 
IAYRS9SANR 
QKGRKDKDWA 
PITTRKRGYG 
GLQGPITTRK 
YQEKGT*RGPI 
GSNHYQEKGL 
KGATGSNVIR 
USVFRGNQGF 



RGRVRTADOT 
RLAGKLAR3V 
GQAMSRCAIiR 
ATMPALGGLE 
GBJWVLDMAKA 
PVLQVGLGIiA 
BAAASAERRL 
FTRTGSRFYS 



TAYYIVXMSA 
PHRIiYDQMST 
LFRCSXLARG 



HGKVRPRDIA 
gaSLTAGGEN 
AKEFASAIAG 



VQSLPGKGAT 
RGYGLQ9UPG 
TTRKRGYGTLQ 
RGPITTHKRG 



YGLQSLPGKE 



OSMGHCMDTN 
SRHTMTHKPR 
VYEQDQE 



NNHWlfiWPR TRFGAFARGA 
fiVEVTWPDGK KVSENVASGE 
ECIQPPFVCP RBKPVCVNTY 
PKKELOJLSQG ICTPVHSPFL 



fieq ID WO: 186 DMA, sequence 

Nucleic Acid Accession NM_000584.1 

Coding sequence = 75.. 374 

1 11 21 31 

I I I I 

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGOAAGAA 
CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT 
CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGTGC 
GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT 
AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA 
TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGX 
CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA 
GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT 
OCAGATGCAA TACAAGATTC CTGGTTAAAT TTGAATTTCA 

cattgtacca tgaaatatcc agaacatact tatatgtaaa 
caaaaaacaa caaataattt ttaaatataa ggattttcct 
tacaaatagc aaaattgagc caagggccaa gagaatatcc 
gaatgggxtt gciagaatgt gatatttgaa gcatcacata 
tttgccataa agtcaaattt agctggaaat cctggatttt 
cctagtctgc taggcaggat ccacaagtcc ttgttccact 
tttctaaqtg gaaaaagtat tagccaccat cttacctcac 
tggaagcact ttaaottttt tcatcataac ataaattatt 
ctatttatta tttatgtatt tatttaagca tcaaatattt 
agaagatgaa tcattgattg aatagttata aagatgt tat 
gatattaaat gatgttttat tagataaatt tcaatcaggg 
AjCAATTGGGT ACCCAGTTAA attttcattt cagataaaga 
agtacattat tgtttat ctg aaagttttaa ttgaactaac 
agtcttgtca ttgccagctg tgttqgtagt gctgtgttga 

AAC T ATT AAA ACAGCCAAAA CTCCACAGTC AATATTAGTA 
TGTTTATTAT GTACAAATAG AT TCTTATAA TAT TATT TAA 



41 
I 

ACCAGCGGAA 
CTTGGCAGCC 
TAAAGAACTT 
CAAAGAAGTG 
gctttctgat 
gqagragttt 

GAATCAGTGA 
□TGGGTCTGT 
GTAAACAATG 
GTATTATTTA 
AGATATTGCA 
GAACTTTAAT 
AAAATGATGG 
TTTCTQTTAA 
GTGCCTTGGT 
AGTGATGTTG 
TTCAAGTGTA 
GTGCAAGAAT 
AGTAAATTTA 
TTTTTAOATT 
ACAAATAATT 
AATC5CTAGTT 
ATTACGGAAT 
ATTTCTTGCT 
ATGACTGCAT 
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51 
1 

GGAACCATCT 
TTCCTGATTT 
AGATGTCAGT 
AGAGTGATTG 
GGAAGAGAGC 
TTGAAGAGGG 
AGATGGCAGT 
TGTAGGGTTG 
AATAGTTTTT 
TTTGAATCTA 
CGGGAGAATA 
TTCAGGAATT 
GACAATAAAT 
ATCTGGCAAC 
TTCTCCTTTA 
TGAGGACATG 
ACTTATTAAC 
TTGGAAAAAT 
TTTTATTTTA 
AAACAAAGAA 
TTTTAGTATA 
TGATACTCCC 
AATGAGTTAG 



TTTTAAATAC 
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AAGGCTTTAT ATTTT T AAC T TTAAGATGTT TTTATGTGCT CTCCAAATTT TTTTTACTGT 
TTCTGATTGT ATGGAAATAT AAAAGTAAAT ATGAAACATT TAAAATATAA TTTGTTGTCA 
AAOTAAAAAA AAAAAAAAA 

Seq ID NO: 18*7 Protein sequence 
Protein Accession NP_000575-1 

1 11 21 31 41 SI 

I I I ! 1 1 

MTSKLAVAIOj AAFIiISAAIiC EGAVLPRSAK ELRCQCIKTtf SKPFHPKFIK ELRVIESGPH 
CANT3IIVKL 6DGRELCLDP KENWVQHVVE ECFIiKRAEWS 

Seq ID NO: 188 DMA sequence 

Nucleic Acid Accession ft: NM_Q03G61.1 

Coding sequences 1..1152 



1560 
1620 



1 
I 

ATGAGTGGAC 
CAAAACGTTC 
GCTGCTGGCA 
AAGGAAAAAQ 
GGATTGGTGG 
□ACAACCTTG 
TACAGAAACT 
AGAAGGCTCC 
AATGTGGTGT 
CTGGCACCCT 
ATCACAGCDG 
ACACAAGCCC 
GAGTTTTTGQ 
ACACGAGGCA 
GTACCGCAT0 
GAACAGGTGG 
ACGGATGTGG 
TCAAAGCACT 
CftGGAOCTOO 
CAAGAACTGT 



11 



CAAGTGGGAC 
CCATGGAGOC 
TGAGCACACA 
CTGCTGCTQA 
CAAGACAAAT 
GGTTTCTOAA 
GTGCCCTTGC 
CTGOCTCTCT 
TCACAGAGGG 
CTTTGACCGG 
AAGCCCACGA 
GTGAGAACAT 
TTGGGAAGGA 
CCTCAGCCTC 
AGAGGGTTAA 
CCCCTGTAAG 
TACATGAGGG 
AGOAGAAGCT 
GA 



21 
1 

TGTGQGAGTG 
AGAXACTGGA 
AGAGAQCAGT 
GAATCTGCTA 
ACTGCCCAGG 
GATCATGAAA 
AGAGTTTCCT 
AGATGGGGTT 
CAGCATTTOC 
AGGCAGCCTT 
GATTACCAGC 
f2CTGGTCATC 
ATCCAACTTT 
CATCCGTGOC 
ACGCCCCCGG 
TGAACCCfiGC 
CTTCTTTCTT 
G3CAAAGTCA 
AAACATTCTC 



31 
I 

AGGGCAGAGG 
GATCCTCAAA 
ATCTTTATTG 
CTCCTGCTGA 
AATGAGGCAG 
GACAAAAACT 
CGGTTGAAAA 
CAGAAGGTCC 
TCTGGCATCC 
GTACTCTTQG 
AGTACCATGG 
AAAAGCCTTG 
CTTTCCTTA6 
CTCAGACGAG 
GTCACTQAGC 
ATCCTGGAAA 
GTGCTGGATG 
GAGACAGCTG 
AACAATAATT 



41 

I 

AAGCTGGAGC 
GTAAGCOCCT 
AGQATGCCAT 
CTGATAATGA 
ATGAGCTGCG 
GGCACGATAA 
GTGAGCTTGA 
ACAAAGQCAC 
TGACCCTCGT 
AACCTGGGA1 
ACTACGGAAA 
ACAAATTGAA 
CTGGCAATAC 
CCAGAGCCAA 
CAATCTCAGC 
TQAGCAGAGG 
TAGTCTACCT 
AGGAGCTGAA 
ATAAGATTCT 



51 
I 

GAGGGTGCAA 
CGGTQACTGG 
TAAGTATTTC 
GOCCTGGAAC 



AGGCCAGCAG 
GGATAACATA 
CACCATCGCC 
CGGCATGGGT 
GGAGTTGGGA 
OAAGTGGXGG 
GGAGGTGAGG 
TTACCAACTC 
TCTTCAGTCA 
TQAAAGCGGT 
AGTCAAGCTC 
CGTGTACGAA 
GAAGGTGGCT 
GCAGGCGGAC 



60 
12 0 
180 
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300 
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420 
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Seq 3D MOi 189 Protein sequence 
Protein Accession #? NP_003652.l 

1 11 21 31 41 51 

I ' I I I 1 I 

MSALFLGVGV RAEEAGARVQ QMVPSGTDTG DPOSKPIOTW AAGTMDPBSS XFIEDAIKYP 
KBKV STQKL1* T,T J .TDNEAWN GFVAAAEIiPR NEADELRKAL DNIARQMIMK DKNWEDKGQQ 
YRNWFTiKKFP RLKSELEDNI RRLRALADGV QKVEKGTTIA NWSG6LSIS SGXLTLVQMG 
LAPFTEGGSL VLLEPGMELG ITAALTGITS STMDYGKKHW TQAQAHDIVI X8LDKLKEVR 
J3FWJENTSNF 1*5LAGNTYQL TftGIGKDIRA LRRARAKLQS VPHASASRPR VTEPISAESG 
EQVBRVNEPS ILEM3RGVKL TPVAFVSFFIj VLDWYLVYE SKHLHEOAXS ETAEELKKVA 
QELEEKXWIL. KNNYKlLQAD QEL 

Seq ID NOi 190 DETA sequence 

Nucleic Acid Accession #i HM_0 14452.1 

Coding sequences 1 . .1968 



1 11 21 91 

1 I I I 

ATGGGGACCT CTCCGAGCAG CAGCACCOCC CTCGGC TCCT 
GCCACAGCCA CGATOATOGC GGOCTCCCTT CTOCTGCTTG 
GCTCAGCCAG AACAGAAGGC CTCGAATCTC ATTGGCACAT 
ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA 
ACCAAGACAA GCCTGCGOGT CTGCACCAGT TGOCCTGTOG 
AATGGCATAG AGAAATGCCA TGACTGTAQT CAGCCATGCC 
TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC 
AACGCTACCT GTGCCCCCCA TACGGTCJTGT CCTGTGGGTT 
ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG 
TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA 
AAGCCGGGQA CCAAGGAOAC AGACAACGTC TGTGGCACAC 
ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG 
GTCCCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA 
TCTG1TAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA 
TCAGCAAGGG GGAAGQAAOA CGTGAACAAG ACCCTCCCAA 
CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC 
GGCGAGAAGT CCAGCACGCC CATCAAGGGC COCAAGAGGG 
CACAAGCATT TTGACATCAA TGAGCATTTG CCCTGGATGA 
GTGCTTGTGG TGATTGIGGT GTGCAfflATC CGGAAAAGCT 
CCCCGGCAGG ATCCCAGTGC CATTGTGGAA AAGGCAGGGG 
ACCCStBAACC GGGAGAAATG GATCXACTAC TGCAATGGCC 
CTTGTAGCAG CCCAAGTOGG AAGGCAGTGG AAAGATATCT 
AQTGAQAGGG AGGTTGCTGC TTTCTCCAAT GGGTACACAG 
GCAGCTCTGC AGCACXGGAC CATCCGGGGC CCCGAGGCCA 
GCCCTGCGCC AGCACCGGAG AAACGATGTT GTGGAGAAGA 
ACCACCCAGC TGOAAACTGA CAAACTAGCT CTCCCGATGA 



41 
I 

GCAGCCGCAT 
GATTCCTTAG 
ACCGCCATGT 
CCTATGTCTC 
GQACCTTTAC 
CATGGCCAAT 
(MCTGGCAT 
GGGGTGTGOG 
fSTACCTTCTC 
GTCAGAACCT 
TCK2COTCCTT 
AGCACATGGA 
CAGAATCCAA 
CAGTCCCTGA 
ACCTTCAGGT 
CGTCCATGGA 
GACATCCTAG 
TTOTGCTTTT 
CGAGGACTC7 



ATCAGTTTCT 
CCGACCAGGA 
GCCTCGCCCA 



60 
120 
180 
240 
300 
360 



51 
1 

CGCCCGCCGA 
CACCACCACA 
TGACCGTGCC 
TGAGCATTGT 
CAGGCATQAG 



GTTCCAGT CT 
GAAGAAAGGG 
AGATGTGOCT 
GGTGGTGATC 
CTCGAGCTCC 
AACCCATGAA 
CTCTTCTGCC 
CAACACAAGC 
AGTCAACCAC 
GGCCACTGGG 
ACAGAACCTA 
CCTGCTGCTG 
GAAAAAGGGG 
CATGACTCCA 
XATOCTQAAQ 
TTGCAATGCC 
GCOGGCCTAC 
GCTAATTAGC 
GATGGAAGAC 
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120 
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AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTCTCCTGAC GGTGGAGCCT 
TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 
GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 
AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 
GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG TGATTGAAGA GATTCCCCAG 
GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 
CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG 

Seq ID NO; 191 Protein sequence 
Protein Accession #t NP_055267.1 



1620 

1GB0 
1740 
1800 
1860 
1920 



1 
i 

MGTSPSSfiTA 
TGQVLTCDKC 
LiPCAALTDRB 
B9VMKCKAYT 
VPBSTYVPKG 
OQGPHHRHII* 
VLWIWCSI 
LVAAQVG BQW 
AliHQHRHNDV 
SPQDKNKGFP 
DEKLKFUtfPE 



11 

\ 

LASCSHIARR 
PAGTYVSEHC 
CTCPPGMFQS 
tXTLSQHIiWI 



KLfcPSMEATG 
RKSSRTLKKG 
KDIYQPLCNA 
VEKIRGLMED 
VDE «?E PXJjRC 
EIiRVIEEIPQ 



21 
I 

ATATKrAOSI» 
TNTSLRVCSS 
NATCAPHTVC 
KPGTKfiXDNV 
SVRPKVLSSI 
GEKSSTPIKG 
PRQDPSAIVB 
SSREVAAFSW 
TTQLETDKTA 
DSTSSGSSAIt 



31 
I 

TiT>r<GFLSTTT 
CPVGTFTRHE 
FVGWGVRKKG 
CGTIjPSFSSS 
QEGTVPDNTS 
PKRGHPHQNIi 
KAGLKKSMTP 
GYTADHERAY 
LPMSPSFl^P 
SRNG3FITKE 
X1GV2&QEAS 



41 
I 

AQPEQKASNL 
NGIEKCHDCS 
TETEDVRCKQ 
TSPSPGTAIP 
SARGKEDVNK 
nKHFDINEHL 
TQNREKWIYY 
AALQHWTIRG 
SPIPSEHAKL 
KKDTVliRjQVR 
QTIiOSVYSH 



51 
I 

IGTYKHVDHA 
QPCPWPMIEK 
CARGTFSDVP 
PftPEHMETHE 
TLPNLQWNH 
PWWIVLFLLL 
CNGHGIDILK 
PEASLAQIjIS 
EN9ALI»TVEP 
LDPCDLQPIF 
IiPDUL 



60 
120 

iao 

240 
300 
360 
420 
480 
540 
60O 



Seq ID NO: 192 PNA sequence 
NUclcic Acid Accession #: XM_044533 
Coding sequence: 238.. 2751 

1 11 21 31 41 51 

I I 1 1 1 \ 

GCTCTGCCCA AGCOGAGGCT □CGGGGCCGG CGCGGGOGGG AGGACTQCGG TGCCCCGCGG 
AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCXr 
GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGAOCGCG GGGCGGAGCT GCCGCCCGTG 
AGTCCGGCCG AGCCACCTGA GCCCGAGCCQ CGGGACACCG TCGCTCCTGC TCTCCGAATG 
CTGOGCACOG CGATGGGCCT GAGGAGGTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 
CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 
TGGGCGCTCA GCCCCCQQAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 
TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGOACC 
CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTGCTGCCA 
GaCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAQC 
TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CGTGCCGCTC 
AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GC3VSCCTTCA GCCCCATGTG TACCTACATC 
AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGARGGGGA ATGTCCTCCT GGAAGATGGC 
AAGGGCCGTT GTCCCTTCGA CCOGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGA0 
CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 
AGGCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC tgcaagaccc agcttttgtg 

gcctcagcct acattcctga gagoctgggc agcttgcaag gcgatoatga caagasctac 
TTTTTCTTCA gcgagactgg ccaggaattt gagttctttg agaacaccat tgtgtcccgc 

ATTGCCCGCA 7CTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGAGC 
TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGBCCCGACG ATGGCTTCCC CTTCAACGTG 
CTGCAGGAXG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCBTGACAC CCTTTTCTAT 
GGGGXCTTCA CTTCCCAGrG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 
ACAATQAAGG ATGTGC&GAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 
CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 

AACAGTGCCC gggaaaggaa gatcaactca tccctqcagc tcccagagcg cgtgctgaac 

TTCCTCAAGG ACCACTTOCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTQCTGCAG 
CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 
GATGTCCTCT TCCTGGGCAC TGGTOACGGC CGGCTCCACA AGGCAGTGAG 03TGGGCCCC 
CQOGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGOCCGT GCAGAATCTG 
CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGOCGT AGTCCAGGTG 
OGCATGGCCA ACTGCAGCCT GTACAGOAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 
TACTGTGCTT GGAGOQGCTC CAGCTGCAAG CAC3TCAGCC TCTACCAGCC TCAGCTGGCC 
ACCAGGCCGT GGATCCAGGA CATCGAGGGA GGCAGCGCCA AGGACCTTTG CAGCGCGTCT 
TCGOTTGTDT CCCCGTCTXT TGTACCAACA GGGGAGAAGG CATGTGAGCA AGTCCAGTTC 
CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTQGC GACCCGACTC 
TGGCTACGCA ACGGGGCCCC CGTCAATOCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 
GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTGC AGTGCTGGTC ACTAGAGGAG 
GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 
CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 
GCTGQTGGCA AGGCCAGGTG GGGTGCAGAC AGGTCCTACT GGAAGGAQTT CCTGGTGATG 
TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 
AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGOCA GCOTQCACCC CAAGACCTGC 
CCTGTGGTGC TGCCCCCTGA GACCCGCCCA, CTCAACGGCC TAGGGCCCCC TAGCACCCCG 
CTCGATCACC GAGGGTACCA GTCGCTGTCA GACAGCCCCC CX3GGGTCCCG AGTCTTCACT 
GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC GCCAGTGIGC 
CCCOGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 
TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 
TOC3CCTCC3GC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGa 
GGCCAGCTGG CCTGCTOCTC TCCAGTCAAG TAQCGAAGCT CCTACCACCC AGACACCCAA 
ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 
TGCTCCTTAT GTAAACTCAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 
GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 
GGGGTGCTGG GGATGCATCC AAAGTOGTTG TCTGAGACAG AGTTGGAAAC CCTCRCCAAC 



60 
120 
180 
240 
300 
360 
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1OB0 
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TGGCCTCTTC ACCTXCCACA TTATCCCGCT GCCACOGGCT GCCCTGTCTC ACTGCAGATT 3240 

CAGGACCAGC TTGGGCTGOG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3 3O0 

CTGCOOTOGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGOG GCCCTCACCA 33fi0 

GGTCCTQGGC TCGGACCCAA CTCCTGGACC TTTOCAGCCT GTATCAGGCT GTGGOCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3460 

CAGGGAAGAQ ACTOTC33CCT GCCTTCCTCC GTTGTT GOGT GAGAACCCGT GTGCCCCTTC 3540 

CCACCATATC CACCCTOGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600 

GTCCTCTCCC CAGTCCCCAG TTCACOCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCRGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID HO: 193 Protein sequence 
Protein Accession #: XP_044533-3 

1 11 21 31 41 51 

I I I I I 1 

KLRTAMGLRS WlAAPWGAItP PRPPDDLLLL LLUbLQPPPP THAdSFRI Sl» PIiGSEERPFIr 60 

RFEAEHISNY TAI»LLSRDGR TLYVGAREAL FALS&NI^PL PGGEYQKLLW GADAEKKQQC 120 

BPKGKDPQBD CQNYIXILI^* LSGSHLFTCG TAAPSPMCTY INMHHFTIAR DEKGCTVLLED 180 

GKGRCPFDPN FKSTALWDG ELYTGTV6SF QGKDPAlfiRS QSLRPTKTES SI2JWI»QDPAF 240 

VASAYIPESL GSLQGDDDKI YFFF3ETGQB FEFFENTrvs RIARICKGDE GGERVLQQRW 300 

TSFItfAOIiLC SRPDDGFPFN VLQDVFTLSP SPQDWKDTLF YGVFTSQWEH GTTEGSAVCV 360 

FTMKDVQKVF SGLYKEVNRE TQQHYTVTHP VPTPRPGACI TNSARBRKIK SSLQIiPDRVIi 420 

KFLKDHFLMD CQVH8RKLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 4B0 

PRVHIIEELQ IFSSGQ.PVQN LkliDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCUjARD 540 

PYCAKSGSSC KHVgLYQPQb ATRPWIQDIE GA8AKDLCSA SSWSPGFVP TGEKPCEQVQ 600 

FOPNTVNTIA CPLLENLATR LHLRNGAPVN AfiASCHVLPT GDUjLVGTQQ LGEFQCifSLE 660 

BGFQQI.VASY CPBWHDGVA DQTOEX3GSVP VI1STSKVSA PAGGKASWQA DRSYWKEEXV 720 

MCTLFVTiAVL LPVIiFLLYRH RNSMKVFLKQ GECASVHPKT CPWl^PETR PLMGLGPPST 780 
PLDHRGYQSU SDSPPGSKVF TESEKRFIiSI QDSPVEVSPV CPRPKVRLGS EIRDSW 

Seq ID NO j 194 DHA sequence 

Wucleic Acid Accession #; 1MJ>22 819.1 

Coding sequence i 1..635 

1 11 21 31 41 51 

1 1 1 I I I 

ATGGCAGATG- GGGCAAAGGC CAACCCCAAA GGGTTCAAAA AGAAGGTGCT GGATAGATGC 60 

TTCTCTGGGT GGAGGGGCCC ACGCTTCGGG GOCTCCTGTC CTTCAAGAAC CTCCAGGTCT 120 

AGCCTGGGTA TBAAGAA0TT CTTCACCGTG GCCATCCTTG CTGGCAGCGT TCTGTCCACA 180 

GCTCACGGCA GCCTGCTCAA CCTGAAGGCC ATGGTGGAGG CCGTCACAGG GAGGAGCGCC 240 

ATCCIGTCCT TOGTGGGCTA CGGTTOCTAC TGTGGGCTGG GGGGCCGTGG CCAGCCCAAG 300 

GATGAGGTGG ACTGGTGCTG CCACGCCCAjC GACTGCTGCT ACCAGGAACT CTTTGACCAA 360 

GGCTGTCACC CCTATGTGGA CCACTATGAT CACACCATCG AGAACAACAC TGAGATAGTC 420 

TGCAGTGACC TCAACAAGAC AGAGTGTGAC AAQCAGACAT GCATQTQTGA CAAGAACATG 480 

GT T CTGTGCC TC3VTGAACCA GACGTACCGA GAGGAGTACC GTGGCTTCCT CAATGTCTAC 540 

TGCCAGGGCC CCACGCCCAA CTGCAflCATC TATGAACCGC CCCCTGAGQA GGTCACCTGC 600 
AGTCACCAAT CCCCAGCGCC O0C0GCCCCT CCCTAG 

Beq ID ttOt 195 Protein eequence 
Protein Accession #: »P_073730 

1 11 21 31 41 51 

I I I I 1 1 

MADGAKANPK GFKKKVLDRC FSGWRGPRFG ASCPSRTSR8 SLGMKKPFTV AILAGSVLST 60 

AHGSLLNLKA KVEAVTGRSA ILSFVGYGCY CGLG3RGQPK DBVDWCCHAH DCCYQELPDQ 120 

GCEPYVDHYD HTIENNTEXV CSDLNKTECD KQTCMCDKNM VLCLMNQTYR EEYRGFLJSVY 180 
CQGPTPNCSI YKPPPEEVTC SHQSPAPPAP P 

Seq ID NO: 196 DMA seqpt^nce 

BUcleic AOid Acceeeion #s XH_02 8196-1 

Coding sequence: 1315.. 1791 

1 11 21 31 41 51 

I 1 1 I I I 

©QCATTGAT3 CTGTGTGOGC GTGCOTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 60 

GTGTGTGTGr GTCTGGAGTC ATGGCAGGGT CCCTTTCTGT CTGTCTCCTT GCTCTGCCOC 120 

AGACTGGGGG GCTGCAGAGG TGAGGGTATC TGGCCTCAAC AGCTGCTTAT TCCCGATGGG 180 

ATGGCCTGGG CTGGGCCCCT GAGGCCAGGC TGACTTGGAC ATGGCAAGAG GGGTCCCAGG 240 

CTCTTGTGGG CAAAGCAGGG GAGGCGCCAA TGTGGAGGAA CAGAGTCTCC TGGCTGGCTG 300 

CTGCTCTCTG GAGCGGGTGG AGTGAGGGAA GAGCTGAGCT GGGGAGTCAC CCTGGGCCTG 360 

GGGTCACCGT AGGCCCCATG TAGCACCCTG GTTOCCCTGC CTGTAGGTGA C3V3GAGCCAG 420 

CCCAGCCAGG TGTOCXCCCT CCCSCAGQCCC TAGGCAGGOG GGTACAGGGG OOVSCAGCTG 480 

CGCCCGCCCC ACCTTCCTTC CCACCCACAT GCCGAAGGGr GGCCAGGCAG GCAGGTGGAC 540 

GAGTCCAGGC AGOGGCTGAG TCAGTGTGTG TGGAATGTTC TGGCOGCTCC CAGCTGCACC 600 

CTGCCCCTAC CTGCX3U:CAC CTCAC5CTTCR TCCTCAGGCG CTGC5GGCCCT GAGCCCCTGC 660 

CAGGAATGCA GCTTTAGCCC AGGCCXGCTC AGTGAGCTCC GCCGACAGCC AGCCCTGCTC 720 

CTCCOGCCAT GACCCTGCAG ACCOCTCTGG GCTTOCAAGT TCCTGGGGGO TGCAGTGAAC 780 

AXGCTCCACC TGCATGOCTG GCAAACCATG GTGGGCXZCXZA GCTGTGGTQC GTGCTGGGGT 840 

AGAGQCAAGG AAGTGATGX3G ACCGCAGAGA TGAGACCCCC AGGGATGAGA TGGGACOCCC 900 

AGGCAGGGOC CAGGGTCCAG GGCCCAGGAG AGAGAAGCAG GGAGGGAQAG AGCTTCCTGO 960 

TGGAGGACGC ATCCTACAGT GGGGGCAAGG GTGCTCTGAG GTCCGGTGAA GGCAGGGACT 1020 

AGGCTGCCCA GGCCCTGCCT GCTTGGCTGG GGCTGGGGGC TGCTGGGAQG TGGCTGGGAG 1080 

GCTGGGCCTG GGCAGCTAAG CTGGAGCTT7 GGCCAQGGTC CAGAGOCTCC CTCCCTTCAG 1140 



1031 



WO 03/042661 



CTTCCTGCTG CACAGAACCC TCGCCCCTGG CCACCCCGTG CTGCCTCCTT GCCCTGGCAG 1200 

ACCCAGCACT GGClGCTGCT AGTCAGATGG GGTAGCGGGC AGGGGCCGGA GGGGCCACCC 1260 

TCCCAGCTGA CCCAGCCTCC TGGGOCGcn" CITCCAAACC AGCAGGGTAG AAAGATGGGG 1320 

CACCCACCAG TtTTCTCCCAQ TGCCCCGGCC CCAGCTGGCA CCACAGCTAT ACCTGGGCTT 1380 

ATTCCAGACC TTGTCGCCGG GACCCCCTGT GAGTTGTGGG ATTCCCAAGA GGGGTGTGGG 1440 

QATAACCCAG CCAAGTGGGG GCTGCAGCTG TCCACAGATG CACTCAGCCT GGCCTCTACC 1500 

CCAGGGCCCC GCTGGGCTCT CATTGCCGGC GCCCTTGCCG CGGGCGTCCT CCTCGTCTCC IS 60 

TGCCTCCTCT GTGCTGCCTG CTGCTGCTGC CGOCGCCACA GGAAGAAGCC CAGGGACAAQ 1620 

GAGTCCGTGG GTCTGGGCAG TGCCQGCGGC ACCACCACCA CCCACCTGGT GAGGAGCGGC 1680 

TCCTTGCTCA CTCAGTOCAG AGAGGGCTTG AAATCCAGGC TCCAGAGCCC AGGGCAGCGA 1740 

GGOGAGTTCA GCCCCAGGGA TGGTTTAACC CCCACAGAGG CAGGGCGTTG AGGACCTTCC 1B00 

TGGCAGOGAA AGTGGGTGAA CAGAGOTGAG AAGGAGGCCA TGCAACAGGG GCTQCCCCAT i860 

GGGOCOSAGG GAGCCACAGC GGGTTCTTGA GGAAGGCAGG GGGTACCCCA GATGCCACGT 1920 

TTTGQGTGGG TTTGGCOGGT CTCACAGAac GAAGCCGACG ATTTGTGCCT GTTGGGTGGC 1980 

CTGGCCTGGA GGCGGGGGGT CTTGACCCAT GTCATGCAAG GGCTGCCCGG GAGCCCAGGG 2040 

CTCTGATGAG GCATGATGTC AGCACCACCT GCCCCTTGTC CCAACTCACT GCAGGTGCAA 2100 

CCTGATGTGG ATGGCCTGGA GTGCAGCCCG GGGGATGCTC AGCAATGGGG GTGCCTGCAG 2lfi0 

CTCTCCCTGG AGTTCGACTT TGGAAGCCAG GAGGTGAAGG GCCCCGCTGC GCAGOACCAG 2220 

□GGTTCTGCG AGTTTCDGGA AAGGGTGACG GGGGAAGGGC AGACCCCATG CCCTGQGTGG 22 BO 

TGGGOAGCTG ACAGGGCAGG GQCCCTTGGC TGAGCOCACC CCGCXGGCTC CCAGATCAGG 2340 

GTGGGCCTGA GGCAGGCAGC OGACCTGAGG CCTGGGGGCA CDGTGGACCC CTATGCCCGG 2400 

GTCAGCGTCT CCACCGAGGC CGGACACAGA CATGAGACAA AAGTGCACCG AGGCACGCTC 24.60 

TGCCCOGTGT TTGACGAGAC CTGCTGCTTC CACGTGAGTC AGGGATGGTC GGCTGGGTGG 2520 

GOCTGGACGa CTGGATGGGC CTGGGCTGGG TGGGCCTGQG CAGCTGGGTG GGCCTGGQCA 2580 

GCTGGGTGGG CCTGAGCTAG GGCAGCAGGG CCTGGCTGAC GCCGCTGCCT CAGATCCCGC 2640 

AGGCGGAGCT GCCAGGGGCC ACCCTGCAGG TGCAGCTTTT CAACTTCAAG CGCTTCTCOG 2700 

GGCATGAGCC CCTGGGTGAG CTCCGTCTQC CACTGGGCAC CGTGGATCTG CAGCATGTTC 2760 

TGGAGCACTG GTACCTGCTG GGCCCGCCGG CTGCCACTCA GGTGAGGTGC TGGTCACCAG 2820 

QCCACAGCCC AAGGCAGAGC TGGCAGGOAC CCTGCCCTAT GGGCCATCGG AAAGACAGGC 2880 

CTGATGGGCA GCATTTTOGG GGGTCTGAGC CCCAACTCGG CCAQAATCAC CCTCCCGGGC 29 4D 

TGAAGCCCCT CTTGCTGCCC ACAGCCCGAG CAGGTOGGGG AGCTGTGCTT CTCTCTCCGG 3000 

TACGTGCCCA GCTCAG3CCG GCTGACOGTG GTGGTGCTGG AGGCTOGAGG CCTGCGTCCA 3060 

GGACTTGCAG AGCCCTAOGT GAAGOTCCAG CTCATGCTGA AOCAGAGGAA OTGGAAGAAG 3120 

AGAAAGACAG CCACCAAAAA GGGCAOGGOG GCCCCCTACT TCAATGAGGC CTTCACCTTC 3180 

CTGQTGCCCT TCAGCCAGGT CCAGAATGTG GACCTGGTGC TGGCXGTCTG GGACOGCAGC 3240 

CTGCCGCTCC GAACTGAGCC GGTAGGCAAG GTGCACCTGG GTGCOOGGGC CTCGGGGCAG 3300 

CCCCTGCAGC ACTGGGCAGA CATGCTGGOC CACGCCCGGC GGCCCATTGC CCAGCGGCAC 3360 

COCCTGCGGC CAGCCAGGGA GGTGGACCGC ATGCTGGCCC TGCAGCCCCG CCTTCGCCTG 3420 

CGCCTGDOCT TGCCCCACTC CTGAATGCAC CACATGCCTC TGTCTCCCCG CTGAGCCCAG 34B0 
GCACTTGCCC AGGCCGCCCT GCAGGACCAC TGCAATAAAC QCCTTCTCCT GCC 

Seq ID NO i 197 Protein sequence 
Protein Accession ft* XP_02B196-1 

* 11 21 31 41 51 

I I 1 I I I 

KGHFPVSPSA PAPAGTTAI P GLIPDIjVAGT PCELWDSQEG CGDNPAKKGIi QI^STDALSLA 60 

STPGPRWALI AGAIiAAGVLIr VSCLLCftACC CCREHRXKPR DKBSVGLGSA RGTITTHIjVR 120 

GGSLLTQSRB glksrlqspg qrgefspbdg ltpteagr 

Seq ID »0: 198 DMA sequence 

Nucleic Acid Accession #s KM 000612.2 

Coding sequence: 553..109S ~~ 

1 11 21 31 41 51 

I I I 1 | | 

TTCTCCCGCA ACCTTCCCTT C6CTCCCTCC CGTCCCCCCC AGCTCCTAGC CTCCGACTCC 60 

CT0C0CCCCT CACGCCCGCC CTCTCGCCTT CGCCGAACCA AAGTGGATTA ATTACACGCT 120 

TTCTGTTTCT CTCOGTGCTG TTCTCTCCCG CTGTGCGCCT GCCCGCCTCT CGCTGTCCTC 180 

TCTCOCCCTC GCCCTCTCTT CBOCCCCCCC CTTTCACGTT CACTCTOTCT CTCCCACTAT 240 

CTCTGCCCCC CTCTATCCTT GATACAACAG CTGAOCTCAT TTCCOGATAC CTXTTCCCCC 300 

GCGAAAAGTA CAACATCTGG CCCGCCCCAG CCCGAAGACA GCCCGTCCTC CCTGGACAAT 360 

CAGAOGAATT CTCCCCCCCC CCCCAAAAAA AAAAGCCATC CCOOCGCTCT GCOCGGTCGC 420 

ACKTTCGGCC CCCGCGACTC GGCCAGAGCG GGGCTGGCAG AGGAGTG1CX! GGCAGGAGGG 480 

CCAAOGOCCG CTGTTCGGTT TGOGACACGC AGCAGGGAGG TGGGCGGCAG CGTCGCGGGC 540 

TTCCAOACAC CAATGGGAAT CCCAATGGGG AAGTCGATGC TGGTGCTTCT CACCTTCTTG 600 

GGCTTCGCCT CGTGCTGCAT TGCTGCTTAC OGCCCCAGTG AGACCCTGTG CGGCGGGGAG 660 

CTGGW3GACA OCCTCCAGTT OSTCTGTGGG GAOCGCGGCT TCTACTTCAG CAGGCCOGCA 720 

AGCCGTGTGA GCCOTCGCAG CCGTGGGATC GTTGAGGAGT GCTGTTTOCG CAGCTGTGAC 780 

CTGGCCCTCC TGGAGACGTA CTGTGCTACC CCOGCCAAGT CCGAGAGGGA CGTGTGGACC B40 

CCTCCGACC3G TflCTTCOGGA CAACTTCCCC AGATACCOCG TGGGCAAGTT CTXCCAATAT 9QQ 

GACACCTGGA AGCAGTCCAC CCAGOGCCTG CGCAGGGGCC 7X3CCTGCCCT CCTGCGTGCC 960 

CGCCGGGGTC ACGTOCTCGC CAAGGAGCXC GAGGCGTTCA GGGAGGCCAA ACGTCACGGT 1020 

CCCCTGATTG CTCTACCCAC CCAAQACCCC GCCCACGGGG GCGCCCCCCC AGAGATGGCC 1080 

AGCAATCGGA AGTGAGCAAA ACTGOGGCAA GTCTGCAacc CGGOGCCAOC AXCCTGCAOC 1140 

CTOCTCCTGA CCAOGGACGT TTCCATCAGG TTCCATCCOG AAAATCTCTC GGTTCCAOGT 1200 

CCOCCTGGGG CTTCTCCTGA CCCAGTCGCC GXGCCCOGCC TCCCCGAAAC AGGCTACTCT 1260 

CCTC5GGCCOC CTCCATCGGG CTGAGGAAGC ACAQCAGCAT CTTCAAACAT GTACAAAATC 1320 

GATTGGCTrr aaacaccctt cacataccct occccc 

Seq ID NO s 199 Protein sequence 
Protein Accession # : HP_000603 .1 

1 11 21 31 41 51 
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AAAGTTCCCA 
CTGTTGTCTC 
CTTCTCCTCC 
CTTTACAAAT 
GCCCCACCTT 
GGTGTCAGCC 
TCTGTCACCC 
CPCCCTTGGC 
GCTGTTGTGC 
CCAGGCCTTG 



G CCAATAGAC 
CATGGACCAT 
CTCTACTCAG 
QCCTTGGTAT 
GGTGATGTTG 
CTGCTAGGGA 
TGCAGCTCCT 
TGTTCACCCA 
TCTCTCGAAC 
TTAAAAAAAA 



AGCATGAATC 
TCCTCGGAGT 
CCTTAAAAAG 
GTACCTCTGG 
CAACCTTTCC 
GTCACTGTAC 
TTACAGTTCA 
GCCA.TTCCCT 
CCTGACCAGA 
AAAAAAAAAA 



AAGGAACTTG 
AGTGGTGAGA 
GGCTTCTTGG 
CAAATGCCCT 
TTCTGCTAGG 
ACACAAACTC 
ATCCAATGAT 
GAAQGOCTTA 
TCATCAGCCA 



CATTATATGT 
TGGOCTTGGG 
AACTTTACCA 
GGTATGTACC 
gtgtacacct 
TACTGGAATT 
AGAAACCATC 
CCAACAGGAA 
CTGAGGCCAG 



GCTCTTGAAT 
TTGCCCTTGG 
GCAGCCTCAG 
TCTGGCAAAT 
AGCCTGTGCA 
CCTGCCAACA 
CCTTCCCTTT 

TATCCAAGAA 
TGGAATTTCC 
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Seq ID WO: C66 OKA Sequence 

Nucleic Add Accession ffs WM_014459-2 

Coding sequence: 738, *3407 

1 11 21 31 41 51 

I I I I I 1 

GTAGATGCAG TCOGCCGCCG CCGCTGCCTC AGCCAGCAAT GCAAGATTAG ATCTCTAAAT 60 

GCAGCAAAAC ACTGCCTGAA AACAGACOGG CCCGCGCAGC AAGCAGACAT TTCACGGTGC 120 

GCTGGGGAAG CTTCAAAATA TATCTGTGAC TCTGTCTTCG TTGCTCTTCA TCCCCATCAA 180 

TTTCATCACG GGAGGCGAGC AGCAAGTAAG AATTTCACTT TCGGATCTGC CTAGAGACAC 240 

ACCTCCCTGC TCCCTCCCCC ACTCGATGTG AAGAGTATTC CGGAGTCTCC GGGCGGGAGT 300 

AGATTTGCAG CACCCTAGCG GGAQCX3AGGA AAACCTACTG ATTCTTIAGC TCATTATCAT 360 

CTCTCCCAGA CGAGATTTCC TTCTTATCGC CTGCCTCATC GCTCAAOTTT GAGCCTCCCG 420 

AAGTCCGGGC GGGAGAGACG AAACCCCTGG CTCACCCCCA GCCGCAGGAA GCCACGGCCT 480 

TGCTCCAAGC OCCTGCAGCT CTGCTGCACC GCAGCTTCTC ACCCAGTGCG GATGCTGTAG 540 

ATCAACAGGT TCAGGGAACT TGAGCAGAAT AAGGAGAGAC CACCGGGTGC OGCAGCTCGG 600 

GTGCAGAGGG AAAAAAGGAC CCATAGACTT GTGQCTCGCG TCGCGCGCGC ACGCTGCGCC €60 

AGGGCGOCAG GCTGGCGCGC ACTCCCTCTC TGGCTCCTCC AGTCCGATTG CTCCTGCCCC 720 

CACCTTACAG GTCTGGGATG TACCTTTCCA TCTGTTGCTG CTTTCTTCTA TGGGCCCCTG 7B0 

CCCTCACTCT CAAGAACCTC AACTACTCCG TGCCGGAGGA GCAAGGOGCC GGCACGGTGA 840 

TCGGGAACAT CGGCAGGGAT GCTCGACTGC AGCCTGGGCT TCCGCCTGCA GAGCGOGGOG 900 

GCGGAGGGCG CAGCAAGTCG GGTAGCTACC GGGTGCTGGA CAACTCCGCA CCGCACCTGC 9G0 

TGGACQTaGA CGCAGACAGC GGGCTCCTCT ACACCAAGCA GCGCATCGAC CGCGAGTCCC 1020 

TGTGCCGCCA CAATGCCAAG TGCCAGCTGT COCTCGAGGT GTTCGCCAAC GACAAGGAGA 1080 

TCTGCATBAT CAAGGTAGAG ATGCAGGACA TCAAOGACAA CGCGCCCTCC TTCTCCTCGG 1140 

ACCAGATCGA AATGGACATC TCGGAGAACG CTGCTCCGGG CACCX33CTTC CCGCTCACCA 1200 

GCGCACATGA CCCCGACGCC GGCGAGAATG GGCTCCGCAC CTACCTGCTC ACGCGCGACG 1260 

ATCACGGCCT CTTTGGACTG GAOGTTAAGT CCCGOGGOGA OGGCACCAAG TTCCCAGAAC 1320 

TGGTCATCCA GAAGGGTCTG GACCGCGAGC AACAGAATCA CCATACGCTC GTGCTGACTG 1380 

CCCTGGACGG TGGOGAGOCT CCACGTTCCG CCACCGTACA GATCAACGTG AAGGTGATTG 1440 

ACTCCAACGA CAACAGCCCG QTCTTCGAGQ CGCCATCCTA CTTGGTGGAA CTGCCCGAGA 1500 

AXX3CTCCGCT GGGTACAGTG GTCATCGATC TGAA03CCAC CGACGCOGAT GAAGOTCCCA IS CO 

ATGGTGAAGT GCTCTACTCT TTCAGCAGCT ACGTGCCTGA CCGCGTGCGG GAGCTCTTCT 1620 

CCATGGACCC CAAGACCGGC CTAATCCGTG TGAAGGGCAA TCTGGACTAT GAGGAAAACG 1680 

GGATGCTGGA GATTGACGTG CAGGCCCGAG ACCTGGGGCC TAACCCTATC CCAGCCCACT 1740 

GCAAAGTCAC GGTCAAGCTC ATCGACCGCA ACGACAATGC GGCGTOCATC GGTTTOGTCT 1800 

CCGTGCGCCA GGGGGC33CTG AGCQAGGCCG CCCCTCCCGG CACCGTCATC GCCCTGGTGC 1860 

GGGTCACTGA CCGGGACTCT GGCAAGAACG GACAGCTGCA GTGTCGGGTC CTAGGCGGAG 1920 

GAGGGACGGG CGGCGGCOGG GGCCTGGGCG GGOCCGGGGG TTCCGTCCCC TTCAAGCTTG 1980 

AGGAGAACTA CGACAACTTC TACAOSGTGG TGACTGACCG CCCGCTGGAC CGCGAGACAG 2040 

AAGACGAGTA CAACGTGACC ATCGTGGCGC GGGACGGGGG CTCTOCTCCC CTCAACTCCA 2100 

CCAAGTCGTT CGCGATCAAG ATTCTAGACG AGAACGACAA CCCGCCTCGG TTCACCAAAG 2160 

GGCTCTACGT GCTTCAGGTG CACGAGAACA ACATCCGGGG AGAGTACCTG GGCTCTGTGC 2220 

TCGCCCAGGA TOCGGACCTG GGCCAGAACG GCACCGTATC CTACTCTATC CTGCCCTCGC 2280 

ACATCGGCGA OGTGTCTATC TACACCTATG TGTCTGTGAA TCCCAOGAAC GGGGCCATCTT 2340 

AGGCCCTGCG CTCCTTTAAC TTCGAGCAGA CCAAGGCTTT TGAGTTCAAG GTGCITGCTA 2400 

AGGACTCGGG GGCGCCCGCG CACTTGGAGA GCAACGOCAC GGTGAGGGTG ACAGTGCTAG 2460 

ACGTGAATOA CAACGCGCCA GTGATGE3TGC TCCCCACGCT GCAGAACGAC ACCGCGGAGC 2520 

TGCAGGTGCC GCGCAACGCT GGCCTGGGCT ATCTGGTGAG CACTGTGCGC GCCCTAGACA 2580 

GCGACTTCGG CGAGAGCGGG GGTCTCACCT ACGAGATCGT GGAOGGCAAC GACGACCACC 2640 

TGTTTGAGAT CGACCCGTCC AGCGGCGAGA TCCGCACGCT QCACCCTTTC TGGGAGGACG 2700 

TGACGCCCGT GGTGGAGCTG GTGGTGAAGG TGACCGACCA CGGCAAGCCT ACCCTGTOCG 2760 
CAGTGGCCAA GCTCATCATC CGCTCGGTGA GCGGATCCCT TCCCGAGGGG GTACCACGGG 2 020 
TGAATGGCGA GCAGCACCAC TGGGACATGT CGCTGCCGCT CATCGTGACT CTGAGCACTA 2BB0 
TCTCCATCAT CCTCCTAGCG GCCATGATCA CCATCGCCGT CAAGTGGAAG CGCGAGAACA 2940 
AGGAGATCCG CACTTACAAC TGCGGC&TCG CCGAGTACAG CCACCCGCAG CTGGGTGGGG 3000 
GCAAGGGCAA GAAGAAGAAG ATCAACAAAA ATGATATCAT GCTGGTGCAG AGGGAAGTGG 3060 
AGGAOAGGAA CGCCATGAAC GTCATGAACG TGGTGAGCAG CCCCTCCCTG GGCACCTCX3C 3120 
CCATGTACTT GGACTACCAG AGCOGCCTQC CCCTCAGCTC QCCCGGGTOG GAGGTGATGT 3180 
ATCTCAAACC GGCCTCCAAC AACCTGACTG TCCCTCAGGG GCACGCGGGC TGCCACACCA 3240 
GCXTCACCGG ACAAGGGACT AATGCAAGCG AGAOCCCTGC CACTCGGATG TCCATAATTC 3300 
AGACAGACAA TTTTCCCGCA GAGCCCAATT ACATGGGCAG CAGGCAGCAG TTTGTTCAAA 3360 
GTATTTCAGT AGCTCCACGT TTAAGGACCC AGAAAGAGCC AGCCTGAGAG ACAGTGGGCA 3420 
CGGGGACAGT GATCAGGCIG ACAGTGACCA AGACACTAAC AAAGGCTCCT GCTGTGACAT 3480 
GTCTGTTAGG GAGGCACTCA AGATGAAAAC TACTTCAACT AAAAGCCAAG CACTTGAACA 3540 
AGAACCAGAA GAGTGXGTTA ATTGCACAGA TGAATGGCGA GTGCTTGGTC ATTCTQACAG 3600 
GTGCTGGATG CCACAGTTCC CTGCAGCCAA TCAjGGCTGAA AATGCAGATT ACCGCACAAA 3660 
TCTCTTTGTA CCTACAGTTG AAGCTAATGT TGAGACTGAG ACTTACGAAA CTGTGAATCC 3720 
CACTGGGAAA AAGACTTTTT GTACATTTGG AAAAGACAAG CGAGAGCACA CTATTCTC&T 3780 
TGCCZAAOGTT AAACCTTATT TAAAAGCCAA ACGTGCCCTG AGCCCTCTOC TCCAAGAGGT 3840 
CCCCTCAQCA TCAAGCAGCC CAACCAAGGC GTGCATCGAG CCTTGCACCT CAACAAAAGG 3900 
CTCCCTGGAT GGCTGTQAAG CAAAACCAGG AGCCCTGGCT GAAGCAAGCA GTCAGTACTT 3 3 SO 
GCCCACTGAC AGTCAATATC TGTCACGTAG TAAGCAACCA AGAGACCCTC CCTTCATGGC 4020 
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TCAGCCCACC CTTGGQTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA XOSO 

TCTGTCQCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGOATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CTGAAGCTTT TTQT 

Seq ID MOi 203 Protein gequence 
Protein Accession #s NP_004208 



MAQKSNSYPW 
SSGTPDIIrTR 
QLHREIEIQA 
HEELADALWY 
PPEMIEGRKH 
QDI*ISKLLRH 



11 
I 

FYGRQTAPSG 
HFTIDDPBIG 

CHGKKVIHRD 
NEKVDfcWCIG 
NPSERLPLAQ 



21 
I 

£>STIiPQRVXiR 
RPLGKGKFGN 
YNYPYDRRRI 
1 KPBN LiLLGb 

VSAHPWVEANr 



31 
I . 

KEPVTPSALV 
VYXiAKEKKSH 
YLILEYAPRG 
KGE LiK IADFG 
PPFESASHNE 
SERVL.PPSAL 



41 



51 

1 



LMSRSNVQPT AAPGQKVMEN 
F IV AXiKVTi FK SQIHKBGVEH 
ELYKELQKSC TFDEQRTATI 
KSVHAPSLRR KTMCGTIJ>YL 
TYRRIVKVUb KFPASVPTGA 
CSV A 



Seq ID NO: 204 DNA sequence 
Nucleic Acid Accession #i AK055G63 
Coding sequence : 38 . . 1423 



AGAACGGCTT 
AAAACCACAA 
CCGAAGGTCC 
GCTTATGTGG 
TTTTGATCTT 
TAGCCCTGTC 
AGTCTTGGCA 
ACAOCCCGAG 
CCTGTTCACG 
TACGAGCTGG 
GGGACTTAGC 
AGCATTTGCT 
CACTGCCTCT 
GTACAGTGGG 
ACTCATCAGA 
GACCCTAGGT 
TGAACAAATG 
TGTTCAAATT 
CAATGTCCTA 
TGATTTGAAC 
ATTTAACACT 
TTATGGTTTT 
TGGAGTTCCA 
TAGATATGGA 
TTATAAGGAA 
TGTTTAATCA 
TATGAAACIA 
GCTTTAAATA 
GTTXTGTAGT 
AJ3ATCTGTCA 
TGCCACTGTG 
CTTABT TT TP 
ATGCAGTGGC 
TGCCTCAGCC 
TGTATTTTTA 
ACCTCATQAT 
CCTGGCCGAT 



11 
1 

CCGGOGGGAG 
AGATCCTTTT 
TGGAAGATAC 
TGCAGTTCTA 
TTTAQTTTAA 
TATTCATTTG 



A ATTG CTAAA 
TTTTTAGCAG 
QATTTTTGTT 



ATACACACGG 
ATGCTTTCTA 
CTTCAAGAGC 
AGTATCTTCC 
CTTTGTATTA 
GCTATAGCTA 
AAAGTCTTAC 
GAGGTATCTA 
TTTGGCTCAT 
GTTCTTGCTC 
TTCAAGGATG 
AACTTTTCA.G 
OCAGTTACAT 
GCTGGGAAAA 
GGTCTCAATC 
GGAATTGGAG 
ACTAATAATA 
TAT TGACTCG 
TTIACTCTAA 
TAT TTTTGTA 

TGACTGCAGT 
CATTACTAAG 
CCCGGOCAAT 



ATGATCTCAG 
TCCCGAGTAG 
GTAAAGAOGG 
CCACCCACCT 
ATTTTCTTTA 
AAAATGTCTG 
TTTTTCTTTG 
AAATTTTGGA 
AAAGTTTCTC 



21 
I 

CTGTGCAGCT 
TTGGCAAGTT 
TGCTCTTTGG 
CTAATAGTAT 
TGACATGTTT 
GGTTTGAAAG 
CTCTCTTTAT 
GAAGATTATT 
TTOGGAATAA 
ATCTTGCAGA 
TTCCCCGAAT 
CATATATGCT 
TTGOCTTGAT 
TCCAGACAAC 
CCTTAGATGG 
TGGCTGGATC 
ATGTGACCAA 
ACTGGATTAG 
ATCATCACGT 
CAACTCCAGC 
ATGTGAACCC 
ATGGACACAC 
CAACTCAAGG 
GAATTGGACA 
TTGGCTTCCA 
ATGTTAGATA 
AAATGTATTT 
AGAAAATGTG 
GTGATGTGAC 
ATACGATATT 
ACATTATTAT 
GTTTTTTGAG 
CTCACTGCAA 
CTGGGATTAC 
□GGATTTCAC 
TAGCCTCCCA 
ATGAAATTTA 
TTCAAAAAGT 
AGGTTCTOCT 
ATACATTCTA 
TCCTTTAAAA 



31 
I 

CCTTATCATG 
GTTACGGGAA 
TGTAATAAAC 
AGCTTTAACT 
AATAAGTTAC 
ATTAGAAGTC 
ATTAAAAGAA 
AGTTGGTACT 
ACCTTTTGCT 
TCTTAGTOGA 
GAATCCATTT 
CATTGAAATT 
GACATTTGGC 
ACCACCCCAT 
AGTTTTAGAA 
AGTGCATGTA 
CAGGCTGTAC 

AATCCCAATG 
TAAACCTAGT 
AGTTATTCTT 
ACCTTACAGC 
ATTGAGGACT 
ACCAAGACCA 
ATT T ATT TAG 
ATAGTAGTCT 
GTGACAGTGA 
TTTCTTTAAA 
CTTACCTTTA 
TCTTTTTTTT 
TAACTTAAGG 
ATGGAGTCTG 

ccrcraocTC 

AGGCACCTGC 
CATGTTGGCC 
AAGTGCTGGG 
TAAATATGCT 
AAAGGTCTCT 
GAATTATGTC 
TCTAGCACAA 
ATTTTAGTAC 



41 
1 

GGGACAATTC 
TTTAGACTTG 
TTGATATGTA 
GCCTATACTT 
TGGGTAACAT 
CTGGCTGTAT 
AGTGCAGAAC 
TTTGTGGCTC 
TATGTCTCAG 
AGCTTGTGTG 
GTTTTGATTG 
AATAATTATT 
ACTATGTATC 
GTTATTGGT C 
GTCCGAAATG 
AGAATTCGAC 
ACTCTAGTGT 
TTGTCTGGGC 
CCTCTTTTAA 
AGTCCACCTC 
CTAAACACAC 
AGCATGCTTA 
GGTTTTACAA 
TGATAGACTC 
TAATCOAACT 
TGTTCACATT 
AATCCTCGTA 
TTTGGATTTT 
TAAGAGGCAC 
TCCOAGAOGG 
CTGTACTTTA 
ACTCXGTCGC 
CTGAGTTCAA 
CACCACGCCC 
AGGCTGGTCT 
ATTAGGTGTG 
TCTTQAATAA 
TTTATAGCTT 
TTACAAACTA 
TTTGAATTTT 
ATT TGTAAAT 



SI 
I 

ATCTCTTTCG 
TAGCAGCTGA 
CTGGCTTDCT 
ACCTGACCAT 
TGAGGAAACC 
TTGCCTCCAC 
GCTTTTTGGA 
TTTGTTTCAA 
AAGCTGCTAG 
GAATTATTCC 
ATCTTGCTGG 
TTGCCGTAGA 
CCATGAGTGT 
AGTTGGACAA 
AACATTTTTG 
GAGATGCCAA 
CTACTCTAAC 
CTGTTGCAGC 
AGGGTACTGA 
CACJAATTTTC 
AAACAAGGCC 
ATCAAGGACT 
ATATACCAAG 
TAACTTATTT 
TTGCATTSAC 
TCATGAAACC 
AATGTTAAAG 
GGTATCTTTG 
TTGATGGAGT 
AGTCTTGCTC 
TTAAGGCTTC 
CCAGGCTGGA 
ATGATTCTCC 
AGCTAATITT 
TGAACTCCTG 
AGOCACCGCA 
TACACATTTT 
TTCCAAACTT 

TAATTATCAA 



Seq ID MO- 205 Pxoteln sequence 
Protein Accession #= HAB7Q98Q.1 



1 
I 

HGTIHLFRKP 
TAYTYLTIFD 
ESAERFLEQP 
SSliGGliPGIi 
GTMYPMSVYS 
VRISRDAHBQ 
MPIiIiKGTDDIi 
SSMliNQGLGV 



11 
I 

QRSPPOKWjR 
LFSLMTCLIS 
ElHTGRIi'LVG 
SeiFJUPRHMP 
GJCTIjTjQTTPP 
MVLAHVTHRIj 
NPVTSTPAKP 
PGIGATQGtiR 



21 
I 

EFRiVAADRR 
YWVTLRKPSP 
TFVAIiCFNLF 
FVIiIDIiAGAF 
HVIGQLDKLI 
YTLVfiTliTVQ 
SSPPPEFSFN 
TGFTNIPSRY 



31 
I 

SWKILLFGVI 
VYfiBGFERLE 
TWL8IRNKPF 
ALCITYMLZE 
REVSTLDGVli 
IFKDDWTEPA 
TEGKNVNPVI 
GTMNHIGQPS 



41 
I 

KlilCTGFLLM 
VLAVPA3TVL 
AYVSEAASTS 
rHMYPAVDTA 
KVRNEHFWTL 
liTiSGPVAANV 
LtiHTQTRPYG 
P 



Seq ID no s 206 DMA sequence 

NUCleic ACid Accession #s NK_016361.1 

Coding sequence i 397- -1662 



60 
120 
180 
240 
300 



60 
120 
1G0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1560 
1620 
1680 
1740 
1800 
1960 
1920 
19B0 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



SI 
1 

WCSSTHS1AD 
AQLGALFILK 
KLQEHVADLS 
SAIAIALMTF 
GFG&IiAGSVH 
IjNFSDHHVTP 
PGLMHGHTPY 



60 
120 
180 
240 
300 
360 
420 



11 



21 



51 
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GGAACTCAGG 
TTTAATGTTC 
AGGAGCGACC 

rrGTTCrcTC 

TGCTTGCTGA 
TGAGCCCCGG 
CGAGGGCCTA 
GTCCTGACCT 
GAGGCCGATG 
GTGTTTOGAC 
TGGAACCCCC 
CTAGCTGGTG 
AAGGGGGGCA 
GGAGAGAGAC 
CCAC/USGAGG 
TTGCTGGCTG 
GCAGATTCAG 
AGAGGCCGGA 
AAGGACAGGA 
GTGGCTGCCG 
ATOATOGAAC 
AGTCTTCAGA 
ATGGACTCTG 
GTGACCTTCA 
GCTGTTGACC 
CTCTATTACC 

GACATGTTCT 
TGCTCTCAAA 
ATGTGTTGAT 



GCCGGCTCCT 
CTCGTCGGGC 
AACCCCGACC 
CCCCGTTCCC 
AGGGCTGGAT 
CGCCCGCAGA 
CCACCATGAT 
CGCTGGCGTA 
GCCAGTGTCC 
ACGGGGCTCG 
AGCTATTAGA 
GTCCGAAACC 
TGTTTGCTGG 
TGAOGAAGAA 
TCTTTATTOG 
GGCTTTTCCA 
AAGTCTTGTA 
GGCAGACTGC 
TGGGCATTGA 
AGCAGGCACA 
AGAGAGCTGT 
TGGCAGTAGG 
CCACTGCCCC 
TAOCGCl'CTT 
TGACCATGGA 
ACGGGAAGGA 
TGAATGCCAT 
CTCAGGTGAT 
TTTAAAATAA 



GTTCCTTCAA 
AAAAGATAAG 
CGGGTTAAAA 
ACTCGGGGTC 
GTACGCATCC 
AGACTTGTGT 
CACTGGTGTG 
CTGCCTGCAC 
GGTCGACCGC 
GAGTCCTCTC 
GGTCGCACCC 
ATATTCTCCT 
GCAGCTQACC 
CTATGTGGAA 
TTOCACTAAC 
GTGTCAGAAA 
TCCCAACTAC 
CT CTTTACAG 
CAGTAGTGAT 
CAACCTCCCA 
GGACACATCC 
CCCATTCCTC 
COACAAGATC 
AATGACCCTG 
ACTTTAGCAG 
GCAGGTGCCG 
GTCAGTTTAT 
GGAASTTGGA 
AGTGCCTTTA 



GAGTGCTGGA 
GATCCGATCT 
CTCCCAGGGA 
TCCXTTCAGGG 
GCAGGTFCCC 
TTGOCTCCTG 
TTCAGCATGC 
CAGCGGCGGG 
AGCCTGCTGA 
AAGCCGCTCC 
CAAACTCAGT 
TACGACTCTC 
AAGGTGGGCA 
GACATTCCCT 
ATTTTTOGGA 
GAAGGACCCA 
CAAAGCTGCT 
OCAGGAATCT 
AAAGTGGACT 
AQCTGCCCCA 
TTGTACATAC 
CACATCCTAG 
AGAAAGCTGT 
GGGATTTTTG 
CAOCTGGAAT 
AGAGGTTGCC 
ACCTTAAGCC 
AATGAAGAGT 
TACAAAAAAA 



QGCCAAACTT 
CCCCCGGCCC 
CTCTTCGCTG 
CCGGGAQGCA 
GCGGACTTGG 
CAGCCTCAAC 
GCTTGTGGAC 
TGGCCCTGGC 
AGTTGAAAAT 
CGCTGGAGGA 
TTGATTACAC 
AATACCATOA 
TGCAGCAAAT 
TTCTTTCACC 
ATCTGGAGTC 
TCATCATCCA 
GGAGCC1GAG 
CAGAGGATTT 
TCTTCATCCT 
TGCTGAAGAG 
TGCCCAAGGA 
AGAGCAACCT 
ATCTCTATGC 
ACCACAAATG 
CTAAGGAGTG 
CTOATGGGCT 
CAGAAAAATA 
AACTGACTTA 
AAAAAAAAAA 



GAAATACAAG 
GGTGTGCAGC 
CTGOCACCTC 
CAGCGGTCCC 
GGGCGCCCGC 
CCGOAGGCAG 
CCCAGTGGGC 
CGAGCTGCAG 
GGTGCAOGTC 
GCAGGTAGAG 
AGTCAGCAAT 
GACCACCCTG 
G'l" J"l GCCTTG 
AACCTTCAAC 
CACCCGTTGT 
CACTGATGAA 
GCAGAGAACC 
GAAAAAGGTG 
CCTGGACAAC 
ATTTGCAGGG 
AGACAGGGAA 
QCTGAAAGCC 
GGCTCATGAT 
GCCACCGTTT 
GTTTQTGCAG 
CTGCCCGCTG 
CCATGCACTC 
TAAAAGCAGG 
A 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
1560 
1620 
1680 



Seq ID NQt 207 Protein sequence 
Pxotein Accession #.* NP_05744S.l 



KRLWTPVGVI, 
LPLBEQVEWN 



PIIIHTDEAD 
DFFILLDNVA 
LESNLLKAMD 
ESKEWFVQLY 
B 



11 
I 

TSLAYCLHQR 
PQLLEVPPQT 
RLRKNYVEDT 
SEVLYENYQS 
AEQAHNLPSC 
SATAPDKJRK 
YHGKEQVPRG 



21 
1 

RVAIiAEbQEA 
QFDYWTNIjA 
PKbSPTFNPQ 
CWSLRQRTRG 
PHLKRFARMI 
LYLYAAHDVT 
CPPGLCPLDM 



31 
I 

DGQCPVDRSL 
GGPKFYSPYD 
EVFIESTHIP 
RRQTASLOPG 
BQRAVDTSI.Y 
FIPIiLMTLGI 
FLHAMSVYTfc 



41 
I 

I/KXrKMVQ W F 
SQYHETTLKG 



ISEDLKKVKD 
ILPKEDEESL 
FDHKWPPFAV 
gpEECYEAliCB 



SI 

i 

RHGARSPLKP 
GMFAGQLTKV 
AGLFQOQKEQ 
RMGIDSEDKV 
QMAVGPPIiHI 
DIiTMELYOHL 
QTQVMBVGNE 



Seq Il> NO: 208 DMA sequence 

Nucleic Acid Accession #: CAT cluster 



TTTGAGGGGG 
TAACAGCCTG 
TGCTGQGGGC 
TAGTGGCAGA 
TCTGGCTGCT 
GGAGGOCTCC 
CGTCTTTATT 
GCAGGTTCCT 
TGAGCTGGCA 



11 
\ 

TGGTGGGGCG 
CCCTTGOAGA 



CTTGCATTTG 
GCAGATCAGT 
GGCAAATGGG 
CACTGCTCCT 
GCGCAG 



21 
I 

AGTTTAATTC 
GAAGTCCTTC 
CTTCAGGCCA 
ATGAGTGGGA 
CATTTGCCAC 
CCGCTCAGCT 
TCATTGGCTT 
TCACCAGTGT 



31 
I 

ATAAAGAAGC 
CTTOAGGATA 
TCOCTGGAGG 
TGGCCTTCTC 
TCAGAACTGC 
GCAGGTTTTT 
CCAAGGCAGT 
CCTGCGAGGT 



41 
I 

CTOCTGATCA 
AGGCCTCCCA 
CCAC3TCCTGT 
AGGTACAGGA 
CGCGATCCCA 
CCAGTCATAQ 
CAGGCCAACX 
CACCTTGGCG 



Seq ID NO: 209 DNA sequence 

Nucleic Acid Accession #x FGENESH predicted 

Coding sequence: 1..564 



1 
) 

ATGGAGCCCT 
TCAGATCCGT 
CAGAGTCACA 
GATCCCTTCT 
CTGGCCATTG 
AAGCAGCACA 
ACCTTGGCCA 
AGCTACACAT 
GATGCTGCTG 




CAST1GGCCT 
ACTATGACTG 
CTQGGATCGC 
GTCCTGTACC 
AATCAAATAA 
CAGTCTTCCG 

ccctcacatc 

GCTGTACCTA 



21 
1 

GCAGGGTTTA 
GDCAGCTCAG 
GACTGCCTTG 
GAAAAACCTG 
GGCAGTTCTG 
TCAGAAGGCC 
ACCTTTATCT 
AGTGCCTCTT 
AQGCCATCCA 
ATGA 



31 
I 

AAGAGCCGAC 
GACACTGGTQ 
GAAGCCAAIG 
CAGCTGAGCT 
AGTGGCAAAT 
ATCCCACrCA 
CCAAGCACCT 
TCTGOGTCCC 
AGCATGCAGA 



41 
I 

CCACGTGCCC 
AAGGAGGAGT 
ACGCATTTGC 
GACTGATCTG 
GGAAATGCAA 
TCACTCCAGG 
TTGTCTTGGT 
TGTACCXTTGC 
ACATAAGCAT 



60 
120 
180 
240 
300 
360 
420 



SI 
1 

GAAAGGGGCC 
GQOGAGGAGG 
GCTCAGCAAG 
CTGTGCTGCT 
GCAATGGCCA 
TAGAAGGGAT 
GTGTGACTCT 
AGGGCTCAOC 



51 
\ 

AGCAGCCTCC 
GAGGAAOCTG 
CAATAAAGAC 
CGGAGGGCTC 
QAGCAGCCAG 
CAGATTTCTC 
GTTTGGCATC 
CATTCCTGGT 
GCAGAACACT 



60 
120 

iao 

240 
300 
360 
420 
460 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seg; ID NO; 210 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

! 1 1 I I I 

MEPWAKTiQGIi KBRPTCPAAS SDPFSAbPAQ DTGEGAVHNL QSHTVGLTAL EANDPFANKD 
DPFYYDWKNI» QI»SGLICGGL IAIAQIAAVL SGRCKCKSSQ KQHSPVPEKA IPLITPGRFI. 

1035 



60 
120 



WO 03/042661 



TLAKSNKPI.S P5TFVLVFGI SYTSVFRVPL SASLYPAIPG DAAALTSGHP SMQNI ENQNT 180 
GTKGCT 

Seq id NO: 211 DMA sequence 

MUcleic Acid Accession ft: FGENESH predicted 
Coding sequence: l.«318 

* 11 21 3i 4i 51 

ATGCCCGGCC ACCCCGTCTG TOAAGTGAGG AGCACCTCTG CCCGGCTGCC COGTCIOGGA fiO 

AGTGAGGAGC GCCTCTGCCC GGCTGCCACC CCGTCTGTGA GfTQCCTGCTG CGCTGGGCCC 120 

AGGCCGCCCG TGCCCTGCCA GGCCCTCCGG COCCCCACCT TCCACCCCAG GGCCTGCTCC ISO 

TCACCCCAGG GTTCCATCTC CCTAQTTTCC ACCAGAGACT GGGTCTTCAT TCTCACCCTG 240 

CTACACAGCC CCTACCAGAA CGTTCTGAAA TGCAAACCTA ACAACTGTCT CACCCCAGCA 300 

GGAAACTCCC CAGGGTCCCQ GGCCCCCTGC GGGOTTGCAG GCCTCACTCT TCGCGCCCAT 360 

CCCrCCOCCC TGACCGCOCT GAGCTCGCCC CCAGTGCTGG CCCTTCACGT CCAGTTATCC 420 

CICCCAGCCT CCAAGGTCCC OSTTACCGAA GACCGCCACC ATCACGACAT AGOGCAGCAC 4S0 

ATATGGQACA CTGGTGAAGG AGCAGTGAGG AACCTGCAGA GTCACACAGT TGGCCTQACT 540 

GCCTTGGAAG OGAATGACCC ATTTGGCAAT AAAGACOATC CCTTCTACTA TGACTGGAAA 600 

AACfCTGCAGC TGAGCGGACT BATCTGCGGA GGGCTCCTGG CCAlTGCTGG GATOGCGGCA 660 

OTTCTGAGTG GCAAATGCAA ATGCAAGAGC AGCCAGAAGC AGCACAGTCC TGTACCTGAQ 720 

AAGGCCATOC CACTCATCAC TCCAGGCAGA TTTCTCACCT TGGCCAAATC AAATAAACCT 780 

TIATCTCCAA GCACCTTTGT CTTGGTBTTT GGCATCAGCT ACACATCAGT CTTCCGAGTG 640 

CCTCTTTCTG CGTCCCTGTA CCCTGCCATT CCTGGTGArG CTGCTGCOCT CACATCAGGC 500 
CATCCAAGCA TGCAGAACAT AAGCATGCAG AACACTGQAA CGAAGGGCTG TACCTAA 

Seq id MOi 212 Protein, sequence 
Protein Access ion tt* FGEHESH predicted 

1 11 21 31 41 si 

11)11) 

KPGHPVCEVR STSARI>PRLG SEERLCPAAT PSVSACCAGP RPPVPCQALR PPTFHPRACS 60 

SPQGSISI.VS TRDWVFILTL LIISPYQNVLK CKPNNCLTPA GNSPGSRAPC GVAGLTLRAH 120 

FSAI/TALSSP PVIiALHVQLS LPASKVPVTE DRHHEDIAQH 1WDTGEGAVR NLQSHTVGI/T 180 

ALKAMDPFAN KDDPPrXDWX Nl^OJLSGLICG GLLAIAGIAA VLSGKOXCKS SQKQH9PVPE 240 

KAIPLITPGR FfcTLAKSNKP LSPSlFVLVF GISYTSVFRV PLSASLYPAI FGDAAALTSG 300 
HP8HQNISHQ NTGTR3CT 

Seq ID 150; 213 DMA sequence 

Nucleic Acid Accession #: FGENESH predicted 
Coding sequences 1,.175B 

1 11 21 31 4i 5i 

I I I I I I 

ATGATGGGGT CTCATGTTGC CCAGGCTGGT CTTGAACTCC TGGGCTCGAG TGACCCTOCT 60 

GCCTTGGCCT CCGAAAGTGC TGGGATTACA GGACTGTXAT TACAGGAATC CATAACACTG 120 

GAGGATGTGG CTGTGGACTX CACTTGGGAG GAG1GGCAAC TCCTGGGCGC TGCTCAGAAG 180 

GACCTCTACC GGGATGTGAT GTTGGAGAAC TACAGCAACC TGGTGGCAGT GGGGTATCAA 240 

GCCAGCAAAC CGGATGCACT CTTCAAGTTG GAACAAGGAG AACAACTGTG GACAATTGAA 3Q0 

GATGGAATCC ACAGTGGAGC CTGTTCAGGT TCTCCAAAGG TCCCGTTCTC CATTTTCTCA 360 

TCTGTGCCCT TCACTCTTCA AAATTGCCTT CATTCE&ACA TATGGAAAGT TGATCATGTG 420 

CTGGAQOGCT TGCAGAGTGA AAGOCTGGTG AACAGAAGGA AACCATGTCA TGAACATGAT 480 

GCATTTGAAA ATATTGTTCA TTGCAGCAAA AGTCAGTlTC TGTTAGGGCA AAAICATGAT 540 

ATATTTGACT TACGTGGAAA AAGTTTGAAA TCCAATTTAA CTTTAGTTAA CCAGAGCAAA 600 

GGCTRTGAAA TAAAGAACTC TGTTGAGTTT ACTGGAAATG GGGACTCCTT TCTTCATQCT 660 

AAGCBXGAAC GACTTCATAC TGCAATTAAA TTCCCK3CAA GTCAAAAACT CATCAGCACT 720 

AAGTCCCAAT TCATCAGTCC CAAGCATCAG AAAACACGAA AATTAGAGAA GCATCATGTG 780 

TGCAGTGAAT GTGGGAAAGC CTTCATCAAG AAGTCTTGGC TAACTGATCA OCAGGTAAttS 840 

CATACAGGAG AGAAACCCCA CAGATGTAGT CTATGTGAGA AAGCCTTCTC CAGAAAGTTC 900 

ATGCTTACTG AACATCAGCG AACTCATACA GGAGAAAAAC CTTATQAATG CCCTQAATGT 860 

GGCAAAGCCT TTCTCAAGAA A1CACGGCTC AACATACATC AGAAAACACA TACCGGAGAG 1020 

AAACCCTATA TATGCAGTGA ATGTGGAAAA GGCTTCATCC AGAAAGGAAA TCTCATTGTA 10B0 

CACCAGCGAA TTCATACAQG TGAGAAACCT TATATATGCA AHaAATOTCG AAAAGGCTTC 1140 

ATTCAGAAGA CGTGTCTCAT AGCACATCAG AGATTTCACA CAGGAAAGAC GCCCTTTGXG 1200 

TGCAGTGAAT GTGGAAAATC CTGTTCTCAG AAATCAGGTC TCATTAAACA TCAAAGAATT 1260 

CACACAGGAG AGAAACCCTT TGAATGTAGT GAATGTGGGA AAGCCTlTAG CACAAAGCAA 1320 

AAGCTCATTG TCCATCAAAG GACTCATACA GGAGAGAGAC CCTATGGCTG TAACGAGTGT 1380 

GGGAAAGCGT T TGCGTATAT CTG6TDTCTG GTTAAGCATA AGAGAAIACA CACAAGGGAG 1440 

AAACAAGAGG CAGOCAAGGT GGAAAATOCT CCTGCAGAGA GGCACAGCTC ATTACACACC 1500 

AGTGATGTCA TGCAGGAGAA AAACTCTGCT AACGGGGCGA CTACACAAGT GCCTTCTGTG 1560 

GCCCCTCAGA CATCATTAAA CATCAGCGQC CTCCTOGCAA ACAGGAACGT AGTCCTTGTO 1620 

GGACAGCCAG TGGTCAGATG TGCAGCCTCA QGAGATAACA GASGATTTGC ACAGGACAGA 1690 

AACCTTGTGA ATGCAGTGAA TGTGGTTQTG CCTTCCGTGA TCAATTATGT CTTATTTTAT 1740 
GTTACAGAAA ACCCATAG 

Seq ID NO i 214 Protein sequence 
Protein Accession #« FGENESH predicted 

? " 21 31 41 51 

TOSGeHVAQAG LEIAG8SDPP AIAfiESAGIT GUiQESITI. JBDVAVDFTWE EWQDLGAAQK 60 

DLYEDVMLBW YSH^VAVGYQ AfiKPDALFKL EQGEQLMTIE DGIHSGACBG SPKVPPSIPS 120 

avPFTLQNCI* H8NIWKVDHV LERLQSESXjV NHRKPCKEHD APKNXVHCSK SQPLLGQMHD 180 

IFDUW3KSUC SN1VT1,VN08K GYEI1OTSVEF TGHGDSFDHA NHEEtLHTAlK FPAfiQKLIST 240 

1036 



WO 03/042661 



KSQFISPXHQ KTRKLrEKHHV CSECGKAFIK. XSWIiTDHQVM ETGBKPHRCS LCEKAFSHKF 300 

KLTEHQRTHT GHKPYBCPEC GKAFUCKSRI/ NIHQKTHTGE KPYICHECGK GFIQKGNLIV 360 

HQRIHTGEKP V1Q3EOGKGF IQKTCLIAHO RFHTGKTPFV C8ECGKSCSQ KSGLIKHQRI 420 

HTGEKPFECS ECGKAFSTKQ KLIVHQRTHT GERPYGCNEC GKAFAYMSCL VKHKRIHTRE 48D 

KQEAAKVENF PAERHSSJjHT SDVMQEKNSA NGAITQVPSV APQTSLNISG LIifiNRHWLV 540 
GQPWRCAAS GDNRGFAQDR N1TVNAVNVW PSVUJYVLFY VTEHP 

Seq ID NO: 215 DNA sequence 

Kucleic Acid Accession #: NWJ>32190.1 

Coding sequence: 502.-2332 

1 11 21 31 41 51 

I I I I 1 I 

GATTCCGTGT TCTTGGCCAT GTTAGCCATA ATATCCTGTG CAGTATGTTT TTCCTGTGCA 60 

GAGGCAAAAA CATATTGGGC ATATGTT CCC AAGCCCCCAG CAGTATGACC CATACTTTGG 120 

AGTOACACTC CTCCTAAGAT TTATCATGAT TAAQGA&CAT GGGCTCCAGG ACCCCTAACT 180 

CCACCTGACA TAGAACAGTT AGACTCTCAG AATAATGTCA TTAATTATAG CGCTCCATTG 240 

GAAGGACTTC CTTTGTGTGT CACCACAAAQ ACATGAiCTCA GCCATAGCTG TCTTACAGTT 300 

CAAGCTCACA CATGGTTGAG TCACTATGGG AAAATCATGT ACTTATTAAG TCTTGGTTAT 360 

ATTAATGTAA CCGGTGTGCT AACCAACCAT TCCTGGCCCA ATCGCCTTCA TTGTGCTGAC 420 

TAXACAGAAT GGATTCCCTT CAATAGTTCC TACCCCCCTC CATAGACCCA GTGTCTTGGC 450 

CCACTGGCTA GAAAACAATC TATGTTAACT QGAGACATTG TGGATTGGGG ACCTAAASGC 540 

CAATTAGATG GAAAAOAAOA AAATCAGAAA TCGTGGCACA AACTTTGCTG GCATTGGTGG 600 

CAASCTTTTA ATGCTTCTTC TTTATATAAC ACTGGGATCC AATCCCAGTC GGCCGCCCAG 660 

ATTGCTTGGC ATGGAGCAGG CTTTAGCCCG CCTCTTCCTC AGtTGGCATTA TCTAGGGAGG V20 

AAAGGACCAA TTCAAAAGAT GATATGOAAG GCAGCATTCC CATTTATGAA TGGCAACATC 780 

TGGGTTQCCA TAATACTATC CAATAATAGC AATAGTAAQC AACACAGTCT TAATOTTACA 840 

TITGTAAAGA ATATCACCAC TCAATTTACA GTTTGTGTTT TTAATCCTTA TGTGTTTTTG 900 

GCAOCTAAGA AGGACCAGCT OCAGGTAAAC AATACCCAAT TGACCTGTAA ATCTTGCCAG 960 

TTATATCACT GCATTAATCA TAGCACATTG CAAACACATA ATATCTCTAC TTTGATGATT 1020 

TTAOGTTGCA TCCCTQGGCT ATGGATTCCT □TTAATCTGT CTGAGCCATG GGCTGCCACA 1080 

ATTGCTTTAC ATTTTGTGAA ACTTCTTCTA ACTCAGTTTA CTCATTGTGT CCGTAGAGGC 1140 

TTAGGCATGA TAATTTTTGC TATTGTTTAC TTGGTCACAC TAATAATTTC TGTTGTGAXG 1200 

TCCTCTGTAG CTTTGCATAG TTCTATTCAA ACAGCTCAGT ATGTGOAGAA CTGGACACGC 1260 

ACAGTCAACC AAGGGTGGCT ACTTGAGAAT AAAATTAACA CTGAGTTACA AACTGAAOTG 1320 

GCAGTGTTAT AATCCACGAT TCTATGGTTA GGGGAACAAQ TACAAAGCTT GCAATTGCAG 1380 

CAGTAATTGT GTTGTCATTT TAATCACACT CATATTTGTG TAACCAACTT AGAATATAAC 1440 

CAAAGTGAGT ATCCATGGGA TCTTGTGAAA GCCCATTTGC AGGGAGCTTT CACATCCGAC 1500 

ATCACCTTTG ATATTGGTGA ATTACAAAAC AAAATTCTTG ATCTAAATAA ACAAATTCCA 1560 

GAGTTTCAGC CTTCTTTAGA ASACTGGACT GAATTCCAGC AAGOCCTGGA GAGCGTCAAC 1620 

CC1TGGACCT ATCTAAAGCA CCACATTAAC ATCTTATATA TAGTTCTTGG AATAATGTTG 1600 

TTTTGTCTCT GTCTTCTGTT CATAGTCTGT AAAATCGGAT GGACTGOCAA TGGGAGAATG 1740 

AAAGCTACCC AGCCTGGCCT TACATTCTTT CACTTAATAC ATAAACAAGA AGGGGGAAAT 1800 

GTTGGGAGCC AAAAAGGCCA AAGGGATGGT GaCCAACTCA GCATTCCACT GGAGGCTACA 1860 

TGATCAAACA GCAAACTGTT TATCATGAAT ACAGAATGTG QGCAAACTCG CTTCTGTGCC 1920 

TGCCCAGAAG GTTTGCTGAG GGCCATCGCT CCCTGGCOCC GGCTCCTTGA GGTTATCEAC 1980 

TGGGACATCT AGAGCCTATT GTTCGAGGAA TGCAGTCTTG CAAGCCTACT CTGGACCGAQ 20 40 

CAGCTGACCT CTTCTTCCAC ACCCCTTCTC ACTATCTCTT TTGCCTAATA AATATGGAGG 2100 

GCTQTGTAAA GCTCAGGGCC CTTGTCCACT AGAGGCAAGG TGTCCCCTGA CCCTTCTTCC 2160 
AAACAT 

Seq ID NO: 216 Protein sequence 
Protein Accession #: NP_115566*1 

1 11 21 31 41 51 

i I I I I I 

MLTGDIVDWG PRGQLDGKEE NQK8WBKLCM KWWQAFHASS LYNTGIQSQS AAQIAWHGAG 60 

F8PPLPQWHY LGRKGPIQKM IWKAAFPFMN GMIWVAIH.S KNSNSKQMSL HVTFVKNITT 120 

QFTVCVFKPY VFLAAKKDQJ. QVNNTQLTCK SCQLYHCINH STLQTHKIST TMTHSCIVGL 180 

WIPVK1»SEPW AATIALHFVK LltLTQFTHCV RRGLGMIIFA IVYl*VYIiII8 WW8SVALH3 240 
SIQfTAQYVEN WTRTVNQGML LEHKlKTEIiO TEVAVL 

Seq ID MO: 217 DMA sequence 

Nucleic Acid Accession Jit fgenbsh predicted 

Coding sequence* 1 . .1566 

1 11 21 31 41 SI 

I I I 1 I I 

ATGGTGAACC CCAAATCCAC TTCCTOCCTC TTCAGGTTAT GTTTTTTGCT CCTGAGGAGT 60 

CAQAACCTGT QGGTTGAAGA GCAAATTCAA TGCAAAAACA TATTGGGCAT ATGTTOCCAA 120 

TCCCCCAGCA GTATGGCCTA TACTTTGGAG CTCACTCCTC CTGAGATTTA TCACGATCAG 180 

GGAGAGTGGG CTCCAGGACC CCTAACTCCC CGTGAJC&TAG AAAAGTTAGA CTCTCAGAAC 240 

AATGTCATTA ATTATACCAC TCCACTQGAA GGACTCCCTT TGTTTATCAC CACAAAGAGG 300 

TCQCTCABCC ATAGCTGXCT TGCAATTCAA GCTCAAACRT GGTTGAGTCA CTATGGAAAA 3 60 

ATTATGTACT TATTAGGTCT TGGTTCTATT AATGTAACTG GTGTGCTAAC CAATCATTCC 420 

CAGTCCAGTC ACCCTAATTG TGCTGATTAT ACAGAATGGA TTCCATTCAA TAGTTOCTAC 480 

CCCACTCTGT GGACCCAGTG TCTTGATCCA CTGGCTAGTA AACAATATAT GTCAACTGAA 540 

GACACTGTGG ATTGGGAACC TAAAGGTCAA TIAGATGGAA AAGGTGAAAG TCAGAAATCA 600 

TGGCACAAAC TTCACTGGCA TTGGCQGCAA GCTTTTAATG CTTCTTCTXT ATACAACAGC 660 

AGAATCCAAT CCCAGTCTGC TGCTCRGATT GCTTGGCATG GAGCAGGCTT TAGCCCACCT 720 

CTTCCTCAGT TGCATTATCT GGGGAQGAAA GGACCAATTC AAGAAACTAT ATGGAAGGCA 780 

GCACTCCCAT TTATGAATGG CAACATCTGG ATTGGAACAC TGTCTAATAA TAGCAATAGT 640 

AAGCAACACA GTCTTAATGT TGCATTTGTA AAGAATATCA CZCRCTCftGTT TACAGTTTGT 900 

GTTTTTAATC CTTATGCCTT TTTGGCAGCT AAGAAGAACC AGCTTCAGGT GGAGAACTGG 960 



1037 
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PCT/US02/3681© 



10 



ACACGCACAG 
GAAGTGGCAA 
TTGCAGCAGC 
TATAACCAAA 
TCCAACATCA 
ACTCAAQAAT 
CTCAACCCTT 
ATGTTGTTTT 
AGAATGAGAQ 
GGATATGCAG 
ATATGA 



CTGACCAAGC 
TGTTGAAATC 
AATTGCGTCA 
GTGAGTATCC 
CCTTTGATAT 
TTCAGCCTTC 
GGACCTATCT 
GTCTCTGTCT 
CCTCCCAGCC 
GGAGCCAAAG 



GAGGCTACTT 
CATGGTTCT3 
TCATTTTAAT 
GTGGQACCTT 
TGGTGAATTA 
TTTAQAAGAC 
AAGGCACCAC 
TCGGTTTATA 
CAGCCTTACA 
GCCTGTGGGA 



CAGAATAAAA 
TGGTTAGGAG 
CACATTCATA 
GTGAAAGCCC 
CAAAACAAAA 
TGGACOGAAT 
ATTAACATCT 
GTCTGTAAAA 
TTCTTTCAAT 
OGTGACCAAC 



TTAACACTGA 
AACAGGTACA 
TTTGCGTAAC 
ATTTGCAAGG 
TT ATTGATT T 
TCCAGGAAGG 
TAIATGTAGT 
TCGGATGGAC 
TAATACATAA 
TCAGCATTCT 



GTTACAAACT 
AAGCTTGCAG 
TAACTCAGAA 
AGCTTTCACA 
AAATAGGCAA 
CCTGGAGAGC 
TCTTGGAATA 
CACCAATTGG 
ACAGAAAGGG 
GCTGGAGGCT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 

1500 
15G0 



15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



Seq ID NO: 218 Protein sequence 
Protein Accession ^ : FGBEJESH predicted 



1 
I 

MVNPKSXSSL 
GEWAPGPLTP 
IMYbLGLGSI 
DTVDWEPKGQ 
LPOliHYLGRK 
VFNPYAFIAA 
IiQQQLREHFN 
TQEFQPSLED 
RMRASQPSLT 



11 
I 

FRLCFLLLRS 
RDIEKLDSQN 
NVTGVLTMH6 
LDGKGE3QKS 
GPIQETIWKA 
KEQJQliQVENW 
HIHICVTNSS 
WTKP Q EGLE S 
PPQLIHKOKG 



21 
I 

QNLWVBEQIQ 
NVINYTTPLB 
OSSHPNCADY 
WHKLHWHWRQ 
ALPPMNGNIW 
TRTADQARLL 
YNQSEYPMDL, 
I±H1?W X YIJRHH 
GYAQ3QRPVG 



31 
I 

CKNILGICSQ 
GLPLFITTKT 
TEWIPFNS9Y 
AFHASSLYN8 



QNKINTEUQT 
VKAHLQGAPT 
INIIiYW&GI 
RDQLSILLBA 



41 
I 

fiPSSMAYTLB 
SliSHSCIiAIQ 
FTUNTQCLDP 
RIQSQSAAQI 
KQHSLNVAFV 
FVAMIJCSMVL 
SNITPDIGEL 
MLFCLCUIF3 
I 



Beg ID NO: 219 DNA sequence 
Nucleic Acid Accession. #; 
rv>^ rvj sequence? *1- -90O 



FGENESH predicted 



1 
1 

ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CDGQCCCACTG 
GGCTGCGGCG 
GGACCCCGGG 
CTTCCTAACT 
CCGGTQCGCA 
CTTTGCTACC 
TTTCAAAACA 
GTGCTGCTGG 
GACCAGGGQQ 
ATCCGAGCCT 
TTTGACTCGG 
GCCAAAGGTG 



11 
I 

GGGAGC7TGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGCTGTDCA 
GAGGAGACTG 
CAGGCTCTCC 
TTGAGCTCTG 
CGGATACCQA 
TCACAGAGAA 
TGGGCACCCA 
GCCGGGAGGG 
GCTGCTACCT 
Cr ATT C T CAG 
TGCGCACCCT 



21 
I 

OGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGGT 
CCGGGGAGCT 
GAGCAGGCCC 
CCGOCCCGCC 
GGACACAGCG 
TGTCTTCCTG 
ATGGCTGCCC 
GGCCGACCTG 

TGAQTGCTCA 
TGCCATTGAG 
CTCCCTCTGC 




QGCATCAAGT 
TACACCTGCA ATGGQTACCC 
ACGTACGTTC AATCGCCCGT 



COAGGTGGCG 
CCTGCAGTGC 
C3GACAGGAGG 
GCGTGCTTCA 
GAGATCCGCA 
AGGGACGATG 
CAACCCCAGG 
GCCTTGACGC 
CACAAA3CCC 
CGCTGGAAGA 



CTGGTGCGGC 
AAGTCCTGGT 
ATTTTGACCG 
GCGTGGTGCA 
CGCACAACCC 
TCAACGTACT 
CTCAGGGTCT 
AGAAGAACTT 
GQCTGGAGAA 
AGTTCTTCTG 



51 
I 

LTPPEIYHDQ 
AQTWLSHYGK 
LASKQYMSTE 
AHHGAGFSPP 
KNITTQFTVC 
WLGEQVQSLQ 
QNKIIDLNRQ 
VCKIGWTTNW 



51 
I 

QACCCCTCCC 
GGGOGAOGGC 
CGOGCGCTAC 
GCGGCCGCGT 
AGGGCGCAGA 
CCAGGACGCT 
QGATGGAGCT 
ACTTCGTTCC 
gcccagctcc 



AATTCAGCTG 
GGCCGAGAAG 
GAAGGAAOTA 
GAAACTGAAT 
CTTCOTTTQA 



Seq IP NO; 220 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I 1 1 I I I 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEIi GIKCVLVODG AVGKfiSLIVe YTCNGYPARY 
RPT ALtDTFS G rYVQSPVKPH GCGGAVHRGA GAGVSAGGRR GFRGOTWSRF HGGAGAAQDA 
LPNSGBPSPA PAVQVIfVDGA PVRIELflDTA GQEDFDRLSS LCYPDTDVFI, ACFSWQPSS 
FOJ6TITSKWLP BIRTHNPQAS VLIjVGTQADL RDDVNVLIQIi EQQGREGFVP QPQAQGLAEK 
IRACCYLECS AIjTQKN1,XEV FDSAILSAIB HKARLEKKLN AKGVRT&SRC RWKKFFCFV 

Seq ID NO: 221 DNA sequence 

Nucleic Acid Accession ft: XM_0G3832.2 

Coding sequence: 1. .711 



1 
1 

ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
ATTGAGCTCT 
CCGQATACCG 
ATCACAGAGA 
GTGGGCACCC 
GGCCGGGAGG 
TGCTGCTACC 
GCTATTCTCA 
GTGCGCACCC 



11 
1 

GGGAGCTGAG 



AGAGCAGCCT 
CGCTGGACAC 
GGGACACAGC 
ATGTCTTCCT 
AATGGCTGCC 
AGGCCGACCT 
GCCCCGTGCC 
TTGAGTGCTC 
GTGCCATTGA 
TCTCCCGCTG 



21 
I 

CGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTOTG 
GGGACAGGAG 
GGCGTGCTTC 
CGAGATCCGC 
GAGGGACGAT 
CCAAJCCCCAG 
AGCCTTGACG 
GCACAAAGCC 
CCGCTGGAAG 




GTCAACQTAC 
GCTCAGGGTC 
CAGAAGAACT 
CGGCTGGAGA 
AAOTTCrTCT 



41 



GCGTGCTGGT 
ATGGGTACCC 
TGGATGGAGC 
GACTTCGTTC 
AGCCCAGCTC 
CCCAGQCGCC 
TAATTCAGCT 
TGGCCGAGAA 
TGAAGGAAGT 
AGAAACTGAA 
GCTTCGTTTG 



€0 
120 
180 
240 
300 
360 
420 
4B0 



Seq ID NO: 222 Protein sequence 
Protein Accession ft: XP_063 832.1 



11 
I 



21 
I 



31 
I 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



SI 
I 

GACCCCTCCC 
GGGCGACGGC 
CGOGCGCTAC 
TCCGGTGCGC 
CCTTTGCTAC 
CTTTCAAAAC 
TGTGCTGCTG 
GGACCAGGGG 
GATCCGAGCC 
ATTTGACTCG 
TGCCAAAGGT 
A 



SI 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
S4D 
600 
660 



1038 
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5 

10 
15 



20 



25 



35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



MPPRELSEAE PPFLRAPTPP PRRRSAPPKL GI KCVLVGDG AVGKSSI*IVS YTCNGYPARY 60 

RPTALDTF3V QVLVDGAPVR IELWDTAGQE DFDRLR3LCT PDTDVFLACF 9VUQP9SFQN 120 

1TEKWLPEIR THNPQAFVLL VGTQADLRDD VNVLIQLDQG GKSOPVPQPQ AQGLABKXRA 180 
CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KPPCFV 

Seq ID NO: 223 DKA sequence 

Nucleic Acid Accegsion U - Eos sequence 

Coding s^guance: 1..1161 



1 11 21 31 41 51 

I I I I I I 

ATGAATOGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 120 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180 

TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCIGACT TTCTACTGAT CATCTGCCTG 240 

CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 300 

CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT OCTCACGGTG 360 

GTGGCGGTAG ACAGGTATTT CCGGGTQGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 460 

CACCTOCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC S40 

AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600 

CTGGQCATCA TCCTGTrCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780 

TCGGGCACGC AGAATTGTOA AGTGTACCGC TGGGTGGAGC TGGCGTTCTT TATCACTCTC 840 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTCCAGAGG OGTTAATGGC CAACTCCOGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080 

JU ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140 
CAGTTGGGAT GTTGCATCGA G 



Sea; ID NO« 224 Protein Begngnge 
Protein Accession ft : Eos sequence 

1 11 21 31 41 SI 

I 1 I I ! I 

MNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLKBWK 60 

SSR1FLFNLA VADFLLIICL PFVMDYYVRR SDWKFGDIPC RLVLFMFAMH RQGSIIFLTV 120 

VAVDRYFRW HFHHALNKIS NWTAAII6CL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 180 

SICHTPRMHE AMFLLEFLLP LGin-FCSAR IIWSLRQRQM DRHAKIXRAI TFIMWATVF 240 

VICFLPSVW RlRIPWLLHT SGTQBtCEVYR GVDLAFFITL SFTYMN8MLD PWYYFSSP6 300 

FPNFPSTLIN RCLQRKMTGE PDNMRSTSVE LTGDPNKTRG APEALMAUSG EPWSPSYLGP 360 
TSHNHSKKGH CHQBPASLEK QLGCCIE 

Seq ID HOi 225 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1 , ,1092 

1 11 21 31 41 51 

111)11 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

TTCCGAGATG ACTTCATTGT CAAQGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTC 120 

GGGCTTCTGG GCAATGGOCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180 

TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTGGCTGACT TTCTACTGAT CATCTGCCTG 240 

CCCTTCCTGA TGGACAACTA TGTGAGGCGT TGGGACTGGA AGTTTGGGGA CATCCCTTGC 300 

CGGCTGATGC TCTTCATGTT GGCTATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360 

GTGGCGGTAG ACAGSTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

AATCGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTATTGG CCTGACAGTC 480 

CACCTOCTGA AGAAGAAGAT GCCGATCCAG AATGG03GTG CAAATTTGTG CAGCAGCTTC 540 

AGCATCTGCC ATACCTTOCA GTGGCACGAA GCCATGTTCC TCCTGGAGTT CTTCCTGCCC 600 

CTGGQCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780 

TCGGGCACGC AGAATTGTOA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 840 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 90D 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080 

Seq ID NO 1 226 Protein sequence 
Protein Accession #? Eos sequence 

1 11 21 31 41 51 

i 1 * 1 1 » 

MNRHHLQDHF LEIDKKNCCV FRDDFIVKVL PPVLGLEFIF GLLGMGLALW IFCFHLKSWK 60 

SSRIFLFULA VADFLIiIICL PFLMDNYVRR WDWKPC3DIPC RLMLFMLAMN RQG3IIPLTV 120 

VA VDRY FRW HFHHALNKIS NRTAAIIBCL LWGITIGLTV BLLKKKMPIQ WGGANLCSSF 180 

BICHTFQWHE AMFLLEFFLP LGIlLFCSAR IIW3LRQRQM DRHAKXKRAI TFIMVVATVF 240 

VICFLPSWV RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMN8NLD PWYYFSSPS 300 

FPNFFSTL IN RCLQRKMTGE PDNNRSTSVE 1TGDFNKTRG APEALMANSG EPWSPSYLGP 360 
TSP 
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Seq ID NO: 227 DMA sequence 

Nucleic Acid Access ion #1 NM_006018 

Coding sequencer 61.. * 122 4 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CGCCACTTTG 
ATGAATCGGC 
TTCCGAGATG 
GGGCTTCTGG 
TCCAGOCGGA 
CCGTTCGTGA 
CGGCTBGTGC 
GTGGCGGTAG 
AATTGGACAG 
CACCTCCTGA 
AGCATCTGCC 
CTGGGCATCA 
GACCGGCATG 
GTCATCTGCT 
TCGGGCACGC 
AGCTTCACCT 
TCXXCCCAACT 
CCAGATAATA 
GCTCCAGAGG 
ACCTCAAATA 
CAGTTGGGCT 
CAGATTCAQA 
GTGT6ACCAC 
TTCATCTCTG 
AGTTGGAGCC 
GGGGGGBGCT 
GAGAGCTGAG 
AGCCAGTAGG 
TGGACTCAGG 
AGGGGATTA0 
TTGCTGCTTT 
GQTTGAQCRG 



CTTOSCTTOT 
CAAAAAAAAA 



11 
1 

CTGQAGCATT 
ACCATCTGCA 
ACTTCATTGC 
GCAATGGCCT 

ttttcctgtt 

TGGACTACTA 
TCTTCATGTT 
ACAGGTATTT 
CA3CCATCAT 
AGAAGAAGTT 
ATACCTTCCG 
TCCTGTTCTG 
CCAAGATCAA 
TCCTTCCCAG 
AGAATTGTGA 
ACATGAACAG 
TCTTCTCCAC 
ACCGCAGCAC 
CGTTAATGGC 
AOCATTCCAA 
GTrGCATOQA. 
GAATCTGATT 
AGGAATCCTG 
ACQCTCGCAG 
AGAGATCTAC 
CAQCTCCTCQ 
ATTGGAGGGA 
TCACCTGGCT 
GAAGAGACTC 
CCCCAGAAGG 
CAACCAGCGA 
TTAAAACGGC 
AAACGTGCCT 
GTTTCTGTAC 
A 



21 
I 

CACTAGGCGA 
GGATCACTTT 
CAAGGTGTTG 
TGCCCTGTGG 
CAACCTGGCA 
TGTGCGGCGT 
TGCCATGAAC 
CGQGGTGGTC 
CTCTTGCCTT 
GCTGATCCAG 
GTGGCACGAA 
CXCAGCCAGA 
GA6AGCCATC 

AGTGTACCGC 
CATGCTGGAC 
TTTGATCAAC 
GAGCGTCGAG 
CAACTCCGGT 
GAAGGGACAT 
GTAATGTCAC 
TAGGGAAACT 
GAGGAACAGA 
GACTGAAGAT 
TTGTGACTTG 
GGTGATATCT 
ATTGTGTTGC 
TCCGTGGACC 
ACATGCTTTG 
GACTGAGCTA 
CTAATGCAAT 
TTCAGGATAG 
CTGCCCCACG 
TTACCAAAAA 



31 
1 

GGCGCTCCAT 
CTGGRAATAQ 
CCGCCGGTGT 
ATTTT CTGTT 
GTAGCTGACT 
TCAGACTGGA 
CGCCAGGGCA 
CATCCCCACC 
CTGTGGGGCA 
AATGGCCCTG 
GCTATGTTCC 
ATTATCTGGA 
ACCTTCATCA 
CGGATCCGCA 
TCGGTGGACC 
CCOGTGGTGT 
CGCTGOCTCC 
CrrCZWCAGGGG 
GAGCCATGGA 
TGTCACCAAG 
TGQACTOGGC 
GTGGCAGATG 
GAGTAARGCT 
GGGCAAATTG 
TTGGCCTTCT 
AGCCTGCTTG 




TGGGGCTGGA 
TCCACCTCAA 
TTCTACTGAT 
ACTTTGGGGA 
GCATCATCTT 
ACGCCCTGAA 
TCACTGTTGG 
CAAATGTGTG 
TCCTGGAGTT 
GCCTGCGGCA 



TGGCGTTCTT 
ACTACTTCTC 
AGAGGAAGAT 
AOCeCAACAA 
GCCCCrCTTA 
AACCAGCATC 
CTAAOOTTTC 
AGTGGGAGAC 
TCTAGGCATC 
TAGGOGTTTC 
TCCCACATCT 
TGAGCTCTAG 



AATTCATCTT 
GTTAGTA7CT 
AACAGTGTTA 
CCATTCCTCT 
AAAGCTGTTT 



TCTACCACTT 



TCAGACAAGC 
QTGTTTCCGG 
TTATGGGAAA 
CTTGTTTATA 
COCAOCTGTT 
GTGCACGTTC 
CAATAAATTT 



51 
1 

agccgcactc 
ctgctgtgtg 

BTTTATCTTT 
GTCCTGGAAA 
CATCTGCCTG 
CATCCCTTGC 
CCTCACGGTG 
CAAGATCTCC 
CCTAACAGTC 
CATCA3CTTC 
CCTCCTGCCC 
GAGACAAATG 
CATCGTCTTT 
CCTGCACACT 
TATCACT CTC 
CAGCCCATCC 
GACAGGTGAG 
AAiCCAGAGGC 
TCTGGGOOCA 
TCTGGAGAAA 
CTGGAACTTC 
TGGTTGCAAG 
TGAAACTTGC 
TGCTGAGCAG 
GCCTCAGACT 
CAGGGATAAG 
TCATTAAACA 
TTTAGAGAAA 
TGGGTGTAAT 
GGAAATGGCA 
GTAATCTAAB 
TCGTTTXACC 



TGATAGGAGA 



Seg ID HO; 228 Probein sequence 
Protein Accession #: NP_006009,1 



1 11 

I I 

MNEHHIjQDUP UEIDKKNCCV 
SSRIFLFNLA VADFLLIId. 
VAVDRYFRW HPHHAIiNKI8 
SICHTFRWHE AMFLLEFIiP 

viCFi/PBvw hieifwiiUtt 

FPWFFSTI)IN RCLQRKMXGE 
TSNNHSKSCGH CffQKPAgLEK 



21 



Saq ID NO: 225 DHA pequence 

Nucleic Acid Accession #: NM_01439B,l 
Ceding sequencex 64..1314 



1 
I 

GGCAOOGATT 
ACCATGCDCC 
CACGATGGCA 
ACXGCAGCAG 
CCTCACCAAA 
ACAGTAAAAA 



GTTACTGAAG 
CCACCAGCTC 
ACTCAACCCA 
ACAACOGGTC 
AATACCACCC 
CCAT0GTCAG 
GCAGAGATGG 
TACTTCAACA 
AACCTTCTGT 
TCATATTATA 
CAAGGAATCA 
GTGAGTGAAC 
CTTCAAGCCT 
TACACAATTG 
GGTGTCTATA 
CCCBGGGGGA 



11 
I 

CGGGGCCTGC 
GGCAGCTCAG 
GTCAAATGAG 
CAACAQTACA 
CTTTAGCAGC 
TTCCAACAAC 
CCCTGGTCAC 
TTACAGTCGG 
ATACAGCTGG 
GTAACCAGAC 
AGAAGCCTGA 
GCACAGCTGC 
XCAAGACTGG 
GGATACAGCT 
TCGACCCCAA 
TGAATTTXCA 
TCAGTGAAGT 



AGAGCCTCCA 
TTGATTTTGA 
TGCTTCCTGT 
AAATCCGCCT 
ATGAAAATAA 



21 
I 

CCGGACTTCG 

AGCAAAAGCA 
GOACATAAAA 
AAGATTCATG 
XACCCCAGCA 
AACCCAGGCC 
CCCTAGCTTA 
AACCAGTTCA 
CACCCTTGCA 
TCAACCCACC 
ACCTGCCTCC 
AATTTATCAG 
GATTGXTCA& 
CGCAACGCAA 
GGGCGGATX-r 
GGGAGCCTAT 
GGTGATGTTC 
GTTGTCAGCC 
AGATGACCAC 
GATTGGGGCC 
AAGGTGTCAA 
TGGAATTTAG 



31 
I 

CCGCACQCTG 
GCGCTCTTCG 
XTTCCAGAAA 
AAACCTGTCC 
GATGGTCATA 
ACTACAAAAA 
ACACGCAACA 
GCCCCTTATT 
TCAACCGTCA. 
GCAACTTTAT 
CATGCCCCAG 
ACGGTTCCTG 
GTTCTAAACG 
GACAAGGAGT 
GCCTCTGGGA 
GTGAATCTCA 



-41 

I 

CAGAACCTCG 
CGTCCCTGGC 
CCAGAGATTA 
AGCAACCAGC 
TCACCTTTCA 
ACACTGCAAC 
ACTCACACAC 
CACTGCCACC 
GCCACACAAC 
CGATAGCACT 
GAACAACGGC 
GGCCCACCCT 
GAAGCAGACT 

ACTGTGGCAC 
CATTTACCAA 



60 
120 
190 
240 
3 00 
360 
420 
4S0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 
13BO 
1440 
150O 
1560 
1620 
1660 
1740 
1800 
1060 
1920 
1980 
2040 



31 41 51 

1111 
FRDDFIAKVL* PPVLGLBFIF GLLGNGLALW IFCFHLKSWK 
FFVMDYYVRR SDHNFGDIPC HIjVLFMFAMN RQGSIIFDTV 
NWTAAIISCL LKGITVGIiTV HLIiKKKLLIQ NGPANVCISP 
LGIIIiPCSAR IIwautQHQM J2RHAKIKRAI TTIMWAXV7 
8GTQNCEVYR SVDLAFFIT1, SPTYMNSMUJ PWYYFSBPS 
PE8SNP.STGVE I/TOD PUKiaG APE71LMAKSG EFWSPSYLGP 



CAGACAGCAG 
CACCTGCAGG 
TTTGGAAATG 
ATCGTGGTTG 
TCATCTGGAT 
AGAACTCTTT 



TCGGGCATTC 
TGAAAACAAC 
TGGATGAGTG 
GTCTCTOCCT 
ACCAGAGAAT 
CATCCCTTCC 



60 
120 
180 
240 
300 
360 



51 
1 

CCCAGCGCCC 
CGTAATTTTG 
TTCTCAACCT 
TAAGCAAGCA 
AACAGCGGCC 
CACCAGCCCA 
AGCTCCXCCA 
CACCATCACC 
TGGGAACACC 
GGACAAAAGC 
AGCTGCOCAC 
TGCACCTCAG 
CTGTATAAAA 
ACCTCGGAGA 
CCGAAAATCC 
GGATGAAGAA 
GACAGTTTAC 
CTTCAAGTGC 
CGATGTCCAA 
CTCGTCTGAC 



CTAATTGTTG 
AGGATGGATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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TTGGGAAATT COCTCAGAGT GTGGGTCCTT CAAACAATGT AAACCACCAT CTTCTATTCA 1440 

AATGAAGTGA GTCATGTGTG ATTTAAGTTC AGGCAGCACA TCAATTTCTA AATACTTTTT 1500 

GTTTATTTTA TGAAAGATAT AGTGAGCTGT TTATTTTCTA GTTTCCTTTA GAATATTTTA 156Q 

GCCACTCAAA GTCAACATTT GAGATATGTT GAATTAACAT AATATATGTA AAGTAGAATA 1S20 

5 AGCCTTCAAA TTATAAACCA AGGGTCAATT GTAACTAATA CTACTGT C3TG TGCATTGAAG 1680 

ATTTTATTTT ACCCTTGATC TTAACAAAGC CTTTGCT3TG TTATCAAATG GACTTTCAGT 1740 

QCTTTTACTA TCTGTGTTTT ATGGTTTCAT GTAAOVrACA TATTCCTGOT GTAGCACTTA 1800 

ACTCCTTTTC CACTTTAAAT TT6T T T TT ST TTTTTGAGAC GOAGTTTCAC TCTTGTCACC I860 

CAGGCTGGAG TACAGTGGCA OGATCTCGGC TTATGGCAAC CTCCGCCTCC CGGGTTCAAG 1920 

10 TGATTCTCCT GCTTCAGCTT CCCGAGTAGC TGGGATTACA GGCACACACT ACCACGCCTG 1980 

GCTAATTTTT GTATTTTTAT TATAGACGGG TTTCACCATG TTGGCCAGAC TGGTCTTGAA 2040 

CTCTTGACCT CAGGTGATCC ACCCACCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCATG 2100 

AGCCATTGOG CCOGGCCTTA AATGT TT T TT TTAATCATCA AAAAGAACAA CATATCTCAG 216D 

GTTQTCTAAG TGTTTTTATG XAAAACC&AC AAAAAGAACA AATCAGCTTA TATTTTTTAT 2220 

15 CTTGATGACT CCTGCTCCAG AATTGCTAGA CTAAGAATTA GGTGGCTACA GATGGTAQAA 22 BO 

CTAAACAATA AGCAAGAGAC AATAATAAT G GCCCTTAATT ATTAACAAAG TGCCAGAGTC 2340 

TAGGCTAAGC ACTTTATCTA TATCTCATTT CATTCTCACA ACTTATAAGT GAATGAGTAA 24Q0 

ACTGAGACTT AAGGGAACTG AATCACTTAA ATGXCACCTG GCTAACTGAT QGCAGAGCCA 2460 

GAGCTTGAAT TCATGTTGGT CTGACATCAA GGTCTTTGGT CTTCTCCCTA CACCAAGTTA 2520 

20 CCTACAAQAA CAATGACACC ACACTCTGCC TGAAGGCTCA CACCTCATAC CAGCATACGC 25B0 

TCACCTTACA GGGAAATGGG TTTATOCAGG ATCATGAGAC ATTAGGGTAG ATGAAAGGAG 2640 

AGCTTTOCAG ATAACAAAAT AGCCTATCCT TAATAAATCC TCCACTCTCT GGAAGGAGAC 2700 

TGAGGGGCTT TGTAAAACAT TAGTCAGTTG CTCATTTTTPA TGGGATIGCC TAGCTGGGCT 2760 

GTAAAGATGA AGGCATCAAA TAAACTCAAA GTATTTTTAA ATTTTTTTQA TAATAGAGAA 2B20 

25 ACTTCGCTAA CCAACTGTTC TTXCTTGAGT GTATAGCCCC ATCT TGTGGT AACTTGCTGC 2680 

TTCTGCACTT CATATCCATA TTTCCTATTG TTCACTTTAT TCTGTAGAGC AGCCTGCCAA 2940 

GAATTTTATT TCTGCTGTT1* TTTTTGCTGC TAAAGAAAGG AACTAAGTCA GGATGTTAAC 3000 

AGAAAAGTCC ACATAACCCT AGAATTCTTA GTCAAGGAAT AATTCAAGTC AGCCTAGAGA 3060 

CCATGTTGAC TTTOCTCATG TGTTTCCTTA TGACTCAGTA AGTTGGCAAG GTCCTGACTT 3120 
30 TAGTCTTAAT AAAACATTGA ATTGTAGTAA AGGTTTTTGC AATAAAAACT TACTTTG3 

Beq ID NOi 230 Protein sequence 
Protein Accession #: NP_0 55213.1 

35 1 11 21 31 41 51 

1 I I I I I 

MFRQU9AAAA I^ASIAVHiK DGSQMRAXAF PBTRDYSQPT AAATVQDIKX PVQQPAKQAP 60 

HQTLAARFMD GHITPQTAAT VKIPTTTPAT TKNTATTSPI TYTIiVTTQAT PNNSHTAPPV 12 0 

TEVTVGPSliA PYELPPTITP PART ACTS S3 TVSHTTGNTT QPSNQTTIiPA TDSIALEKST 180 

40 TGQKPDQPTH APGTTAAAEN TTHTAAPAST VPGPTIAPQP SSVKTGIYQV LHGBRLCIKA 240 

EMG1QLIVQD KESVFSPSRY FNIDPUATQA SHHOGTHKSN LIiLNPQGGFV NIjTFTKDEKS 300 

YYISEVGAYL TVSDPKTVYQ GIKHAWHFQ TAVGHSFKCV 3BQSLQX.SAH LQVKTTDVQL 360 
QAFDFEDDHF GNVDECSSDY TIVLPVIGAI WGliClMGMG VYKIULHCQS 9GYQRI 

45 Seq ID HO: 231 DMA sequence 

KUcleic Acid Accession MM_OOS409.3 
Coding sequence; 94*. 376 

_^ 1 11 21 31 41 51 

50 i | | | I I 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGTGC TACRGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

55 ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTITGTACAG TTATGTGAAG GATGAAAGGT 540 

60 GGGTGAAAGQ ACCAAAAACA GAAATACAGT CITCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAQCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGXXCTTAC TTGTTGTATT ATAGATICAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATQCTTAC AAjCTATTCTG TTGTGACTAT 780 

. _ GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATQCT AlATTACTAT B40 

65 CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 

CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATrTTTTCAT GCAAATACAC ACTXCTTTGC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GQAAACRTTC TCATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATAOGCTA 1080 

TGGX3ATACTQ GCAACAGTGG ACATATTTCA TAACCAAATT AQCAGCACCG GTCTTAATTT 1140 

70 GATGTTTTTC AACTTTTATT CATTQAGATG TTXTGAAGCA ATTAGGATAT GTQTGTTTAG 1200 

TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA 1GTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTXGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
75 GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTr TTAAAAAAAA AAA 

Seq ID NOi 232 Protein secruence 
Protein Accession #t NP_005400.1 

80 1 11 21 31 41 51 

1 I I 1 i I 

MSVKGMAIAL AVILCATWQ GPPMPKRGRC LCIGPGVKAV KVADIBKA3I MYPSHNCDKI 60 
EVIIHiKKNK GQRCLHPKSK QAftLIIKK^E RKNF 
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Seq ID HO: 233 DMA sequence 

Nucleic Acid Accession tf: NMJ)00577.1 

Coding sequence: 41. .520 

5 l li 21 31 41 51 

1111)1 

GGCACGAGGG GAAGACCTCC TGTCCTAXCA GGCCCTCCCC ATGGCTTTAG AGACGATCTG 60 

CCGACCCTCT GQGAGAAAAT CCAGCAAGAX GCAAGCCXTC AGAATCTGGG ATGTTAACCA 120 

GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCAAG GACCAAATGT 180 

1U CAATTTAGAA GAAAAGATAG ATGTGGTACC CAXTGAGCCT CATGCTCTGT TCTTGGGAAT 240 

CCATCGAGGG AAGATGTGCC TGTCCTGTGT CAAGTCTGGT GATGAGACCA GACTCCAGCT 300 

GGAGGCAGTT AACATCACTG ACCTOAGCGA GAACAGAAAG CAGGACAAGC GGTTCGCCTT 360 

CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 420 

OCTCTGCACA GCGATGGAAG CTGACCAGCC CGTCAGCCTC ACCAAXAXGC CXGACGAAGG 480 

15 CGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTQCCTGT 540 

TCCCATTCTT GCATGGCAAG GACTGCAGGG ACTGCCAGTC CCCCTGCCCC AGGGCTCCGG 600 

GCTATGGGGG CACTGAGGAC CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 660 

CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 720 

GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCQCC 780 

20 TCTGCATTCA GGATCAAACC CCQACCACCT GCXTCAACCTO CTCTCCTCTT GCCACTGCCT 840 

CTTCCTCCCT CATTCCACCT TGCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 900 

ACCAAGTGGC TCCCACACCC TGTTTTACAA AAAAGAAAAG ACCAGTOCAT GAGGGAGGXT 960 

TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT TTTTTTTTTT CAGTCCCCGT 3.020 

GAAQGAGAGC CCTTCATTTG GAGATTATGT TCTTTCGGGG AGAGGCTGAG GACTTAAAAT 10B0 

25 ATTCCTQCAT TTGTGAAAITG ATGGTGAAAG TAAGTGG TAG CTTTTCCCTT CTTTTTCTTC 1140 

TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 1200 

ATTTTTTTTT TCCTTTTAAA ACACTTCCAT AATCTGGACT CCTCTOTCCA ©GCACTGCTG 1260 

CCCAGCCTCC AAGCTCCATC TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 1320 

TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT 1380 

30 TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 1440 

ACTTGXATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 1500 

GAGCAGGAAA CATGACTCGT AT ATGT CX CA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCG 1560 

CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGX TCOCTACTTC 1620 

CTGTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 1680 

35 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 



40 
45 
50 



Seq ID NO: 234 Protein sequence 
Protein Accession #: WP_000568-1 

1 11 21 32 41 51 

I I I I I ) 

MAIi^TICRPS GRKSSKMOAF RIWDVNQKTF YLRNNQLVAG YLQGPNVNLE EKIDWPIEP 60 
HAIiFLGlHGG KMCLSCVKSG DETRHQLEAV NITDIaSENRK QDXREAFIRS DSGPTISFBS 120 
AACPGKPEiCT AMBADQPVSL TNMPDEGVMV TKFYFQEDE 

Seq ID NO: 235 DNA sequence 

Nucleic Acid Accession #i NM_001840-1 

Coding sequence; 14 9.. 156 7 



1 11 21 31 41 51 

I I 1 I I I 

GGGOACTAOG GAGAGCTCTG CAGGGAGCCG AGGCCOCCGC CCGGGCCAAG GGAGCTTCTG 60 

TOCCGAGGAC CAGGGGATGC GAABOGATTG CCCCCTGTGG GTCACTTXCT CAGTCATTTT 120 

GAGOTCAGCC TAATCAAAGA CTGAGGTTAT GAAGTCGATC CTAQATGGCC TTGCAGATAC 180 

55 CACCTTCCGC ACCATCAGCA CTGACCTCCT GTACGTGGGC TCAAATGACA TTCAGTACGA 240 

AGACATCAAA GGTGACATGG CATCCAAATT AQGGTACTTC CCACAGAAAT TCCCTTTAAC 300 

TTCCTTTAGG GGAAGTCCCT TCCAAGAGAA GATGACTGCG GGAGACAACC CGCAGCTAGT 360 

GCCAGCAGAC CAGGTGAACA TTACAGAATT TTACAACAAG TCTCTCTCGT CCTTCAAGGA 42 O 

GAATGAGGAG AACATCCAGT GTGGGGAGAA CTTCATGGAC ATAGAGTGTT TCATGGTCCT 480 

OU GAACCCCAGC CAGCAGCTGG CCATTGCAGT CCTGTGOCTC ACGCTGGGCA GCTTCACQOT 540 

CCTGGAGAAC CTCCTGGTGC TGTGCGTCAT CCTCCACTCC CGCAGCCTCC GCTGCAGGCC 600 

TTCCTACCAC TTCATCGGCA GCCTGGOGGT GGCAGACCTC CTGGGGAGTG TCATXTTTGT 660 

CTACAGCTTC ATTGACTTCC AOGTGTTGCA CCGCAAAQAT AGCOGCAACG TGTTTCTGTT 720 

CAAACTGGGT GGGGTCACGG CCTCCTXCAC TGCCTCOGTG GGCAGCCTGT TCCTCACAGC 780 

65 CATGOACAGG TACATATCCA TTCACAGGCC CCTGGOCTAT AAQAGGATTG TCACCAGGCC 840 

CAAGGCCGTG GTGGCGTTTT GCCTGATGTG GACCATAGCC ATTGTGATCG CCGTGCTGCC 900 

TCTCCTGGGC TOGAACTGCG AGAAACTGCA ATCTQTTTGC TCAGACATTT TCCCACACAT 960 

TGATGAAACC TACCTGATGT TCTGGATCGG GGTCACCAGC GTACTGCTTC TGTTCATCGT 1020 

GTATGCGTAC ATGTATATTC TCTGGAAGGC TCACAGCCAC GCOGTCOGCA TGATTCAGCG 1080 

70 TGGCACCCAG AAGAG CATCA TCATCCACAC GTCTGAGGAT GGGAAGGTAG AGOTGACCCG 1140 

GOCAGACCAA GCCOGCATGG ACATTAGGTT AGCCAAGACC CTGGTCCFGA TGCTGGTGGT 1200 

GTTGAXCATC TGCTGGGGCC CTCTGCTTGC AATCATGGTG TATGATGTCT TTGGGAAGAT 1260 

GAACAAGCTC ATTAAGAOGG TGTTTGCA.TT CTGCAGTATG CTCTGCCTGC TGAACTOCAC 1320 

CGTGAACCCC ATCATCTATG CTCTGAGGAG TAAGGACClG CGACACGCTX TGGGGAGCAX 1380 

75 GTTICCCTCT TGTGAAGGCA CTGCGCAGCC XCXGGATAAC AGCATGGGGG ACTCGGACTG 1440 

OCTGCACAAA CACGCAAACA ATGCAGCCAG TGTTCACAGG GCCGCAGAAA GCXGCATCAA 1500 

GAGCACGGXC AAGATTGCCA AGGTAACCAT GTCTGTGTCC ACAGACAOGT CTGCCGAGGC 1560 

XCTGTGAGCG XGAXGCCXCC CIGGCAGCAC AGGAAAAGAA TTTTTTTTTT TAAGCXCAAA 1620 

_ _ ATCTAGAAGA GTCTATTGTC TCCTTQGTTA TATTTTTT TA ACTTTACCAT GCTCAATGAA 1680 

OU AAGGTGATTG CCACATGTCA CTTATTTGCT TAGTTTCCGX XIGGGCTAAT CTTCCGGGGT 1740 
XCGTAGGAAA CCTTT 



Seq ID NO: 236 Protein sequence 
Protein Accession #t NP 001831.1 



1042 



WO 03/042661 



1 11 21 31 41 51 

I I I 1 i 1 

MKSILDGLAD TTPRTITTDI* LYVGSNDIQY EDIKGDMASX. LGYFPQXFPL TSFRG9PFQE 60 

KMTAEDNPQL VPADQVNITB FTOXSLSSPK ENE5HIQCGE BFMDIECFMV LNPSQQLAIA 120 

VLSLTLGTPT VLBNLI.VLCV ILHSRSLSCR PSYHFIGSUY VADLIiGSVIF VYSFIDPHVF 180 

HRKDSRNVPL FKLGGVTASF TASVGSLFLT AIDRYISIHR PLAYKRIVTH PKAWAFCLM 240 

WTIAIVIAVL PLLGWNCEKL QSVCSDIFPH IDETYLMFWI SVTSVLTjU»I VYAYMYILWK 300 

AHSHAVRMIQ RGTQKSITIH TSBDGKVQVT RPDQARMDIK LAKTLVLILV VIi 1 1 CWGPLIi 360 

A1MVYDVFGK MNKLIKTVFA FCSMIiCI»IiN9 TVKPIIYALK SKDLRBAFRS MFPSCEGTAQ 420 
PIvDNSMGDSD CLHKHAHKAA SVHRAAESCI KSTVKIAKVT HSVSTDTSAE Ala 

Seq ID NO: 237 DNA sequencB 

Nucleic Acid Accession #: MM_016083-2 

Coding sequence s 64 . . 1482 

1 11 21 31 41 51 

I 1 I ! 1 I 

GATTGCCOCC TGTGGGTCAC TTTCTCAGTC ATTTTGAGCT CAGCCTAATC AAAGACTGAG 60 

GTTATGAAGT CGATCCTAGA TGGCCTTGCA GATACCACCT TCCGCACCAT CACCACTGAC 120 

CTCCTGTACG TGOGCTCAAA TGACATTCAG 1ACGAA3ACA TCAAAGGTGA CATGGCATCC 160 

AAATTAOGGT ACTTCCCACA GAAATTCCCT TTAACTTCCT TTAGGGGAAG TCCCTTCCAA 240 

GAGAAGATGA CTGCGGGAGA CAACCCCCAG CTAGTCCCAG CAQACCAGOT GAACATTACA 30 0 

GAATTTTACA ACAAGTCTCT CTCGTCCTTC AAOGAGAATG AGGAGAACAT CCAGTGTGGG 360 

QAGAACTTCA TGGACATAOA GTOTITCATG GTCCTGAACC CCAGCCAGCA GCTOGCCATT 420 

GCAGTCCTGT CCCTCAOGCT GGGCACCITC A03GTCCTGG AGAAOCTCCT GGTGCTGTGC 480 

GTCATCCTCC ACTCCCGCAG CCTCCGCTGC AGGCCTTCCT ACCACXTCAT CGGCAGCCTG 540 

GCGGTGGCAG ACCTCCTGGG GAGTGTCATT TTTGTCTACA GCJTCATTGA CTTCCACGTG 600 

TTCCACOGCA AAGATAGCCG CAACGTGTTT CTGTTCAAAC TGGGTGGGGT CACGGCCTCC 660 

CTCACTGCCT CCGTGGGCAG CCTGTTCCTC ACAGCCATCG ACAGGTACGT ATCCATTCAC 720 

AGGCCCCTGG CCTATAAGAG GATTGTCACC AGGCCCAAGG GCGTGGTGGC GTTTTGOCTG 790 

ATGTGGACCA TABCCATTGC GATCGCCGTG CTGCCTCTCC TGGGCTGGAA CTGCGAGAAA 840 

CISCAATCTG TTTGCTCAGA CATTTTCCCA CACATTGATG AAACCTACCT GATGTTCTGG 900 

ATCGGGGTCA CCAGCGTACT GCTTCTGTTC ATCGTGTATG D3TACATGTA TATTCTCTGG 960 

AAGGCTCACA GCCACGCCGT CCGCATGATT CAGOGTGGCA CCCAGAAGAG CATCATCATC 1020 

CACACGTCTG AGGATGGGAA GGTACAGGTG ACCCGGCCAG ACCAAaCCCG CATGGACATT 1080 

AGGTTAGCCA AGACCCTGGT CCTGATCCTG GTGGTGTTGA TCATCTGCTG GGGCOCTTTG 1140 

CTTGCAATCA TCGTGTATGA TGTCTTTQGG AAGATGAACA AGCTCATTAA GACGGTGTTT 1200 

GCATTCTGCA GTATGCTCTG CCTGCTGAAC TCCACCGTGA ACCCCATCAT CTATGCTCTQ 1260 

AGGAGTAAGG ACCTQCQACA OGGTTTCCGG AGCATGTTT C CCTCTTGTGA AGGCACTGCG 1320 

CAGCCTCTGG ATAACAGCAT GGGGGACTCO GACTGCCTGC ACAAACACGC AAACAATGCA 1380 

GCCAGTGTTC ACAGOQCCGC AGAAAGCTGC ATCAAGAGCA CGGTCAAGAT TGCCA&GGTA 1440 

ACCATGTCTG TGTCCACAGA CACGTCTGCC GAGGCTCTGT GAGCCTGATG CCTCCCTGGC 1500 

AGCACAGGAA AAGAATTTTT TTTT TTAAGC TCAAAATCTA GAAGAGTCTA TTGTCTCCTT 1560 

GQTTATATTT TTTTAACTTT ACCATGCTCA ATGAAAAGGT GATTGTCACC ATGATCACTT 1620 

ATCAGTTTGC TAATGTT TCC ATAGTTTAGG TACTCAAACT CCATTCTCCA GGGGTTTACA 16B0 

GTOAAGAAAG CCTGTTGTTT AAGTGACTGA ACGATCCTTC AAAGTCTCAA TGAAATAGGA 1740 

GGGAAACCTT TGGCTACACA ATTGGAAGTC TAAGAACCCA TGGAAAAATG CCATCAAATG 1800 

AATAATGCCT TTGTAACCAC AACTTTCACT ATAATGTGAA ATGIAACTGT CCGTAGTATC I860 

AGAGATGTCC ATTTTTACAA GTTATAGTAC TAGAGATATT TTGTAAAATQ TATTATGTCC 1920 

TGTGAGATGT GTATCAGTGT TTATGTGCTA TTAATATTTG TTTAGTTCAG CAAAACTGAA 1980 

AGGTAGACTT TTATGA0AAC AATGGACAAG CAGTGGATAC GTGTCAATGT GTGCACTTTT 2040 

TTTCTATATT ATTGCCCATG A2ATAACTTT AGAAATAAAC CTTAATATTT CTTCAA&TAT 2100 

CTCTATTTAA TTTTGACACT GAAATAACCG TAAAGGTTTA TTTTTCTGTT ACCTCAACAA 2160 

GAAGAATTTG AAGACTTCAA AATATTGAGC AGAATTCATT CATACTTAAA AATTTATTAG 2220 

CCCTGCATTT TCATAGGAAG ACACATTATC TTCTGGACTA TAGCTGTTCT AATGGATTAT 2280 

AATCAGAATG GAAGAGAGAA AGCATATTGA CTTTTTTTGA GCGACATCTC TGACTTTCTT 2340 

TAGTCTTTAG CTATTACTGG ATCTCTTAA0 ACAGCATGTG TTAATCTTAA TGTATATCGT 2400 

TATCACTGTG CAGTTGCTGT TTACTTGAAT AGTATTGTGT TCCTATATTC CAGGTTTAAG 2460 

TAGATTTCAT GCCTGGGTGG CCAAACAACA GTCTTCATTT TTTTTAATTG AAAAGAAOTA 2520 

GTGTCTGGAT CAGTAAAATT ATACTGTGTG TGAGTGTGAA TATAAATGTG TGTATGTGTG 2580 

TTTCTGTCCG TAACTGTTAC AGTAATGTCA TAAAGTGAGA AAACTGTGAC CAAGTATAAA 2640 

CTTTTACCAC TTGCTGCACT CTTGCACATG GATTCAGTXT CTAAAATTGA GTTCTTCCTG 2700 

TAATCTTGTT GATAAAAATA CTGACTCCAA CCATTCAAAA ATTTCACCCC ATCCCTCCTT 2760 

AAGAGATTGG ATCAAGTATT ACTAAATTGA CCTTTAOGTA TTACACAAGA CCAGTGCTTA 2820 

GCAAAAAATA ATGACAGGCA TCCAAGGAAG GGATGTATTT GTAGTGTTAT TGCCAGGAAA 2080 

GGAGAGTACT TTGGTTTCTQ AGCACCGAAT ATTGAGCAAT ATGT CAGT CA CTAAAAGGAA 2940 

GACAGTTCTA CAGAAAAACA AATGGTAACA TTTTTCAATA GCGTGTGTAG ATAGTATGCA 3000 

CTATATACAT CACGTTAAAG TAGGACTATC ACACCCAGCC CATGTGGCTA AAAAAGCTGA 3060 

ATCAGACAGT GGATGAGACA CACAACGGCA GTQAAGAACC GATACACTTG GCATTGACGT 3120 

CTAGCTATGC TGTATCTGTG CTTTGCCCAC ATGCCCTTGG TGACAGCTGA GCACCCAGCT 3180 

CTQTCTTGGT AGGTTTGGGC TAAGQAACAA ATCTCTCCTT TGCTCGTGGT TAGCAAGATA 3240 

CACTCAAGCA TGAAGATAAA CACAGCTGCT TTCTTCTTAC ACCCCGGTCT CATGCTCCTT 3300 

AATGGD3CCA TGGGTGCTTG TTGGGCCTTT TTCCAGTAAG GAATGATATT GCTGAAGAAT 3360 

CTACTTAACC CTGACAAATT TTAATTATAA TCTCTTCTTA TACAGATAAA ACATGACTCC 3420 

TACAAQGCCC CAAGGTTTAC ATAGTCTGAA GTGAAGTACA GAGCTGGCAT CTATCTQOTG 3480 

ATTTCTAGCX CTCGAGATAC CCAAGCAGCC TGATGQGGCA GTTCCCCTTC TTACGGTTCA 3S40 

CGCTCTAAGG CAGGATGTGG CTTATGAGAT ACTTTGCATT GTCTGTCTGC ACACCTTOAA 3600 

TCTGCCTGCT GGCTCCCTTA CTTTAOCTCT CTGTCATGTG CAGATGAAGG CTCAGGGTGC 3660 

TAOAGGATTA GTAAGATCTC TTTCTAAAGA CAGGAGAGAT TATTTACAAG AAGAACTCAC 3720 

CAGGGTTTAG TTTGCATTTA AGAATTGCCA GTCTTTTGTC CTGCKTCATC TTGAACATTA 3780 

ATCCACATGT TTCAGAGCTC ACCAGGCAGT ACCAATGCTC TTTTCACAGC TATGAAGAGC 3840 

TAGAGAAATT CTTGTTATGG TAGAAAAATT TCAOGGTTCA TTTTTGAAAC TGCATTTGTG 3900 

CGTATGCAGT GTAGATTTTA TAGTGTGTTG TGCTTTCAAG ATCTAAATCA TATATAATAA 3960 
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ATTAAGGGAC AATGGGGCTG ACAGCACTAA ACTTGGTGCT TATTGATATT CTAAGAAATA 4D20 

TCTGTGAAAT ATCATCACOT ATGTTATACA ACCTTCATTT AAAAAGGTlT AAAACTAGTT 4 080 

AGATTCACTT TGACACTTTT CATATCATTT CTTAACCCAA GTGACGAAAA CATTGTCCCC 4140 

AATGAATATA CTCATTAGAA TTACCATTTG TTAATATCAC TCATTAATTA ACCCCATAAT 4200 

TAGATCCATT AATTTAAATG ATTTAAATTT AAGTAAGTTT TATAAGGTCT GACATCAGAG 4260 

Q TAT CTT ACT TTCCTCTGAG GATGATGTAC TTGCCCTGAC CATGCATTTT ACCATCACAC 4320 

ATGTTCAGAA AGGGCCAAAT TCCCAACCTG CTCATTTTTT TTTTTATGAG AGTCATGATG 4380 

AATCAGTCCT AGAATGTTTC ATTTGCACAA GTAGGGCTGC CTCCAAGAGG AACCTCTQAT 4440 

TTATTTTGTA TGAAATATAT GTGAAAGGAT AT3AATCTGA GAQATGCTGT AGACATCTGT 4S00 

CCTACACTTG AGATGATTTC CAAGCCTCTC TGGCACTTTG AGTTAAGTCT ATCTGGTATT 4550 

AAATOCCAAG GACCTTTTGC TGCCTAAATC CACTCTGCAG GAAATA3GCC CAACCAOCAG 4620 

ATGAGAATTA GGCCCTGGAT GAGTAGCGCT ATAGTTACTG TCCTGTTGAT TAATTTCTGC 4 6 BO 

CATTTCATGT CCATAAAAGA GACCACCCAT ATCATGCACA CAATTAGATT TCTCACACTC 4740 

TAACTGTATA TTTGTATGAT ATTTTAAAAT CTCCTAAATG CTQGGCAATG GCTATTAACA 4800 

ATTAATTGTC TTGCACTGGC CTTCTGATGA AATGTTAACA ATGCCTATTG TAATATAGAA 4860 

AAAAACATTC TATCTACTGA TTTGGGCTGA ATGTATGTAA ATAGGTTTCT AAAAAGTCAG 4920 

ATGTTTGAGC AGTGGCCXAC AAATCAGTAA TTTTOQGGTG GGAGAGTTTC TTTACATTGC 4960 

CGTGGCATCT TAAAAGCTAT CTTCATGTAA ATTGACTGTA CTAGGCCTAC TGGGGATCAG 5040 

AGTTCCCAAG AAAGGAAACC TTTTCTTGTA TCTGGATTCA AATTTATTTC CAATGTTTCA 5100 

AGCGGGAAAC ATGACTCTTT ATTGTCTGTA AATCTAACAT TATTACTTTT CCTCTTAGAA 5160 

GAATATTQTA TTGTTAGATG TTTGTTGAGC TGGTAACATC GTTGCAACCA CTGCAATATC 5220 

TTCGTTAGTA ATCTGTATAA TACTTTGTAT ACAAGTACTG GTAAGATTGT TATTAAATGT 52 BO 

AGCTTCAGTC ATTAAATTAC TATAGCAAAG TAGTACITCT TCTGTAATAT TTACAATGTA 534 0 

TTAAGCCCAC AQTATATTTT AT TT CAATGT AATTAAACTG TTAACTTATT CAAAGAGAAA 5400 

ACATCTCATC ATGTCTATTG TCCAAAGTTA CCTGGAATCA AATAAAAATT CTAGATTACC 5460 
ATGAAGAACA TA 

Seq ID HOi 238 Protein sequence 
Protein Accession, ft i NP_057167.1 



1 
1 

MXeiLDGLAD 
KMTAGDNPQI* 
VLiSLTLGTFT 



WTIAIA1AVL 
AHSHAVKMIQ 
AIMVYDVFGK 



11 
I 

TTFRTITTEHi 
VPADQVNITE 
VLEHLI»VLCV 
FKJjGGVTASL 
PIXGWNCEKI* 
RGTQKSIIIH 
MNKLIKTVFA 
CLMKHANiJAA 



21 
I 

LYVGSNDIQY 
PYNKSLSSPK 

TASVG8I»ETjT 
QSVCSDIPPH 
rSEDGKVQVT 
FCSHLtCLLNfe 
SVHRAAESCI 



31 
I 

EDIKGDMASK 
HNEENIQCGE 
PSYHPIGSIA 
ATDR.YVSIHR 
IDETYLMFWI 
RFDQARMDIR. 
TVNPIIYAUt 
KSTVKIAKVT 



41 

I 

T^OYFPQKFPL 
NFMDIECFMV 
VADIiLGSVH? 
PLAYKRIVTR 
GVTSVTiTiTiFI 
IiAKTLVI.II»V 



MSV8TDTSAE 



51 
I 

TSFRGSPFQE 
lilPSQQIAIA 
VYSPIDFHVF 
PKAWAFCLM 
VYAYMYILWK 
VLIICWGPLL 
HPPSCBGTAQ 
All 



60 
120 
180 
240 
300 
360 
420 



Seq ID KOt 239 DNA sequence 

Nucleic Acid AccesBion #t NM_033lfll-l 

Coding sequence i 17. .1252 



1 
I 

ATQAAGTCQA 
CTGTACGTGG 
CCAGTGTGGG 
GCTGGCCATT 
GGTGCTGTGC 
CGGCRGOCTG 
CTTCCACGTG 
CACGGCCTCC 
ATCCATTCAC 
GTTTTGCCXG 
CTGCGAGAAA 
QATGTTCTGG 
TATTCTCTGG 
CATCATCATC 
CATGGACATT 
GGGOCCTCTG 
GACGGTGTTT 
CTATGCTCTG 
AGGCACXGCG 
AAACAATGCA 



11 

\ 

TCCTAGATGG 
GCTCAAATGA 
GAGAACTTCA 
GCAGTCCTGT 
GTCATCCTCC 
GCGGTGGCAQ 
TTCCACCGCA 
TTCACTGCCr 
AGGCCCCTGG 
ATGTGGACCA 
CTGCAATrCTG 
ATCGGGGTCA 
AAGGCTCACA 
CACACGTCTG 
AGGTTAGCCA 
CTTGCAATCA 
GCATTCTGCA 
AGGAGTAAGG 
CAGCCTCTGG 
GCCAOTGTTC 
ACCATGTCTG 



21 
I 

CCTTGCAGAT 
CATTCAGTAC 
TGGACATAGA 
COCTCACGCT 
ACTCOCGCAG 
ACCTOCTGGG 
AAGATAGCCG 
CGGTGGGCAG 
OCTATAAGAG 
TAGGCATTGT 
TTTGCTCAGA 
CCAGGGTACT 
GCCACGCCGT 
AGGATGGGAA 
AGAjGCCTGGT 
TQGTGTATGA 
GTATGCTCTG 
ACCTGCGACA 
ATAACAGCAT 
ACAGGGCCGC 
TGTCCACAGA 



31 
I 

ACCACCTTCC 
GAAGACATCA 
GTGTTTCATG 
GGGCACCTTC 
CCTCGGCTGC 
GAGTGTCATT 
CAA03TGTTT 
CCTGTTCCTC 
GATTBTCACC 
GATCGCOGTG 
CATTTTGCCA 
GCTTCTGTTC 
CCGCATGATT 
GGTAC3VGGTG 
CCTGATCCTG 
TGTCTTTGGG 
CCTGCTGAAC 
CGCTTTGCGG 
GGGGGACTOG 
AGAAAGCTGC 
CACGTCTGCC 



41 
I 

GCACCATCAC 
AAGGAGAATG 
GTCCTGAACC 
ACGGTCCTGG 
AGGCCTTCCT 
TTTGTCTAjCA 
CTGTTCAAAC 
ACAGCCATCG 
AGGCCCAAGG 
CTGCCICTCC 
CACATTGATG 
ATCGTGTATG 
CAGCGTGGCA 
AOCCGGCCAG 
GTGBTOTTGA 
AAGATGAACA 
TCCACCGTGA 
AGCATGTTTC 
GACMGCCTGC 
ATCAAGAGCA 
GAGGCTCTGT 



51 
I 

CACTGACCTC 
AGGAGAACAT 
CCAGCCAGCA 
AGAACCTCCT 
ACCRCTTCAT 
GCTTCATTGA 
TGGGTGGGGT 
ACAGGTACAT 
OGGTGGTGGC 
TGGGCTGGAA 
AAACCIACCT 
CGTACATGTA 
CCCAGAAGAG 
ACCAAGCCCG 
TCATCTGCTG 
AGCTCATTAA 
ACCCCATCAT 
CCTCTTGTGA 
ACAAACACGC 
CGGTCAAGAT 



Ceq ID NO: 240 Protelii eeguence 
Protein Acceeelon #r NP 149421.1 



MALQIPPSAP 
LSLTLGTFTV 
P.KDSRKVFLF 
TIATVIAVLP 
HSHAVRMIOR 
IMVYDVPGKK 



11 
1 

SPLTSCTWAQ 
LENLtiVLCVI 
KLGGVTAEFT 
IiLGWNCBKLQ 
GTQKSIIIHT 
KKLIKTVPAF 
LHKHANNAA8 



21 
1 

MTFSTKTSKK 

ASVGSLF7.TA 
8VC9DIPPEI 
SEDGKVQVTR 
CSMLCKLiNST 
VHHAAS6CIK 



31 
) 



SYHPIGSLAV 
IDRYX 9 IHKP 
DETYUtPWIG 
PDQARMDlRIi 
VNPIIYALRS 
STVK1AKVTM 



41 
1 

FMDIECFMVL 
ADLIX3SVIPV 
LAYKHIVTHP 
VTSVIitiLPIV 
AKTTTiVLITiW 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
7 B0 
840 
900 
960 
1020 
1080 
1140 
1200 



SVSTDTSAEA 



51 
I 

NPSQQIiAIAV 
YSPIDFHVFH 
KAWAFCLMW 
YAYMYILWKA 
IiIICHGPIjLA 
FPeCEGTAQP 
1m 



60 
120 
180 
240 
300 
360 



Seq ID NO; 241 DNA sequence 

Nucleic Acid Accession. #j NM 003596.1 



1044 



WO 03/042661 



Coding sequence: 82.. 1194 

1 11 21 31 41 51 

* I I I I 1 

GTAGACTGTC CATGGCCTGA ACATTTTCCG AAAATCATTT TOAGCAAAAT ATCTGTTXAA 60 

TAACAAGATA ACCACATCAA GATGGTTGGA AAGCTGAAGC AGAACTTACT ATTGGCATGT 120 

CTGGTGATTA GTTCTGTGAC TGTGTTT TAC CTGGGCCAGC ATGCCATGGA ATGCCATCAC 180 

CGGATAGAGG AACGTAQCCA GCCAGTCAAA TTGGAGAGCA CAAGGACCAC TGTGAGAACT 240 

GGCCTGGACC TCAAAGCCAA CAAAACCTTT GCCTATCACA AAGATATGOC TTTAATATTT 300 

ATTGGRGGTG TGCCTCGGAG TGGAACCACA CTCATGAGGG CCATGCTGGA CGCACATCCT 360 

GACATTCGCT GTGGAGAGGA AACCAGGOTC ATTCCCCGAA TCCTGGCCCT GAAGCAGATG 420 

TGGTCACGGT CAAGTAAAGA GAAGATCCGC CTGGATGAGG CTGQTGTTAC TGATGAAGTQ 480 

CTGGATTCTG CCATGCAAGC CTTCTTACTA GAAATTATCG TTAAGCATGG GGAGCCAGCC 540 

OCTTATTTAT QTAATAAAOA TCCTTTTGCC CTGAAATCTT TAACTTACCT TTCTAGGTTA 600 

TTCCCCAATG CCAAATTTCT CCTGATGGTC CGAGATGGCC GGGCATCAGT ACATTCAATG 660 

ATTTCTCGAA AAGTTACTAT AGCTGGATTT GATCTGAACA GCTATAGGGA CTGTTTGACA 720 

AAGTGGAATC GTGCTATAGA GACCATGTAT AACCAGTGTA TGGAGGTTGG TTATAAAAAG 780 

TGCATGTTGG TTCACTATGA ACAACTTGTC TTACATCCTG AACGGTGGAT GAGAACACTC 840 

TTAAAGTTCC TCCAGATTCC ATGGAACCAC TCAGTATTGC ACCATGAAGA GATGATTGGG 900 

AAAGCTGGGG GAGTGrCTCT GTCAAAAGTG GAGAGATCTA CAGACCAAGT AATCAAGCCA 960 

GTCAATGTAG GAGCTCTATC AAAATGGGTT GGGAAGATAC CGCCAGATGT TTTACAAGAC 1020 

ATGGCAGTGA TTGCTCCTAT GCTTGCCAAG CTTGGATATG ACCCATATGC CAACCCACCT 1080 

AACTACGGAA AACCTGATCC CAAAATTATT GAAAACACTC GAAGGGTCTA TAAGGGAGAA 1140 

TTCCAACTAC CTGACTTTCX TAAAGAAAAA OCACAGACTG AGCAAGTGGA GTAGCAGAAC 1200 

CAGGAGCCTC TTCCATACAT GAGGAAAGAT TGCTGCCTTT TCAGCAGAAG GGAAATTCCT 1260 

AGGATTGGCT GTCCCCTGCC AAGCTTGGTG GAGCGTCTGC ACCTTGGCTG CGCCGCCTGr 1320 

GCATTTGCCA GTTTCCTCCC ACTGAGAGGA TGGAGGTGTC CaCACAGCTT TGGGCCTCGT 1380 

GAGGQATCTG CCTCCTGAGC AAAGAGCTCT TGATCCCOAT TTCATGCACA GCCCTGCAGT 1440 

AAGGAGCCCA GAAGGAACAT □TGTTTCCTG TTAAAACTCC TCTTOTTCTC TTTTCTTACA 1500 

TTATGACGTT TGTTTTCAAG GAGAGGGTTT AAAAATGGGA TCCTGTAAGC AGACTTGGGC IS 60 

AGTCTCCTTT TOAAATAGGI TOTCTGTACA TQTTCTAATG TTTTGTAGAA CACGTGTGCC 1620 

TGTTTAAGTG TATTGATGTG AATAATATTA AATATCCTAA TTATTTAATT CATTGTATTG 1680 

TTTCTG AGAA GTTGGGAAAT TACCATTATA CATTTACAAC CTAATC3ACTT TTGTATTTTA 1740 
TTTTTCAAAA TAAAAGCTTT CAATOTGA 

Seq ID NO: 242 Protein eeijuence 
Protein Accession #i np_0035B7.1 

1 11 21 31 

I I I 1 

MVGKLKQNlir LACLVISSVT VPYLQQHAME CHHRIEERSQ 
KTPAYHKDMP LIFlGGVPRS GTTLHJtAHIiD AHPD IRGBBE 
KIRLDSAGVT DEVLDSAMQA FI*LEIIVKHG BPAPYLCNKD 
LMVKDGEA9V HSMISRKVTI AGFDLNSYRD CLTKWNRA1E 
QLVLHPKRWM RTLLiKFLQIP TOJHSVLHHEE MIGKAGGVSIj 
KWVGKIPPDV liQDMAVIAPM 1AKLGYDPYA NPPMYGKPDP 
KEKPQTBQVB 

Seq ID NOs 243 DKA sequence 
Nucleic Acid Accession #s NM_001492.3 
Coding sequence* 13 95*. 2513 

1 12- 21 31 41 51 

I I I I I 1 

ACGCGQGGCG CGCGGCTCCG TCGGCTACCG CGGGCGGGCG CAGGCGACGG GCACGGCGGG 60 

CGAGCGGGCG GTATGGCGBC GGCGGGGCCC GCGGCGGGGC CGACGOGGCC CGAGCCCATG 120 

CCGAGCTACG CGCAGGTAGT GCAGCGCGGC TGGGGCAGCG CGCTGGCOGC GGCGCGGGGC 180 

TGCACGGACT GCGGCTGGGG GCTGGCGCGT CGOGGCCTGG CTGAGCACGC GCACCTGGCG 240 

CCGCCCGAGC TGCTQCTGCT GGCGCTCGGC GCGCTGGBCT GGAOCGOQCT GCGCTCCGCG 300 

GCCACTGCGC GCCTCTTTCG GCCOCTGGGG AAGCGGTGCT GCCTCCAGCC CAGAGATGOC 360 

GCCAAGATGC CCGAGAGCGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTQ GAGCTACAGT 420 

GCCTACCTGC TGTTTGGCAC CGACTACCGC TTCTTCCATG ACCCACCATC TGTCTTCTAC 480 

GACTGGACGC CGGGCATGGC AGTGCCACGG GACATTGCAG CCGCCTACCT QCTCCAGGGA 540 

AGCTTCTATG GCCACTOCAT CPACGCTACG CTATACATGG ACACCTGOC3 CAAGGACTCG 600 

GTGGTCATGC TGCTCCACCA GGTGGTCACT CTCATCCTCA TCGTCTCCTC CTAOGCCTTC 660 

CGGTACCACA ATGTGGGCAT CCTTGTGCTC TTCCTGCACG ATATCAGTGA CGTGCAGCTT 720 

GAGTTCACCA AGCTCAACAT TTACTTCAAG TCCCGCGGCG GCTCCTACCA TCOGCTGCAT 7 HO 

GCCTTGOCAG CAGACTTGGG C1GCCTCAGC TTCGGCTTCA GCTGGTTCTG GTTCCGCCTC 840 

TACTGGTTCC CGCTCAAGGT CCTGTATGCC ACCAGTCACT GCAGTCTGCG CACGGTGCCT 900 

GACATCCCCT TCTACTTCTT CTTCAATGCG CTCCTGCTGC TGCTCACCCT TATGAACCTC 960 

TACTGGTTCC TGTACATCGT GGCGTTTGCA GCCAAGGTGT TGACAGGCCA GGTGCACGAG 1020 

CTGAAGGACC TGCGGGAGTA TGACACAGCC GAQGCCCAGA GCCTGAAGCC CAGCAAAGCC 1080 

GAGAAGCCAC TGAGQAA0GG CCTGGTQAAG GACAAGCGCT TCTGAACCCC TCGGCCCCGC 1140 

CCCCGTGGAC OCGGCCCCAC CCCGAATACC CCGGOCACGC TCCCCGTCCT TGGCCGCCCC 1200 

TCCACCCCCT CCAACTCTGC TCCTCTAGGG CCGCCGCCAC CTCCCCTGGG ACCCCGCCCC 1260 

CTCATCCTGC CTCCATTTCC CGOCCACGCC CCCCAGGACC CCTGCCCCTC CGGGGACACC 1320 

GGCCOCGOCC TCAGCCCACT GGTCCCGGGC CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 13 BO 

CGCCTGGGAG QAAGATGCCA CCOCCGCAGC AAGOTCCCTG CGGCCACCAC CTCCTCCTCC 1440 

TCCTGGCCCT GCTGCTGCCC TCGCTGCCCC TGACCCGCGC CCCCGTGCCC CCAGGCCCAG 1500 

CCGCCGCCCT GCTCCAGGCT CTAGOACTGC GCGAXGAGCC CCAOGOTGCC CCCAGGCTCC 1560 

GGCCGGTTCC OCOGGTCATG TGGCGCCTGT TTCGACGCCG GGACCCCCAG GAGACCAGGT 1620 

CTGGCTCGCG GCGGAOGTCC CCAGGGGTCA CCCTGCAACC GTGCCACGTG GAGGAGCTGG 1680 

GGGTCGCCGG AAACATCGTG CGCCACATCC CGGACCGCGG TGCGCCCACC CGGGCCTCGG 1740 

AGCCTGTCTC GGCOGCGGGG CATTGCCCTG AGTGGACAGT CGTCTTCGAC CTGTCGGCTG IB 00 



41 
1 


51 




PVKLESTRTT 


1 

VRTGLUIiKAN 


60 


TRVTPRILAIj 


KQMWSRSSKE 


120 


PFALKgLTYI* 


SHLFPHAKFI* 


180 


TMYNQCMBVG 


YKKCMLVHYE 


240 


SKVERSTDQV 


IKFVNVGAL8 


300 


KIIESTEEVY 


KGEFQLPDPI* 


360 
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TGGAACCCGC 
CGGCAGCCCC 
CGGACCCCGG 
CGGAGCTGCT 
CGCTGGCGCT 
TSCTGGTGAC 
AACCCGTGTT 
TCCGCGAGGT 
GCCAGGGTCA 
ACCAOGCTGT 
GCTGCOTGCC 
TGGTGCTGCG 



TGAGCGCCCG 
GGAGGGCGGC 
GCCGGTGCTG 
GGGCGCCGCT 
ACGCCCCCGG 
CCTCGACCCG 
GGGCGGCGGC 
GGGCTGGCAC 
GTGCGCGCTG 
GCTGCGCGCG 
CGOGCGCCTG 
GCAGTATGAG 



AQCCGGGCCC 
TGGGAGCTGA 
CTCCGCCAGT 
TGGGCTCGCA 
GCCCCTGCCQ 
CGCCTGTGCC 
CCCGGGGGCG 
CGCTQGGTCA 
COCGTCGCGC 
CTCATGCACG 
TCGCCCATCT 
GACATGGTGG 
ACAATAAATG 



GCCTGGAGCT 
GCGTGGCGCA 
TGGTGCCCGC 
ACGCCTCATG 
CCTGCGCGCG 
ACCCCCTGGC 
CTTGTCGCGC 
T02CGCCGCG 
TGTCGGGGTC 
CGGCCGCCCC 
CCGTGCTCTT 
TGGACGAGTG 
CCGCGTGGTC 



GCGTTTCGCG 
AGCGGGCCAG 
CCTGGGGCCG 
GCCGCGCAGC 
CCTGGCCGAG 
CCGGCCGCGG 
GCGGCGX3CTG 
CGGCTTCCTG 

GGGAGCCQCC 
CTTTGACAAC 
CGGCTGCCGC 
TGCTC 



GCGGCGGCGG 
GGCGCGGGCG 
CCAGTGCGCG 
CTCOGCCTGG 
GCCTCGCTGC 
CGCGACGCCG 
TACGTGAGCT 
GC CAACT ACT 
CCGGCOCTCA 
GACCTGCCCT 
AGCGACAACG 
TAACCCGGGG 



1860 
1920 
19B0 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
24 GO 
2520 



Seq ID NO i 244 Protein sequence 
Protein Accession #: hp_0Q14B3.2 



1 
I 

MPPPQQGPCG 
VMWRLFRRKD 
AGHCPEWTW 
VLLRQL.VPAL 
DPRLCHPLAR 
AIiPVALSGSG 
YKDMWDECG 



11 
I 

EHLLIrLLALlp 
PQETRSGSHR 
FDLSAVEPAE 
GPPVRAELLG 
PHRDAEPVLG 
GPPALNHAVL 



2Z 

1 

IiPSLPLTRAP 
TSPGVTLQPC 
RPSRARLELR 
AAWARNASHP 
GGPGGACRAR 
RAJjMHAAAPG 



31 
I 

VPPGPAAAUb 
HVEELGVAGN 
FAAAAAAAPE 
RSLRUVLALR 
RLYVSFREVG 
AADLPCCVPA 



41 51 

1 I 
QALGXcRDEPQ GAPRIiRFVPP 
IVRHIPDRGA PTRA8EPVSA 
GGWEL5VAQA GQGAGADPGP 
PRAPAACARL AEASIiLI»VTIi 
WHRWVTAPRG FliANYCQGQC 
BLBPISVLFF DNSDNWXiRQ 



60 
120 
1B0 
240 
300 
360 



Seq ID NOi 245 DNA sequence 

Nucleic Acid Accession #i HM_021267.1 

Coding sequence s 17. , 1125 



1 11 21 31 41 51 

111 111 

ACGCGGGGCB CGCGGCTCCG TCGGCTACCG CGGGCGGGCG CAGGCGACGG GCACGGCGGG 60 

CGAGCGGGCG GTATGGCGGC GGCGGGGCCC GCGGCGGGGC CGACGQGGCC CGAGCCCATG 120 

CCGAGCTACG CGCAGCTAGT GCAGCQCGGC TGGGGCAGCG CGCTGGCGGC GGCGCGGGGC 180 

TGCACGGACT GCGGCTGGGG GCTGGCGCGT QGOGGCCTGG CXOAGCACGC GCACCTGGCG 240 

CCGCCCGAGC TGCTGCTGCT GGOGCTCGGC GCGCTGGGCT GGACCGCGCT GCGCTCCGCG 300 

GCCACTGCGC GCCTCTTTCG GCCCCTGGCG AAGOGGTGCT GCCTCCAGCC CAGAGATGCC 360 

GCCAAGATGC CCGAGAGCGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTG GAGCTACAGT 420 

GCCTACCTGC TGTTTGGCAC CGACTACCCC TTCTTCCATG ACCCACCATC TGTCTTCTAC 480 

QACTBGACGC CGGGCATGGC AGTGCCACGG GACATTGCAG CCGCCTACCT GCTCCAGGGA 540 

AGCTTCTATG GCCACTCCAT CTACGCTACG CTATACATGG ACACCTGGOG CAAGGACTCG 600 

□TGOTCATGC TGCTCCACCA CGTGGTCACT CTCATCCTCA TCGTCTCCTC CTACGCCTTC 660 

CGGTACCACA atgtgggcat ccttgtgctc TTCCTGCACG ATATCAGTGA CGTGCAGCTT 720 

GAGTTCACCA AGCTCAACAT TTACTTCAAG TCCCGCGGCG GCTCCTACCA TCGGCTGCAT 780 

GCCTTGGCAG CAGACTTGGG CTGCCTCftGC TTCGGCTTCA GCTGGTTCTG OTTCCBCCTC 840 

TACTQOTTCC CGCTCAAGGT CCTGTATGCC ACCAQTCACT GCAGTCTGCG CACGGTGCCT 900 

GACATCOCCT TCTACTTCTT CTTCAATGCG CTCCTGCTGC TGCTCACCCT TATOAACCTC 960 

TACTGGTTCC TGTACATCGT GGCGTTTGCA GCCAAGGTGT TGACAGGCCA GGTGCACGAG 1020 

CTGAAGGACC TGCGGGAGTA TGACACAGCC GAGGCCCAGA GCCTGAAGCC CAGCAAAGCC 1080 

GAGAAGCCAC TGAGGAACGG CCTGGTGAAG GACAAGCGCT TCTGAACCCC TCGGCCCCGC 1140 

CCCCGTGGAC CCGGCCCCAC CCCGAATAGC CCGGCCACGC TCCCCGTCCT TGGCCGCCCC 12 00 

TCCACCCCCT CCAACTCTGC TOCTCTAOGG CCGCCGCCAC CTOCCCTGGG ACCCCGCCCC 1260 

CTCATCCTGC CTCCATTTCC CGGCCACGCC CCCCAGGACC CCTGCCCCTC CGGGGACACC 1320 

GGCCCCGCCC TCAGCCCACT G3TCCCGGGC CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 1380 

CGCCTGGGAG GAAGATGCCA CCGCCGCAGC AAGGTCCCTG CGOCCACCAC CTCCTCCTCC 1440 

TCCTGGCCCT GCTGCTGCCC TCGCTGCCCC TGAQCCGCGC CCCOGTGCCC CCAGGCCCAG 1500 

CCGCCGCCCT GCTCCAGGCT CTAGGACTGC GCGATGAGCC CCAGGGTGCC CCCAGGCTCC 1560 

GGCCGGTTCC CCCGGTCATG TGGCGCCTGT TTCGACGCCG GGACCCCCAG GAGACCAGGX 1620 

CTGGCTCGCG GCGGACGTCC CCAGGGGTCA CCCTGCAACC GTGCCACGTG GAGGAGCTGG 1680 

GGGTCGCCGG AAACATCGTG CGCCACATCC CGGACCGCGG TGCGCCCACC CGGGCCTCGG 1740 

AGCCTGTCTC GGCCGCGGGG CATTQCCCTG AGTGGACAGT CGTCTTCGAC CTGTCGGCTG 1B0O 

TGGAACCCGC TGAGCOCCCG AGCCGGGCCC GCCTGGAGCT GCGTTTCGCG GCGGCGGCGG 1860 

CGGCAGCCCC GGAGGGCGGC TGGGAGCTGA GCGTGGCGCA AGCGGGCCAG GGCGCGGGCG 1920 

CGGACCCCGG GCCGGTGCTG CTCCGCCAGT TGGTGCCCGC CCTGGGGCCG CCAGTGCGCG 1980 

CGGAGCTGCT GGGCGCCGCT TGGGCTCGCA ACGCCTCATG GCCGCGCAGC CTCOGCCTGG 2040 

CGCTGGCGCT ACGCCCCCGG GCCCCTGCOG CCTGCGCGCG CCTGGCCGAG GCCTCGCTGC 2100 

TGCTGGTGAC CCTCGACCCG CGCCTGTGCC ACCCCCTGGC CCGGCCGCGG CGCGACGCCG 2160 

AACCCGTGTT GGGCGGCGGC CCCGGGGGCG CTTGTCGOGC GCGGCGGCTG TACGTGAGCT 2220 

TCCGCGAGGT GGGCTGGCAC CGCTGGGTCA TCGCGCCGCG CGGCTTCCTG GCCAACTACT 2280 

GCCAGGGTCA GTGCGCGCTG CCCGTCGCGC TGTCGGGGTC CGGGGGGCCG CCGGCGCTCA 2340 

ACCAOGCTGT GCTGCGCGCG CTCATGCACG CGGCCGCCCC GGGAGCCGCC GACCTGCCCT 2400 

GCTGCGTGCC CGCGCGCCTG TCGCCCATCT CCGTGCTCTT CTTTGACAAC AGCGACAACG 2460 
TGGTGCTGCG GCAGTATGAG GACATGGTGG TGGACGAGTG CGGCTGCCGC TAACCCGGGG 

Seq ID NO: 246 Protein sequence 
Protein Accession #s NP_067090.2 

1 11 21 31 41 51 

I I I 1 I I 

MAAAGPAAGP TGPEPWPSYA QLVQRGWG8A LAAARGCTDC GHGLAERGIiA EKAHLAPPBL 60 

LLIALGALGW TALRSAATAR LPRPliAKRCC LQPRDAAKMP BSAWKPXiFYI* GSWSYSAYIili 12 0 

FGTDYPPPHD PPSVFYUWTP GMAVPRDIAA AYHjQQSFYG HSIYATIjYMD TWRKDSWMIi 180 

1046 



WO 03/042661 



PCT/US02/36810 



LHHWTLILI VSSYAFRYHN VQILVLFLHD ISDVQIiEFTK LNIYFKSRGG SYHRLHALAA 240 

DliGCLSFGFS WFMFRLVWFP liKVLYATSHC SLRTVPDIPF YFFFNALUjL IVTLMNIiYWFL 300 
YIVAPAAKVL TGCVHELXDL REYDTAEAQS IiKPSKAEKPI> SNGLVKDKRF 

5 Seq ID NO: 247 DNA sequence 

Nucleic Acid Accession #s NM_D02081-1 
Coding sequence: 222..189B 

<#v 1 11 21 31 41 51 

10 ] I I 1 I 1 

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 60 

GGCTTTTGTT GTCTCCGCCT CCTCGGCOGC CGCCGCCTCT GGACCGCGAG CCGCGGQCGC 120 

DGGGACCTTG GCTCTGGCCT TCGOGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 180 

AGAGGCGOGG GCGGGTGGCC GGGGGOGCCG CCGGCCCGGC CATGGAGCTC CGGGCCCGAG 240 

15 GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 300 

GCAAGAGCCG GAGCXGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 360 

GCGACQTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGOG GATCTGTCCG CAGGGCTACA 420 

CCTGCTGCAC CAGCGAGATG GAQGAGAACG TGGCCAACCG CAGCCATGCC BAGCTGGAGA 480 

CDGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATQCT TGCCACCCAG CTGOGCAGCT 540 

20 TGGATGACCA CTTCCAGCAC CTGCTGMCG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 600 

CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 660 

AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 720 

GGGCCGQCCT GCIC5GAGCGC CTCTTCAAGC AGCTOCACCC GCAGCTGCTG CTGCCTGATG 780 

ACTACCTGGA CTGCCTGGGC AAGCAOGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 840 

25 GAGAGCTGCG CCTGCGGGCC ACCCGTGCCT TCGTOGCTGC TCGCTCCTTT GTGCAGGOCC 300 

TGGGCGTGGC OVGCOACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960 

OGAGAGCTGT CATGAAGCTG GTCTACTOTG CTCACTGOCT GGGAGTCCCC GGCGCCAGGC 1020 

CCTGCCCTOA CTATTGCCGA AATGTGCTCA AQGGCTGCCT TGCCAACCAG GCCGACCTGG 1080 

ACGCCGAGTG GAGGAACCTC CTGGACTCXZA TGGTGCTCAT CACCGACAA3 TTCTGGGGTA 1140 

30 CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACAOGTG GCTGGCGGAG GCCATCAACG 1200 

CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAOCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAOA AGCTGGTCTC TGAAGCCAAG OCCCAGCTCC 1380 

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

35 TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCOG IS 00 

AGQTCATGGG TGACGGCCTt? GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAQCAOA TCATGCAGCT GAAGATCATG ACCAACOGGC 1620 

TGC3GCAGCGC CTACAACGGC AAOGAOGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680 

A _ GCTOGGGCAG CGGTGATGGC TGTGTQGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740 

40 GCTCCAGCTC COGGACGCCC TTGACCCATG OCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT OCTCCTGCCC CTCCTOCTCT 1860 

TCCTGQCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920 

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980 

TOGAGAOQCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040 

45 GTCCCAGCCC CAGGCCTGGC CTOGCCTGCC TTTCTGCCTT XTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAaCCAGT GTGCGCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC OCCTCGAGGC 2220 

CTAC3U3AGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG GCTTOCTOCC 2280 

_ . CGCCTCCTCC CACTGGGACT CCCAGCAOAG OOCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

50 CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400 

TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 2460 

CTGC3GCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGAOGCC TOAGACAGCA CCACTGCTGA 3520 

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 2580 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAOGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640 

55 GGAGGCAGCG TGGGCTCTGC CAATJGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 

GOCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGAGCOG 2760 

CCIX5CTOCCA TOCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 2B20 

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGO GA1GCTGGTG GCTGGTGAGA 2880 

CCCCaCACTG CACACGGGAA TGGCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940 

GO GCACGGGGAC CTGGATAGTT AAGG0CTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 3060 

CACCTTGGAC CCTGGTGACC TCCTOTCACT CACTGAGGCC ATCAGGGCCC TQCDCCAGGC 3120 

CTGGACGGGC CCTCCTTCCC TCCTGTCCCC CAQCTGOCAG GTGGOCCTGG GGAGGGGTGG 31 BO 

TGTGGTC3TTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTO GGGGCAGCTG 3240 

65 TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGG TTTGCTTTTC ATCACCGTCC 3300 

CGCACAGTOa ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC CATOGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAQGAGCC CCCAACACAfi GCAAGTCCAC CCCATAATAA 3420 

CGCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATG003GOC GCCAGGACAG 3480 

CAGCACTGCC GCTGCACACA GACGGCCTAG GGGTGGCSCT CAGACCCCAC CCTACGCTCA 3540 

70 TCTCTGGAAG GGGCAGOCCT GAGTGHTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 3600 

GCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGOM QTGACGTGTG TTCTTTTGAG 3660 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 

Ssq TD NO: 248 PJQteln. sequence 
75 Protein Accession #; KP_0 02072.1 

1 11 21 31 41 51 

I I I I I 1 

MfibRARGWWIi IiCAAAALVAC AftGDPASKSR SOGEVHQIYG AKfiFfiLSDVP QAEISGEHLR 60 

80 ICPQGYTCCT SEMEENIAHR SHAELETALR DSSRVLQAML ATQLRSFDDH FOHLrLNDfiER 120 

TLQATFPGAF GSLYTQKAJRA FRDbYSHLKL YYRGAKLELfi STLAEFWARL LEHLFXQI^HP ISO 

QLLliPDDYIiD CLGKQAEAliR PFGEAPRELR LRATRAFVAA HSFVQGLGVA SDWHKVAQV 240 

PLGPEC6FAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCI* ANQADLDAEW BNLIJ3SMVI*I 300 

TDKFWGTSGV BSVIGSVHTW LAEAIHALQD NRDTLTAKVX QaOONPKVHP QGPGPEEKRR 360 

1047 



WO 03/042661 



PCT/US02/36810 
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Seq ED NOt 249 DWA. aecruence 

NUcleic Acid Accession Hi NM001492.3 

Coding sequence: B . . 1664 



GAAGGCCATG 
CATTTTCCTC 
GCCGGGTCCA 
CTTCTTCAGA 

CxCCTGATCT c 
CACCTTCTCT 
CX5CCTGGAGC 
GGCCCGGGAC 
CGAGCCOCCT 
GCGGTGCGGT 
QQTQTaCCOA 
ATGCCTAGAG 
CAGCCCCAGG 
TGACGGGAAC 
CACCTGCCCG 
TGGACCCTQC 
CTGOCACTGC 
CCTQCA£3CCA 
CTGCCGCGCC 
CGCCTGCQCT 
GCTGGGCTTC 
TflCTCAOQGC 
CTACATGGGA 
GGCOCCGCCG 
ACTGCTOSTG 
TGGCCACTCC 



21 

! 

GGATOTCCGG 
GGCCCGCTGG 




RC3KUVPRERP PSGTLEKLVS KAKAQLRDVQ DFW I SLPGTD C8EKMALSTA SDDRCNNGMA 
RGRYLPEVMG DGLANQINNP EVBVDITKPD MTIRQQIMQL KIMTWRLRSA YUGNDVDFQD 



51 
I 

TCCTAGCGCT 
ACTCTTTCGG 
CCTGCCGCCT 
CCCCGTGCGC 
GftGOGCCCGC 
CCTGGCCTGG 
TTGGAGGGCC 
GAGGCCCGTG 
GCGCGCGCTG 
BOGCCCCCTC 
AGGCGCCGCT 
GTGAATGCCG 
GCAGCTGCCT 
CTGGGCCCTG 
CCTTTGAATG 
CATGK3CAGA 
CTQCCTACAT 
ACCGGTGCAG 
TOCGCTGCCG 
GCGCGGGCCG 
GCTCCTGCGC 
CGCGCCCCTG 



42D 
4B0 



CTCGTCDGTA 
TGCTCCTTCC 
GCGCGCTGGG 
ATGAATTGGG 



11 
I 

GTCTCCCCAC 
CCCCAOACAC 

GTCTGCCTGA 
GCGCTGAGTG 
CCA.CTGCCCG 
TTCATCATCG 
CTGCTGGCGC 
ATTCAGCGCG 
GCCGTCGGOA 
CCGGGACTGC 
GCAGGCTGCA 
GGCTGGACTO 
GGCCCGTCCT 
CCGTGTGCCA 
CGTGGGTTCT 
TTCAACGGCG 
CCACCTGGTT 
TGCCGCAATG 
GGCTTCGCGG 
AACGGCGGCA 
GGOGGCCGCG 
GGCCGCTGCT 
GCGCGGTGTG 
GGCCTCAGGC 
GCCGCGGGCG 
CAGGATGCTG 
GATGGACTCA 
GATTGGAAXC 
ATCTAOGCTC 
CAGAGGCAGC 
TAGAGTCTCT 



AGCCTGGGCT 
CGCGOGGACC 
ACGGGCXCTT 
AAACCTGGAQ 
GCGTGGCTGG 
CAGGCGCCTG 
CCGCGTGCAC 
GCCCCTGCX3C 
GCCCTGAGCA 
GACCCCTCTG 
CTGCTAOCAC 
ATQGAGGCAG 
ACGGGCTGCG 
GCTTGX GTGT 
TCCAAGGCTC 
GCGGACTCTG 
GTCCTCX3CTG 
CGTGTGTGGA 
ACTQCCQCGA 
ACGCCCACTT 
AGTTCCCA3T 
CCGGGGACCC 
TOGCC33QCGC 
GGTCTCGCTT 



31 
1 

QCTCCTCTCC 
CGTCTTCGAG 
CCCCTGCAGC 
CTCAGAGGAG 
GGTCTACACC 
GCAGGTGCCC 
AGAGGAGTTA 
CAGGCGGCGC 
GGAGCTGCGC 
GCGCCTCTGC 
ACCGCTCGAG 
TGGCTTCTGT 
CACGGXCCCT 
OGGATGCCTT 
CTGTAGTGAG 
GTGTGAGGTG 
CGGGGGTGCA 
CAACXGTGAG 
CCTGGACCTG 
OGAGCACGAC 
GGGCGGCGOC 
GCGCGCGGAG 
CTOCGGCCTC 
GCACCCCGAG 
TCAGCGCTAC 
TGCGCTCTTG 
GCTGGCTGGG 



GCCCTGAAGA TGTAGACCCT 
GGGAGGTAGC GACGCCCCTT 
ACCTGCTTTT TCCCTAGCCT 
GGAAGOTTTT AAGCCCATTT 



GCCGCCQABT 
GAGCAGCCCG 
TTCCGGGACG 
GGAGACCAGA 
TTGGCAGCOG 
TTCTCGTACC 
GGTCCGOGGA 
GACGAATGTG 
GAACAGCCCa 
GTCTCCACCA 
GTCCCTGGGC 
ACACCCAGGT 
AGOGGGGTGA 
GACCCTGACT 
AAGAGGGTGG 
GGCCACGCCC 
CTGGACGACT 
aCQCACCGCT 
CCGTGCGCCG 
GTCTGCGCTT 
GGCGCAAGCG 
CTTTTGOCTC 
CTGGTCCACG 
ACCCCGGAGC 

CAAOGQATTT 
TTCCCOCCGC 
TCCT CGA'l"i 
TCAGTTCTAA 



GCTTGCCOGC 
raaCTCTGGG 
TGCGCCGCCG 
CGTCAGTCCA 
ATGGTCCGAG 
ATOTCATATC 
TACACACTGG 
TGTCCGTGAA 
CTTACTTTCA 



Seq ID MOt 250 Protein sequence 
Protein Accession $ * NP_0SB637.1 



RVCLKPGLSE 
SF1XETWRHE 
PAVK3TACTRI* 
EGWTGPIjCTV 
PROF YGLRCS 
PCRNGGLCLD 
EX3GKEKTSHRA 
PGLHPGDPQR 
PDAUWTjRTQ 
GQRQHIiLFPY 



11 
I 

SQTVILAliIF 
EAAESPCALG 
liGDQIGGPAW 
CRPRSAPSRC 



21 
I 

LPQTRPAGVF 
AAIiSARGPVY 
BLLARVAGRR 
GPtJDRPCAFl, 



31 

1 



VSOVTCATXJF 
LGHALRCRCR 
DPCAARPCAH 
YLLPPALGLIf 



PSSXLBVK 



AGFAGPRCEH 
GGRCYAKFfiG 
VAAGVAGAAL 



TBQPQAPAPD 
RLAAGGPWAR 
EDECEAPIATC 
LVPGPGPCDG 
AUPOeAYICH 
DUmCAORAC 
LVCACAPGYM 
LLVHVRRRGH 
PQGIYVISAP 



41 
1 

PGPGAPRSPC 
LPLPDGLLQV 
DIQRAGAWEL 
RAGCSPEHGF 



GPPGFQGSNC 
ANGOTCVEGG 
GARCEFFVRP 
SQDAGSRIiIiA 
BTYAREVATP 



Seq ID NO: 251 DMA sequence 

Nucleic Acid Accession #: CAT cluster 



1 
I 

GAAATATAAC 
AQATTATTGA 
GCTTCAATTC 
TTAGAAAGTC 
AGTTTTACTA 
TTGTGCAAAC 
GGGTQCTATT 
GGTTTTCTTT 
GTTTACTOAA 
TTATGTAAAT 
TGCCATTGTC 
TAAAAATACA 



11 

1 

CATTGCAATT 
CAATCCCAAA 
CTTATATGAC 



CTOTAGTGAC 
AAGCTTTCAT 
TACCTAGAAC 
TTCAAGGAGG 
AGCACTCAAT 
ATGTTAAATA 
ATATCAAACT 
TTTCAATGTT 



21 
I 

AGAAAATATC 
TATACAATTX 
TTCAGACTGG 
TATACATAAA 
CGTTGAGAAG 
TGTTAAGTGC 
CATTGTCTAC 
TTCAATTAAG 
GAAGTCAAAT 
TAACATGTAA 
TGCTTTGTAT 
AAAAAAAAAA 



31 
I 

CAAAATAQCC 
TTCXTTAAAA 
AGAAGCCTGT 
TTCTCAATTT 
ACCCTTGTTT 
CTQTATTCCT 
TACAATTAAC 
GCAATAAGAT 
TACTGAAGCT 
GGAAAATAAA 
CATACTAATG 
AAAAAAAA 



41 

) 

TQTTATCTTC 
GTAGTACAAT 
TAAACCACTG 
GGCTGCTGTA 
ATTXACAT1T 
TTC3VTTTACT 
ATTTACATTA 
GTTIGCTGGA 
T7TGCCTACA 
CAATAXTATA 
T T ACATAACT 



60 
120 
180 
240 
300 
360 
420 
490 
540 
£00 
660 
72 O 
780 
840 
900 
960 
102 0 
1060 
1140 
1200 
1260 
1320 
13BD 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



PFRDAWPGTF 
RFSYRAP.CBP 
CEQPGECRCL 



BKRVPRCSI.-Q 
GAHRCECALG 
DGA9AKPAAP 
OTPSPfiVHAL 
I*FPP1*HTGRA 



60 
120 
160 
240 
300 
360 
420 
4B0 
540 
600 



51 
I 

CACGTGGCCT 
TTCCTTTGTA 
TTAfiTTTCAG 
CACGTGCCAG 
GAASCACTGT 
TCATGTCCAG 
CAAAGTGTGT 
GAAACCTATT 
TCTTGGTCTT 
ATTATGTGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



Seg; ID NO: 252 DMA sequence 

Nucleic Acid Accession : Eos sequence 

I 11 21 31 41 51 

II 1111 
AGGTACTGCC AGAAAGQATC AGGACCTGGA GTCTGGCAAG AGGAAGACAG AGGCCIGTGT 



60 



1048 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



GGGAAOCGAG TTGTTATCTT 
TAGATAACCG AAAGTAAAAA 
TACAGCCGAG AGACAGTAAA 
TTTTGTTTTG TTTTTTTCCT 
ACATGCTAAA ATATCAAGGT 
AGAAGTGTAT AGTGATAATG 
AGTTATATCA GGGATTTTTT 
AAAATGATTC CATAATATGT 
ATTTGAAGAT TGTGCTTGGA 
GAAATTAGAC TTTCAGAAAA 
TTAGAGCTAG CAGGCAAAAT 
TCTATTCAAA ATACAAGGAC 
ACTGAAGCAG AAAGACATGA 
AAATGTTCTG TCTTACTCTG 
CCACGTGTGC GCCTCCTCTC 
CTGGCATCCC AAACTGAAAT 
TGGGGAATCT GAGGGTCTGT 
GAACGGCAGG ACGCTCCGAT 
CCTATGACAG ACAGCCCTGT 
CAT ACTCAT C TCCATCGCTT 
TTCCCAAATG GCAGCACCAA 
TTAACCACTG TTTAATCTGG 
AGAATTCAAG GCAAGCTAGT 
AATGTCAAAA TCTCAGTCCA 
GGGTGGGCAA ATGTGTATAG 
TTTCATGAGG CTTTTTACCT 
GTAGATATAC ACTAGAAGTA 
AACAATATGA TAAAAAGTTT 
TTTAATGTAA TCTAAAAATA 
CATTGTCGCC TTTTCATAAA 
ATAGGCATTT ACACAGAAAT 
TCTTCCACGT GGCCTAGATT 
ACAATTTCCT TTGTAGCTTC 
CACTGTT AGT TTCAGTTAGA 
CTGTACACGT GCCAQAGTTT 
CATTTGAAGC ACTGTTTGXG 
TTACTTEATC3 TCCAGGGGTG 
CATTACAAAG TGTGTGGTTT 
CTGGAGAAAC CTATTGTTTA 
CTRCATCTTG GTCXTTTATG 
TTATAATTAT GTGTTTGCCA 
TAACTTATCG ATCAATAAAA 



TGGTTATCTA GCTGTATGAG 
CTTCTTCAAG ATCGCCGGGG 
AACCAGAAAG GTCAGGAATA 
TATGATTAAA GGTQGGATGA 
TCCAGATATG TCTTGAGAGG 
AGT AAAGATG CATGTGCAGT 
TCTTAGAAAG GTGTTGCAOA 
AGACTTGQGC AGTTCCTTTG 
AATTACTTTA CATGTATTTG 
GTTTATACTG GAAGGTTAAT 
GAAAAAAAAA AAAAAAAARG 
AGATGCTTCT CTGTTCCAAG 
TGGAGACGAG ATCGCCTCCC 
CGOCTAGCAT TGGAAATGAA 
TTTTTGTTTA AGGATGATCA 
AATTAGGAOG TATATAGACC 
CTTGCCTAAT TGATTCOGCT 
TCCCACGCTC GGGGGCAAGr 
TGGGQGGTGG GGGTATGAAA 
AAGAAAGTAA AGGCATTTCC 
CTGGTCTGTA GCTGCTACAT 
CCATAATTAA GTTTGGCTTT 
AGAAAGCAAT TCCAAGAAAS 
TGAGATTATG GCCTTGTGAC 
AGATAAAATA CATATCTCXtt 
CTTAACATGA TACATCTAGG 
AAACTATAAA GTCATTAGCC 
DTTTTGGAAT AGTCTCAGCT 
ACACTATGCC TAGCAGAAAC 
AGTCCTTAAG TTTTCCATAT 
ATAACGATTG CAATTAGAAA 
ATTGACAATC CCAAATATAC 
AATTCCTTAT ATGACTTCAG 
aagtctgaga gactttatac 

TACTACTGTA GTGACOGTTG 
CAAACAADCT TTCATTGTTA 
CTATTTAOCT AGAACCATTG 
TCTTTTTCAA GGAGGTTCAA 
CTGAAAGCAC TCAATGAAGT 
TAAATATGTT AAATATAACA 
TTGTCATATC AAACTTGCTT 
ATACATTTCA ATGTT 



TGTATTGGTC 
AGCGTGTGAG 
CTTATTGAAT 
GAGAAAATTA 
GGTTGTTGCA 
TTGTT C TATT 
GATGTCTGGT 
GGAGOCACCT 
GGCTGTATGT 
AATTTGTATC 
CAAGGGCTGA 
AGGGTTTCCT 
CTCTTGTCAA 
AGTGACATTT 
GGTCTATCCA 
TGACAAAAAT 
AAACGGAATG 
GATAAAGOGG 
AAAACATCAA 
CACCCACAGC 

CTTCGTTGTT 



TTCATAAAGC 
AATGAAAGAC 
CTAACTTTGT 
AATGACACAC 
GCTGCAAAGG 
TTAAGGCAAA 
ACCTAGTTTA 
CCCTCTCAAA 
CAATXTGTAC 
TACTGAGGAC 
TTTTTATTCT 
TGAGGAAGCT 
AGTGTTAAAA 
AOSCCACAAC 
GGAAACAGCT 
GGAAAGGGGG 
CAGGAGATGT 
GGCCGGGCAG 
GTGCACACAC 
CATCTGCAGC 
TGTTAATTTT 
TGAGAT TTTC 



CACATTTTTG 
TATAACAGTC 
AACTTGGTCT 
TGTAGAAAGT 
AGATGGGTCT 
TTTGGCGCTT 
GTCACCAAOC 
ATATCCAAAA 
AATTTTTCTT 
ACTGGAGAAG 
ATAAATTCTC 
AGAAGACCCT 
AGTGCC7TUTA 
TCTACTACAA 



CAAATTACTG 
TCTAAGGAAA 
TGTATCATAC 



CTTTTGTTTT 
GTTATTATAA 
AATTGTGCTA 
TGATTATGAT 
AAATAGCCAT 
TGGAGGTCCC 
AAACATTTGT 
TAGCCTGTTA 
TAAAAGTAGT 
CCTGTTAAAC 
AATTTGGCTG 
TGTT T ATTTA 
TTCCTTTCAT 
T T AACATTTA 
AAGATGTTTG 
AAGCTTTTGC 
ATAAACAATA 
TAATGTTACA 



Seq ID NO: 253 IMA sequence 

Nucleic Acid Accession NM_001650.2 

Coding sequence a 40.1011 



1 
1 

GGGGCAGGCA 
AGGCGGTGGG 
GGGGTCTGGA 
TTTGTTCTCC 
GTCGACATGG 
TTTGGCCATA 
AGGAAGATCA 
ATTGGAGCAG 
ACCATGGTTC 
TTTCAATTGG 
TCAATABCTT 
ACTGGTGCCA 
GAAAACCATT 
TATGAGTATG 
AAAGCTGCCC 
GAGACGGATG 
GAGGAGAAGA 
CGCACTGAAA 
GAAACAGATT 
GTCTAAACAA 
TCCAAATCTA 
TCTAGTTACC 
CCTGACAGAA 
AGTCAATTCT 



11 
1 

ATGAGAGCTG 
GTAAGTGTGG 

CTCAAGcrnr 

TCAGCCTGGG 
TTCTCATCTC 
TCAGGGGTGG 
GCATCGCCAA 
GAATGCTCTA 
ATGGAAATCT 
TETTTACrTAT 
TAGCAATTGG 
GCATGAATCC 
GGATATATTG 
TCTTCTGTCC 
AGCAAACAAA 
ACCTGATTCT 
AGGGGAAAGA 
GCAGACAAGA 
1GTTATAAAT 
TAAATATTTC 
AAAAAAGAAA 
TTTCATTAAC 
CTCAAAGACA 
TATTTGAATA 



21 

I 

CACTCTGGCT 
ACCTTTGTGT 
CTGGAAAGCA 
ATCCACCATC 

ccraaxscacTT 

CCACATCAAC 
GTCTGTCTTC 
TCTGGTCACA 
TACCGCTGGT 
CTTTGCCAGC 
ATTTTCTGTT 
OGCCOGATOC 
GGTTGGGCCC 
AGATGTTGAA 
ABGAAGCTAC 
AAAACCTGGA 
CCAATCTGGA 
CTCCXTAGAA 
TAGAAATGTG 
ATAATTTACA 
TATTTTTAAG 
AACCAATTTT 
GGTCTATCAG 
TT T ATT CT AT 



31 
I 

GGGGAAGGCA 
ACCAGAGAGA 
GTCACAGCGG 
AACTGGGGTG 
GGACTCAGGA 
GCTGCAGTGA 
TACATCGCAG 
CCTCCCAGTG 
CATGGTCTCC 
TGTGATTCCA 
GCAATTGGAC 
TTTGGACCTG 
ATCATAGGAG 
TTCAAACGTC 
ATGGAGGTGG 
GTGGTGGATG 
GAGGTATTGT 
C1GTCCTCAG 
CAGGTTTGTT 
AAGGAGGAAC 
ATGTTCTTAA 
AACOGTGTGT 
CTTATTCCTT 
TAAACTGAGT 



41 
I 

TGAGTGACAG 
ACAT CATGGT 
AATTTCTGGC 
GAACAGAAAA 
TTDCAACCAT 
CTGTGGCCAT 
CCCAGTGCCT 
TGGTGGGAGG 
TGGTTGAGTT 
AACGGACTGA 
ATTTATTTGC 
CAGTT AT CAT 
CTGTCCTCGC 
GTTTTAAAGA 
AGGACAACAG 
TGATTGAOGT 
CTTCAGTATG 
ATTTCCTTCC 
GTTTCATGTC 
GGAAGAAACC 
GCAAATATAT 
CAAGATTTGG 
CTCTACTGGA 
TTAACAATGG 



120 
180 
240 
300 
360 
420 
480 
540 
£00 
660 
720 
7 B0 
840 
S00 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1330 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



51 
1 

ACCCACAGCA 
GGCTTTGAAA 
CATGCTTATT 
GCCTTTAGGG 
GGTGCAGTGC 
GGTGTGCACC 
GGGGGCCATC 
CCTGGGAGTC 
GATAATCACA 
TGTCACTGGC 
AATCAATTAt 
GGGAAATTGG 
TGGTGGCCTT 
AGCCTTCAGC 
GAGTCAGGTA 
TGACJCGGGGA 
ACTAGAAGAT 
ACJCCATTAAS 
ATATTACTCA 
TATTGTGAAT 
ACCTATTTTA 
TTAAQTCTTG 
ATATTGGTAT 

c ■ 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10BO 
1140 
1200 
1260 
1320 
13 B0 



Seq ID NO « 254 Protein aeguence 
Protein Accession #: NP 001641.1 



80 



1 11 21 31 41 SI 

I I I I I I 

MSDRPTARHK GKCGPLCTSfi HIMVAFKGVW TQAFWKAVTA BFLAMLIFVI. I*SLGSTINWG 60 

GTEKPLPVDM VLISLCFGLS IATMVQCFGH 1SGGH1NPAV TVAMVCTRKX SIAKSVFYIA 120 

AOCLGAIIGA G2LYLVTPPS WGGLGVTMV HGNLTAGHG1* ZjVEL.IITFQL VFTIFASCDS 180 

KRTDVTGSIA IiAXGESVAIG ELFAINYTGA fiHKPARSPGP AVIMGNWaiH HIYWVGPI1G 240 

AVLAGGDTBY VFCPDVKPKR RPKEAP9KAA QQTKGSYMEV KDNHSQVETD DI>II»KPGWH 300 

1049 



WO 03/042661 



PCT/US02/36810 



VIDVDRGEEK KGKDQSGEVL SSV 



Seq ID NOt 2S5 DNA sequence 
Nucleic Acid Accession ft-: U26742.1 
Coding sequence : 325 . - 144 9 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



i 
I 

CAGGAAACCC 
GQCAGCGGAC 
CTCAAACCTC 

TCTTTAAGTG 
GCTGTCCCTT 
GCAGAAAGAA 
TCCACCTACA 
GTGGACATAT 
AACACTGAAC 
AACAAACGGA 
AACTTCCTGC 
AAAATGGCTT 
TCAATGATTT 
GAAGTTCTCA 
TCAGCCAGAT 
CTTATGTCAG 
AATGTGGAAA 
GGATTTCGCT 



AAATCACCTG 
□GTGAAOCTT 
ATCGTTGATA 
TCTGTTCCCT 
TGCGTCTAGA 



TTTCCCATTT 
TCACATTTCT 
CAATAAAGCT 



11 
I 

TGGTACTQGC 
CCGGCACTTC 
CCTGCAGACC 
ATTOCAAGAT 
CCATAATCAA 
CATTGAATTT 
GACAGCTBTT 
GAACAGCATG 
GGAATGTCAT 
TCAACGTGTC 
TCCCAACCAC 
TTGCAGCGTT 
TAGCCACATT 
CTGACTCCAG 
AACTACOCAC 

ATCC1CCCCC 
ATGTCTTCCA 
ACCGATGCCA 
OOSGTGGTTC 
CTAAGAAGCT 
TGCACCCCAT 
CTTGGCCTCC 
CCTCAGGAAG 
TGGATAACAT 
CCTCCAGGGT 
TCAGTCATTT 
TTCTGTCACC 
GTGTACACTA 



21 
I 

AGCAGCCAGC 
CAACATTATT 
AATGGACACC 
AAATAGGATT 
CTGCCATTTC 
TAGAATGATT 
TGCAGAGATG 
CAAGCTTAGG 
AGAAGCATTG 
DCGCTTAGAG 
TCACCAAATC 
TGATCCGGAA 
GTGTGGAGGG 
TGGG GTGA TG 
GGCAGTTTTT 
CCAACAGAAA 
GCAGTGTCTG 
TCCOGTTGAG 
ACAGTGTCAC 
TCATAGCAAC 
GACTAATGCA 
GTTCCCAGAT 
CAGACCTGTA 
tCCTTllATT 
GACTTCTTCT 
GCATGGTACC 
TGGAATGTTC 
ACAGAGATTG 
AATGTCC 



31 
I 

AAATAATAAG 
TTCTAAGAGT 
TAOCAATCCT 
AAAGAATATA 
GAAGATAGTG 
AGGGCTCAAG 
TTTGTTCAQA 
CGGGAAAATG 
GCTGTGCTCT 
CATGTGGAGC 
GGCCATGGTA 
AAGATCATGG 
GTTTATGGAC 
GAAGGTCCTT 
AAAGTGAGGT 
GTCTGGTTGC 
T3TTCCTACT 
AATTACCAGC 
CAGCACCAAA 
TTAAGCAAGT 
CAGCCTGAGA 
ACCAGCATGA 
ACCAGGAGCT 
ACCCTAAAAT 
CATTAACCCA 



GCCTACGGTT 



41 
1 

CCCACATGAC 
AAAGCGGCTC 
TTGGCGAGTC 
TGGATGAAGT 
GATGGTTTTG 
GGAAAAGAGG 
ATCTGGATCG 
AGAAATGCAA 
CTCTGAACAA 
CCACTATTTT 
AGTCCATCAG 
AAATTTCAGT 
ACAAATTAAG 
GATATGACCA 
CAT TTGGTTA 
TAAATGGTTT 
CTCTTCTGCA 
GCCACAGTGA 
TCTGTCAGGA 
TGAAAGAGTA 
CCCTGAGCTG 
AGCCACTCAA 
ACGACAOCCT 
CGGACGOTGC 
AT T CCTATAA 
AAATATGATT 
ACAGTTGGGT 
TCTGTTTTGA 



SI 

1 

ccacaactCt 

CTACTCCAGG 
AGTGACTGAA 
GCTTGGGAAG 
AAAAGTTCAT 
AAATACCATG 
CATCCGACTC 
TTTGCACCTG 
CCTGGACCCA 
TTACCAGCTC 

ATTTGCTGTC 
ATATATTTTC 
ATTCCTTCGG 
CACAGAACAG 
CTTGGACACG 
TOGACTAGCA 
GAGTATGATG 

CACGTCATGG 
TGCTTCCAGC 
CTTGGCTCAC 
GTTCTCCCAC 
TTTTGGTGGA 
TACTTTGAGC 
ATTTCCCTTT 
TGTTTAAAGC 
GGGTGCTGTT 



Seq ID NO: 256 Protein sequence 
Protein Accession fti AAC5 0424.1 



11 
1 



I 

KIEDSGKHGH TMABRRQIiFA 
ALItBSEVLNMIi DPKTBtNVSR 
PEOHGKISVF AVKMAIiATliC 
VFEGPSFGYT EQSARSCF6Q 
VBC3YCH9SS MMGFRYRCQQ 
NAbSKSLSCA SSRSPLHPHF 
FITRSSDGAF GGCV 



Seq H> HO? 257 DNA sequence 

Nucleic Acid Accession #; NM_004172.x 

Coding sequence; 179.. 1807 



SO 
120 

iao 

240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1560 
1620 
1680 



21 31 41 51 

1111 
EMRAQDLDRI RLSTYRXACK LRFVQKKCNIj KbVDXWNVIK 
LKAVli STI FY QLNKRMPTTH QIHVBQSISL LLMFIaLAAFD 
GGK1MDKLRY IFSMISDSSG VHVYGRYDQF JjREVLKLPTA 
QKKVTLNGFIi DTLM8DPFPQ CLVWLPLLHR LANVENVFHP 
CHHYQLCQDC FHRfiHAGGSH SNQHQMKEYT SWKSPAKKIiT 
PCQPEKPUTC* AHTVDTWPPR PVTSHHDTIiF SHSVP&SGSP 



1 
I 

GCGGATTGTT 
CACCCdTAC 
TGTGGGTGAT 
GACTAAAAGC 
AGTCCGTAAA 
TAAAAGTTAC 
TA CAATC CIT 

ctcctttcct 

CTCCAGTCTT 
GCGAGCTGTA 
TGTCATCATC 
TGXACGAGTG 
TCTGGTAGAA 
□CCCATCCAG 
GGAGACTCTT 
CAATGCOCTG 



GGTAGCAGTA 
TGTGGAGATG 
CATTGTTOGC 
ACGGAAAAAC 
GACCTCTTCA 
CGTGGACAAG 
GACTGCOCTC 
GAACTTOGGA 



11 
I 

GCTCCGTTGT 
AAAATCAGAA 
TCCCAGACAC 
AATGGAGAAO 
CGCACACTTT 
CTGTTTCGOA 
GGATTTACCC 
GGGGAACTTC 
GTCACAGGAA 
GTCTATTATA 
ATCCATCCTG 
ACAGCTGCAG 
GCXTTCCTTTA 
GCCAACGAAA 
ACCCGAATCA 
GGTCTAGTTQ 
CAGGCCCTGA 
ATAATGTGGT 
GAAGACATGG 
TTACTCATTC 
CCTTGGGTTr 
AGTTCTGCCA 
CGCGTCACCA 
TATGAGGCTT 
CAAATTATTA 



CACTGACGAC ATCACGCTCA 



21 
1 

ACCTGCTGGG 
AAGTTGTGTT 
TGAAGTGCAA 
AGCCXZAAQAT 
TGGCCAAGAA 
ATGCTTTTGT 
TCCGACCATA 
TGATGAGGAT 
TGGCGGCQCT 
TGACTACCAC 
GGAAOGGCAC 
ATGCCITCCT 
AACAGTTTAA 
CSCTIGTGGG 
CAGAGGAGCT 
TCTTCTCCAT 
GAGAGTTCTT 
ATQCCCCCGT 
GTGTGATTGG 
ACGCAGTCAT 
TTATTGGAGG 
CCCTACCCAT 
GATTCGTGCT 
TGGCTGCCAT 
CAATCAGCAT 
TGGTCACTAT 
TCATCGCOGT 



31 
i 

GAATTCACCT 
TTCTAATACC 
AGAAGAGACC 
GGGGGGCAGG 
GAAAGTOCAG 
GCTGCTCACA 
CAGAATGAGC 
GTTACAGATG 
AGATAGTAAG 
CATCATTGCT 
AAAGGAAAAC 
GGACTTGATC 
AACCAACTAT 
TGCTGTGATA 
GGTCCCAGTT 
GTGCTTCGGT 
TGATTCTCTT 
GGGTATTCTC 
GGGGCAGCTT 
OGTCTTGCCA 
GTTGCTGCAA 
CACCTTCAAG 
CCCCGTAGGA 
TTTCATTGCT 
CACAGCCACA 



41 
I 

CGTTACTGCT 
AAAGAGGAGG 
CTCCTAGAAA 
ATGGAGAGAT 
AACATTACAA 



TACCGGGAAG 
CTGQTCTTAC 
GCATCAGGGA 
GTGGTOATTG 



GGACTGGTTC 



GAGAAGAGAA 
AACAATGTGT 
CCAQGATCTG 
TTTGTGATTG 
AACOAAGCCA 
TTCCTGATTG 
GCCATGTACA 
CTCCTCTACT 
GCACTCATCA 
TGCCTGGAAG 
GOCACCATTA 
CAAGTTAACA 
GCTGCCAGTA 
CTGACATCTG 
CTGGATCGCC 



60 
120 
180 
240 
300 
360 



51 
1 

TGATATCTTC 
TTTGGCTTTC 
ACTAAAATAT 
TCCAGCAGGG 
AGGAGGATGT 



TCAAGTACTT 
CACTTATCAT 
AGATGGGAAT 
GCATAATCAT 
AAGGCAAAAT 
TCCCtCCAAA 
GCTTTAAAGT 
CTGAGGCCAT 
TGAATGGAGT 
GAAACATGAA 
TCATGAGACT 
CTGGGAAGAT 
CCGTGACTGT 
TCTTGGTAAC 
CCGCrCTGGG 
AGAACAATGG 
ACATGGATGG 
ACTTTGAACT 
TTGGGGCAGC 
TCGGCCTGCC 
TCCGGACCAC 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 00 
1440 
1500 
1560 
1620 



1050 



WO 03/042661 



PCT7US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CACCAACGTA CTGGGAGACT CCCTGGGAGC TGGGATTGTG GAGCACTTGT CACGACATGA 1680 

ACTGAAGAAC AGAGATGTTG AAATGGGTAA CTCAGTGATT GAAGAGAATG AAATGAAGAA 1740 

ACCATATCAA CTGATTGCAC AGGACAATGA AACTGAGAAA CCCATCGACA GTGAAACCAA 1800 

GATGTAGACT AACATAAAGA AACACTTTCT TGAGCACCW3 GTGTTAAAAA OCATTATAAA 1BG0 

ATCTTTCCAT CTCATTACAG CTCATTCGCT CCAGCAAGCC CGTCATCTTC CCTTTTOCICC 1920 

CTTCTGATAA GACTGGAAAA TAGTCCTCCA AAACACAAGG GAGGATTTTG GGTGGCCAAA 1980 

GTGTACAATT TTCATCCCAC AATTGAAATT TTXAAATCAT TTCATGTTAG TCTTACDGAA 2040 

TAAGGTACCA AGATCACAAA TAGTGTTGAT CAGATCTTAC AAGTTTATGT GGCACACAAT 2100 

TCCTATAAAT GTGATTTTTT TATATAAGTT AAAGAGACAA ATAGTAQGCT AAAAACATTT 2160 

TAAAATCAAC TTTTGAAATT TAAAAATCTT TCAGAATACA AITCAGTTTT AGTTTCAAAA 2220 

TGTTAACAAC TTGAATTACA AOCGGTTATC AOTTGGACAG TAAGATTTTA TCCCTTTCTC 2280 

TTCTGACTGG TATACCTATT TCATTAGTAG CTAGGTGCAC ATATACATCT AGCACAGCTG 2340 

T3AGGACAGA CAGAAGGCAA AGTTTCCATG TGQCCTTGAG CAAGTCCCAT CTCACCTCXA 2400 

GGCCTCAGTG TCCTCATCTA TAAAATGAGG GACTTCCCTA GAAGTCTTCA TGGTCTCTTC 2460 

CAGCCC3K3AC ATCCTGTGAT GTCATGAAAG CACCTGCCCT CTGTTTCCCC TCAGAACACC 2520 

CTGTAOCATC CATGGAQmC GAGGCCTTCA GAAAAGACAC TTOkATGGGA GTGAACATTT 2560 

CTAACTAAGG ACAGGATGGC TGTGTGTGGT GGTCACCAGG TCCTGTTGAQC AAAGTGCAGG 2640 

TTATGCAAGT CGCCAOGCAG GAGGCCATTC CAGGAGTGGG ATTATTCATC AAACTCTTTG 2700 

CC CAGT T CAT CCCAATGGGG GAAaTATTCC CTTCTTTCCT ACrCXGaQ&A GAATGTCTCC 2760 

TGCCACTCCT CAACTGATGA TABACTTCGA AAACAGATGA GAAGACTAGC AGCTAGCAAG 2B2 0 

GGTGCTTGTA GTCACACTGTT GGAACACTAA AGAGCTAGGA AAGAGTTGAG CACAGGCAAC 2880 

ATTACAAACA AAGGATTTGA AAACACCAAG AGTACAGGTC TTCITTAAGG AAGAATAAAA 2940 

AAGAAGAGGT TCATTTTTCT GGCTTTTTTT TTCACCTGAA ACACTTTTTC TCGAGTCCAA 3000 

AATCATTCCC CCCGTGAAGT CTGCTTACCA AAACATAA&A CGACTTATAT ATTTGAAAGA 3060 

AGTCRAATGA ATGAGCTCTC TAATAQAAGT CCAXGAGTTG AGTGGGTATT TCTTATTTGA 3120 

AAGTGTTTTT CTTTAATCAA AAGTCCTTAG AATGAGGGAA ACAAAATATT TATTTGTTTT 3180 

GGAATCCCAC TTATCAAATC ATTCAAAACT TTCAGCTGGA GTGGGGTTTG CTTTTGTTTT 3240 

GTTTGTGTCC ATAAGAGAAA TOGTAGAA0A TQAATCAGTA TGAAGACACT GTCAATGAGG 3300 

TTATGAGAAA AAAACAGCAG OGGCATTAGT TTCAGGCAAG GCAGCTCCCA GGTTTAGAGA 3360 

TTAATTTTTA CCCCCTAAGG AATATCCAGT CAAAGACGCT GAGTGGGAGC TGTCAGGCAG 3420 

TAGCAGCTGT GTTTGAGTTT CTGGCTGAAA ATGGTGAAGA ATGGACTTAA TTATGCTAAC 3480 

AAACTGAAAA ATCTAGACAT AGATCCTCTG ATATACAATT AGAGATATTT TTATATAGAC 3540 

CCCAAGCATT CTGTGCATAA AAGTTAACAT TAGGCTGTGG TGCAGTAACC ATTTAATGTC 3600 

GAGGCTCTAT TTCGGAAATA CACTACAAAT GTTAAAGTAC GTGGCTGTCX? TCTTAAGACA 3660 

CTAGTAGAGC AAAGACTTAA TCATATCAAC TTAATTCTGT TACACAATAT GTGTTTTTTA 3720 

ATATACTAAC CATTTCTTAT GGAAAGGTCC TGTGGGGAGC CCATCCTCTC GCCAAGCCAT 3780 

CACAGGCTCT GCATACACAT GCACXCAGTG TGGACTGGGA AGCATTACTT TGTAGATGTA 3840 
TTTTCAATAA AGAAAAAAAT AGTTTTACAT T 



Seq ID NO; 2S8 Protein sequence 
Protein Accession JJ* HP_004163.1 



1 
1 

MTKSNGEEPK 
GTJLGPTLRP 
MRAWYYMTT 
NIjVEACFKQP 
VNAliGLWPS 
rVEMKDWGVl 
GTSS8SATIJP 
IiNPGQlITIS 
TTNVLGDSIiG 
KM 



11 
I 

MGGRMERFQQ 
YRMSYRSVKY 
TIIAWIGII 

MCFGFVIGNM 
GG0LAKYTVT 
ITFKCLEEKm 
ITATAASIGA 



21 
\ 

GVRERTLIAK 
FSFPGKItLMR 
1VIIIHPGKG 
VPIQANETLV 
KEQGGALHBF 
VIVGLLIHAV 
GVDKHVTRFV 
AGrPQAGLVT 
ELXNRDVEMG 



31 
1 

KKVCNITKED 
KLQKLVLPLI 



FDSIjHEAIWR 
EVIiPItLYFIiV 
LFVGATXNHD 
MVIVLTSVGL 



41 

1 

VKSYLFRNAF 
ISSLVTGMAA 
XVRVTAADAF 
METLTRITEE 
LVAV1MWYAP 
TFKNPWVFIG 
GTALYEALAA 
PTDDITIiIIA 
KPYQLLAQISjJ 



51 
1 

VI»I/rVTAVIV 
UJSKASGKMG 
LDKLRNMFPP 
LVFVPGSVNG 
VGILFIilAGK 
GUiOAliITAL 
IFIAQVNNFE 
VDVTPLDRLRT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID MOs 259 DKA sequence 

Hucleic Acid Acce b a ion HM_021948,X 

Coding sequence: 48.. 2783 



1 
\ 

TGTGGCACTG 
TCCTGCCCCT 
TGGAAGGAGA 
TGCAGGGG3T 
OGCGGAGCCG 
GCCGGGAGGC 
GGTTCCGOQT 
QCGAGGTGCG 
ACAGCRGCGA 
CTGCCCGCEA 
ACATOGOCAC 

ACGGAGACAT 
TCTAIGATBT 
CAGAGAAGCT 
CCACCACGGG 
GGCTAGCTGA 
GCTTGCCTGS 
ACAGCX33CTT 
CCTCCAACCC 
AGACCCTGGA 

tctactccat 



11 21 

] I 
CCTGOGTACC CAACGCCAGC 
GCTGGCAGCC CTGGTCCTGG 
CAQCrCAGAG GACOGCGCTT 
GCTCGGCGGC GOCCTCACCA 
CCX3GGCTGTG CTGGGCTCTC 
AGAGGTGCTG GTGGCGCGGG 
GGCACTGCCT GCGXAOCCAG 
CCCCAACGAC TCAGGTATCT 
OGCTGTGGAG GTCAAGGTCA 
TGCTTTCTCC TTTTCTGGGG 
C3CCGGAGCAG CTCXATGCOG 
GTCGGATCAG ACGGTGAGGT 
GGATGGCTTC CCCGGGGTOC 
GTACTGTTAT GCTGAAGACC 
GACATTGGAG GAAGCACGGG 
CCAACTGTAT GCAGCCTGGG 
TGGCAGTGTG CGCTACCCCA 
TGTCAAGACT CTCTTCCTCT 
CAAOGTCTAC TGCTTCCGAG 
AGCCTOCAAC CCAGCCTCTG 
GGAACTGCAG CTGOCTCAGG 
CCCCATCATG GASGACGGAQ 
TAGGACGCTC CTA0AATTTG 



31 
1 

CCTGGGTAGC 
OCCAGGCTCC 
TT0GC3GTG0Q 
U'CCCTTGCCA 
OGOGGGTCAA 
GAGTGCGCGT 
OQTOQCTC3«2 
ATCGCTGTGA 
AAGGGGTCGT 
CCCAGdAGGC 
CCTAOCTTGG 
ATCCCATCCA 
GGAACTATGG 
TAAATGGAGA 
CGTACTGCCR 
ATGQTGGCCT 
TCGTCACACC 
TCOCCAACCA 
ACTGGGCCCA 
ATGGACTAGA 
AAGCCACAGA 
GAGGTGGAAG 
AAACACAATC 



41 

! 

CTGCAGCATG 



CATCGGGGGC 
CGTCC ACTAC 
GTGGACTTTC 
CAAGGTGAAC 
CGACGTCTOC 
GGTCCAGCAC 
CTTTCTCTAC 
CTGTGOO0GC 
GGGCTATGAG 



TGTGGTGGAC 
ATTGTTOCTG 
GGAGCGGGGT 
GGACCACTGC 
CAGCCAGCX3C 
GACTGGCTTC 

GOCTTCTGOC 
GGCTATCGTC 
GAGTGAATCC 
CTOCACTOCA 
CATGGT7UXX3 



51 
I 

GCCCAGCTGT 
GCAGATGTTC 
GACGCGCCAC 
CTGOGGCCAC 
CTGTCCCGGG 
GAGGOCTACC 
CIGGOGCTGA 
GGCATGGATG 
GGAGAGGGCT 
ATTGOAGCCC 
CAATGTGATG 
GAGQCCTGTT 
CCGGATGACC 
GGTGACCCTC 
GCAGAGATTG 
AGCCGAGGGT 



CCCAATAAGC 
ATCCCTGAGG 
ACAGTGACAG 
CGTGGGGCCA 
GAAGACCCAG 
CCGAOQGGGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



1051 



WO 03/042661 



PCT/US02/36810 



TCTCAGAAGA GGAAGGTAAG GCATTGGAGG AAGAAGAGAA ATATGAAGAT BAAGAAGAGA 1440 

AAGAGGAGSA AGAAGAAGAG GAGGAGGTGG AGGATGAGGC TCTGTGGGCA TGGCCCAGCG 1500 

AGCTCAGCAG CCCGGGCCCT GAGGCCTCTC TCCCCACTGA GCCAGCAGCC CAGGAGGAGT 1560 

_ CACTCTCCCA GGCGCCAGCA AGGGCAGTCC TGCAGCCTGG TGCATCACCA CTTCCTGATG 1620 

D GAGAGTCAGA AGCTTCCAGG CCTCCAAGGG TCCATGGACC ACCTACTGAG ACTCTGCCCA 1680 

CTCCCAGGGA GAGGAACCTA GCATCCCCAT CACCTTCCAC TCTGGTTGAG GCAAGAGAGG 1740 

TGGGGGAGGC AACTGGTGGT CCTGAGCTAT CTGGGGTCCC TCGAGGAGAG AGCGAGGAGA 1800 

CAGGAAGCTC CGAGGGTGCC CCTTCCCTGC TTCCAGCCAC ACGGGCCCCT GAGGGTACCA 1860 

GGGAGCTGGA GGCCCCCTCT GAAGATAATT CTGGAAGAAC TGCCCCAGCA GGGACCTCAG 1920 

1U TGCAGGCCCA GCCAQTGCTG CCCACTGACA GCGCCAGCCG AGGTGGAGTG GCCGTGGTCC 1980 

CCGCATCAGG TGACTGTGTC CCCAGCCCCT GCCACAATGG TGGGACATGC TTGGAGGAGG 2040 

AGGAAGGGGT CCGCTGCCTA TGTCTGCCTG GCTATGGGGG GGACCTGTGC GATGrrGGCC 2100 

TCCGCTTCXG CAACCCCGGC TGGGACGCCT TCCAGGGCQC CTGCTACAAG OU2TTTTCCA 2160 

CACGAAGC3AG CTGGGAGGAG GCAGAOACCC AGTGCCGGAT GTACGGG3CG CATCTGGCCA 2220 

1 J QCATCAGCAC ACCCGAGUAA CAGGACTTCA TCAACAACCG GTACCGGGAG TACCAGTGGA 22 BO 

TCGGACTCAA CGACAGGACC ATCGAAGGCG ACTTCTTGTG GTCGGATGGC GTCCCCCTQC 2340 

TCTATGAGAA CTGGAACCCT GGGCAGCCTG ACAGCTACTT CCTGTCTGGA GAGAACTGCG 2400 

TGGTCATGGT GTGGCATGAT CAQGQACAAT GGAGTQACGT GCCCTGCAAC TACCACCTGT 2460 

~ n CCTACACCTG CAAGATGGGG CTGGTGTCGT GTGGGCCGCC ACCGGAGCTG CCCCTGGCTC 2520 

ZU AAGTGTTCGG CCGCCCACGG CTGCGCTATG AGGTGGACAC TGTGCTTCGC TACCGGTGCC 2580 

GGGAAGGACT GGCCCAGCGC AATCTGCCGC TGATCCGATG CCAAGAGAAC GGTCGTTGGG 2640 

AGGOCCCCCA GATCTCCTGT GTGCCCAGAA GACCTGCCCG AGCTCTQCAC CCAGAGGAGG 2700 

ACCCAGAAGG ACGTCAGGGG AGGCTACT&3 GACGCTGGAA GGCGCTGTTG ATOCOCCCTT 2760 

0 _ CCAGCCCCAT GCCAGQTCCC TAGGGGGCAA GGCCTTOAAC ACTGGCGGCC ACAGCACTGG 2820 
ZZ> CCTGTCACCC AAATTTTCCG TCACACCCTG CGCTCACCAG AGGAAGTGAC AACATGAC 

fieq id NO: 260 Protein sequence 
Protein Accession #s NP_06B767.l 

30 1 11 21 32 41 SI 

! I I I I I 

KAQIaFLPLLA ALVIAQAPAA LADVLEGD9S EDRAFRVRIA GTOAPLQGVLG GAI/TIPCHVH 60 

YLRPPPSRRA VLGSPRVKHT FT»SRGREAEV LVARGVRVKV NEAYRFRVAL PAYPASLTDV 120 

^ SLAIaSELSPK DSGIYRCEVQ HGIBDSSDAV EVKVKGWFI. YREGSARYAF 8FSGAQEACA 180 

35 RIGAHIATPE QLYAAYLGGY EQCDAGHJ^SD QTVHYP1QTP REAC YGDMD 3 FPGVRKXOW 240 

DPDDIiYDVYC YAEDLKGELP LGDPPEKLTIj EEARAYCQER GAEXATTGQIi YAAWDGGLDH 300 

CSPGWliADGS VRYPIVTPSQ ROGGQLPGVK TLFIjFPNQTG FPNKESRPNV YCFRDBAQP5 360 

AIPEASNPAS NPASDGLEA1 BTVTETLBEL QI*PQBATESE SRGAIYSIPI MEDGK3GGSST 420 

.p. PEDPAEAPRT LLEFETQSMV PPTGF3EEEG KALEEEEKYE DEEBKEEBEE EEEVEDEALW 480 

4U AWPSELSSPG PEA8LPTEPA AQEESLSQAP ARAVLQPGAS FIiPDGESEAS RPPRVBGPPT 540 

ETLPTPHEEN LASPSPSTLV EAREVGEATG GPELSGVPRG E6EETGSSEG APSLLPATRA 600 

PE gTRE LEAP 9EDNSGRTAP AGT8VQAQFV LPTDSAfiRQG VAWPASGDC VPSPCHNGGT 660 

CLEJSKESVRC LCLPOYGGDL CDVGDRFCNP GHDAFQGACY KHFSTRRSWB EAETQCRMYG 720 

A HLA5ISTPEI BQDFINKRYR BYQWIGLNDR TIEGDFLWSD GVPfcLYENWK PGQPDSYFIjS 780 

GBWCWMVWH DQOQWSDVPC NYHLSYTCKM OLVSCGPPPE LPIiAQVFGRP KUBXEVDTVh 840 

RYRCREGLAQ RNLPliIRCQE NGRHEAFQIS CVPRRPARAIi KPEBDPEGRQ GRIiLGRWKAL 900 
LIPPSSEMPG P 

Seq ID NO= 261 DMA sequence 
JyJ NucXelc Acid Accession #i NM_004386.1 
Coding sequence ■ 2.. 3967 

1 11 21 31 41 51 

<C 1 I > 1 I I 

gatgggggoc ccgtttgtct gggccttggg CCTTTTGATG CTGCAGATGC TGCTCTTTGO? 60 

GGCTGGGGAA CAGGGCACAC AGC3ATATCAC CGATGCCAGC GAAAOGGGGC TCCACATGCA 120 

GAAGCTOGGG TCTGGGTCAG TGGAGQCTGC GCTGGCGGAG CTGGTtSGOCC TGCCCTGTCT 180 

CTTTACCCTG CAGCCACGGC CAAGCGCAGC CCGAGATGCG CCTCGGATAA AGTGGACCAA 240 

Ad GGTGCGGACT GOGTCGOGCC AGCGACAGGA CTTGCCCATC CTGGTGGGCA AGGACAATGT 300 

OU CGTQAGGGTG GCCAAAAGCT GGCAGGGAOG AGTGTCACTG CCTTC5CXACC CCCGGCGCCG 360 

AGCCAACGCC ACGCTACTTC TGGGGCCACT GAGGGOCAGT GACTCTGGGC TGTACCGCTG 420 

CCAGGTGGTG AGGGGCATCG AGGATGAGCA GGACCTGGTG GCCTTGOAGG TGACAGGTGT 480 

TGTGTTCCAC TACCGATCAG CCCQGGACCQ CTATGCACXG ACCTTOGCTG AGGCCCAGGA 540 

GGCCTG CCGT CTCAGCTCAG CCATCATTGC AGCCCCTOQG CATCTACAQG CTGCCTTTGA 600 

OZ> GGATGGCTTT GACAACTGTG ATGCTGGCTG GCTCTCTGAC CGCACTGTTC GGTATCCTAT 660 

CACCCAGTCC CGTCCTGOTT GCTATGGCGA CCGTAGCAGC CTTCCAGGGG TTCGGAGCTA 720 

TGGOAGGCGC AACCCACAGG AACTCTACGA TGTGTATTGC TTTGOCCGGG AGCTflGOGGG 780 

CGAGGTCTTC TACGTGGGCC OGGCCCGCCG CCTGACACTG GCCGGC23CGC GTGCACAGTG 840 

„ CCGCCGOCAG GGTGCCG GGC TQC3CCTCGCT OOGACAGCTG C3VCCTGGCCT GGCATGAGGG 900 

/Vl CCTGGACCAG TGCGACCCGG GCTGGCTGGC CGACGGCAGC GTGCGCTACC CGATCCAGAC 960 

GCCCCGCCGG CGCTGC3GGGG GCCCZUsOCCC GOGCGTGCGC ACCGTCTACC GCTTCGCTAA 1020 

CCGGACCGGC TTCOCCTCAC CCGCCGAGCG CTTCGACGCC TACTGCTTCC GAGCTCATCA 1080 

CCCCACGTCA CAACATGGAG ACCTAGAGAC CCCATOCTCT GGGGATGAGG GGGAGATTCT 1140 

GTCAGCAGAQ GGGCCGCCAG TTAGAGAACT GGAGCOCACC CTGGAGGAGG AAGAGGTGGT 1200 

JZ> CACCCCTGAC TTCCAOGAGC CTCTGGTGTC CAGTGGGGAA GAAGAAACCC TGATTTTGGA 1260 

GGAGAAGCAG GAGTCTCAAC AGACCCTCAG CCCTACCCCT GGGGACCCCA TGCTGGCCTC 1320 

ATGGCCCACT GOGGAAGTGT GGCTAAGCAC GGTGGCCOCC AGCCCTAGCG ACATGGGGGC 1380 

AGGCACTGCA GCAAGTTCAC ACACGGAGGT GGCCCCAACT GACCCTATGC CTAGGAGAAG 1440 

fi GGGGCGCTTC AAAGGGTTGA ATGQGCGCTA CTTCCAGCAG CASGAACOGG AGC3CGGGGCT 1500 

GCAAGQGGGG ATGGAGGCCA GCGCCCAGCC CCCCAOCTCA GAGGCCGCAG TGAACCAAAT 1560 

GGAGCCTCCG TTGGCCATGG CAGTCACAGA GATGTTGGGC AGTGGCCAGA GCCGGAGCCC 1620 

CTGGGCTGAT CTGACCAATG AOGlGGATAT GGCTGGAGCT GGTTCTGCTG GTGGCAAGAG 1680 

CTCCCCAGAG CCCTGGCTGT GGCCCCCTAC CATGGTCCCA CCCAGCATCT CAGGCCACAG 1740 

CAGOGCCJCCT GTCCTGGAGC TAGAGAAAGC CGAGGGCCCC AGTGCCAGGC CAGCCACCCC 1800 

1052 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



AGACCTGTTT TGGTCCCCCT TGGAGGOCAC TGTCTCRGCT CCCAGCCCTG CCCCCTGGGA I860 

GGCATTCCCT GTGGCCACCT CCCCAGATCT CCCTATGATG GCCATGCTGC GTGGTCCCAA 1920 

AOAGTGGATG CTACCACACC CCACCCCCAT CTCCACCGAG GCCAATAGAG TTGAGGCACA 1980 

TGGTOAGGCC ACCGCCACGG CTCCACCCTC CCCTGCTGCA GAGACCAAGG TGTATTCCCT 2040 

GCCTCTCTCT TTGACCCCAA CAGGACAGGG TGGAGAGGCC ATGCCCACAA CACCTGftGTC 2100 

CCCCAGGGCA GACTTCAGAG AAACTGGQGA GACCAGCCCT GCTCAGGTCA ACAAAGCTGA 2160 

GCACTCCAGC TCCAGCCCAT GGCCTTCTGT AAACAGGAAT GTGGCTGTAG GTTTTGTCCC 2220 

CACTGAGACT GCCACTGAGC CAACGGGCCT CRGGGGTATC CCGGGGICTG AGTCTGGGGT 2280 

CTTCGACACA GCAGAAAGCC CCACTTCTGG CTTGCAGGCC ACTGTAGATG AGGTGCAGGA 2340 

CCCCTGGCCC TCAGTGTACA GCAAAGGGCT GGATGCAAGT TCCCCATCTG CCCCCCTGGG 2400 

GAGCCCTGGA GTCTTCTTGG TACCCAAAGT CACOCCAAAT TTGGAGCCTT GGGTTGCTAC 2460 

AGATGAAGGA CCCACTGTGA ATCCCATGGA TTCCACAGTC ACGCCGGCCC CCAGTGATGC 2520 

TAGTGGAATT TGGGAACCTG GATCCCAGGT GTTTGAAGAA GCCGAAAGCA CCACCTTGAG 2580 

CCCTCAGGTG GCCCTGGATA CAAGCATTGT QACGOCCCTC ACQACCCTGG AGCAGGGGGA 2640 

CAABGTTGGA GTTCCAGOCA TGTCTACACT GGGCTCCTCA AGCTCCCAAC CCCACOCAGA 2700 

GCCAGAGGAT CAGGTGGAGA CCCAGGGAAC ATCAGGAGCT TCAGTGCCTC OGCATCAGAG 2760 

CAGTCCCCTA GGGAAACCGG CTGTTCCTCC TGGGACACCG ACTGCAGCCA GTGTGGGCGA 2820 

GTCTGCCTCA GTTTCCTGAG GGGAGCCTAC GGTACCGTGG GACCCCTCCA GCACCCTGCT 2880 

GCCTGTCACC CTGGGCATAG AGGACTTCGA ACTGGAGGTC CTGGCAGGGA GCCCGGGTGT 2540 

AGAGAGCTTC TGGGAGGAGG TGGCAAGTQG AGAGQAGCCA GCCCTGCCAG GGACCCCTAT 3000 

GAATGCAGGT GGGGAGGAGG TGCACTCAGA TCCCTGTGAG AACAACCCTT GTCTTCATGG 3060 

AGGGACAXGX AATGCCAATG GCftJCCATGTA TGGCTBTAGC TGTGATCAGG GCTTCGCCGG 3120 

GGAGAACTGT GAGATTGACA TTGATGACTG CCTCTGCAGC CCCTGTGAGA ATGGAGGCAC 3180 

CTGTATTGAT GAGGTCAATG GCTlTGTCTG CCTTTGCCTC CCCAGClATG GGGGCAGCTT 3240 

TTOTGAGAAA GACACCGAGG GCTGTGACCG CGGCTGGCAT AAGTTOCAGG GCCACTGTTA 3300 

CCGCTATTTT GOCCACOGGA GGGCA1GGGA AGATGCCGAG AAGGACTGOC GCCGCCGCTC 3360 

CGGCCACCTG ACCAGCGTCC ACTCACOGGA GGAACACAGC TTCATTAATA OCTTTGGGCA 3420 

TGAAAACAOS TGGATCGGOC TGAACGACAG GATCGTGGAG AGAGATTTCC AGTGGACGGA 3480 

CAACACCGGG C1GCAATTTG AGAACTGGCB AGAGAACCAG CCGGACAATT TCTTCGCGGG 3540 

TBGCGAGGAC TGTGTGGTGA TGGTGGCGCA TGAAAGCGGG CGCTGGAACG ATGTCCCCTG 3600 

CAACTACAAC CTACCCTATG TCTGCAAGAA GGGCACAGTG CTCTGTGGTC CCCCTCCGGC 3660 

AGTGGAGAAT GCCTCACTCA TCGGTGCCCG CAAGGCCAAG AACAATGTOC ATGCCACTGT 3720 

AAGGTACCAG TGCAATGAAG GATTTGCCCA GCACCATGTG GTCACCATTC GATGCCGGAG 3780 

CAATGGCAAG TGGGACAGGC CCCAAATTGT CTGCACCAAA CCCAGAOGTT CACATCGGAT 3840 

GCGGGGACAC CACCACCACC ACCAACACCA CCACCAGCAT CACCACCACA AATCCCGCAA 3300 

GGAGCGCAGA AAACACAAGA AACACCCAAC GGAGGACTGG GAGAAGGACG AAGGGAATTT 3960 

TTGCTGAAGA ACCfiGAAAAA AGAAAGCACA ACACCTTTCC CATGCCTCCT CTGGAGCCTT 402o' 

CGCCTGGGGA GACAGAAC CC AGAGAGAAAC AAGAGAGTCC AGAAGTCCCT GAACCCCAAA 4080 

CTGTTCTCGC AAAAAAAATA TTCCTTTGAA CAARGGTCTT CTTTXCCTTr TTTTACATAC 4140 

ACAAGATCTT CTTGGCAGGT GGAGCCAGGT GTCTGAAAAG TTCATTCTCG TCTGGCTGAA 4200 

CTCTGGGAGT GTGTCCCAGC TGAGGGAAQC ACAAGTAGCA AAGCTCATTG GTCTGGTCTC 4260 

TTGTTTGCCA GGCTGATTGA AGCAGGCCTT GATGAGGGTG CATGAGTGTA TGTTTGCATT 4320 

CAGATGAAGG AATTGCTTTT CACACGAGAA ATTCAGACTT AGTCAATGTT GGCTGAATTC 4380 

CTAAATCCAG GAAGAAGCCT GGACGTAGGG TCATTAGCTT TGGGAATAGA AGGCTACACA 4440 

GAAGCACACT GTTTTTGAAC TTGACAACAG CTCTCOCTTT ACCCTGGACT TCAGCCCAAG 4500 

TTCCGTCTTT GGTCTTGGTG GATAAACACA CAGTGTGGAG ATCCCACGTA CTGCATTTTA 4560 

GGGATGTTTT TAGGACAACC TCCCTCCATG CCTTCAGAGT TAGGAGTGAG AATGATCAAA 4620 

GCAATATGTA GGTGATGGAG GGAGAGTGTA TTGCTAACCC TTCCAGGTCT AGTCCAGCGC 4680 

TGAGATTTGG TGGTTCTGCA TGTGTGATGA ATCTCTTTCA CACAAATAGA CGAGAGGATA 4740 

TTTAGGGCTA GATGAGCCCA GATTTCTTCC CCCTCCATCT CTCAGGGAGA CAAAGAACGT 4800 

CCTTCCTGGA CCAAGGAGGT GCTGCCAAGT TTTCTAGCCC AGTGCACA.TA CCCAGTCCTT 4B60 

AAGCAGACAT TGGTAGTGCC CCTGCCCTGG GTCCCACTCC TGCCCCACCC CACCCTTGTC 4920 

CCTGGCCATT GCCTGGTGGT CTAGAAACAC TTAAAACTTG ARGTAGTGAC ACCTACCTGC 4980 

GGTCATATTG TAGAGAGATG CTCAGTGTTA AAACTGAAAC ACACAAACAC ACACACACAC 5040 

ACATTTTTCT CTTGTAGATT T1AATTTTTT AAGTGGGAAA GAACTCACCT TSOCTTCCTC 51 00 

CCCCAAATGT GCAACCTGTA AAAGGTCTCT CCACACCAGG GGCCAGGATC CAGTTCCCTC 5160 

ATCTCTGGCA GGAAAGATCC ACAGCTTTTC CTCCATGTCI GTTACTCACT TTCAGCAGTG 5220 

CGGGTAAAAT CTGTGGATCA GGGTTAAAAA AGCACCGTGG AGAATGGCCC TCTTCAGGAA 5280 

AGAAAAATAA GCAAATGAAT GGTCCACCTA GGGGTTCAGT AAAGAAAGAA ATGTGTTAAC 5340 

TGAGCCTGAA TCXZCTTCTGG GAAGTAATAA TGACCATTGA CAACTAAGAA GTAQACACCA 54 OO 

TGCTAAAGAC TTACATACAA TCTCGTTGAA TCTTCTCAAT AGCCCATTGA CTTAGAAACT 54 60 

GTTACTTTGC CATTTTACAC AC3VGTGAAAC TGAGGCTCAG ATATAAAGGA AAGGTACTGG 5520 

CTTGAAGTCA CAACCACGAC AGGAGTAAGG ATTTGGAATA AGGATTTGGT CCTGTTTTCT 5580 

GGACCAAATC CTTACTCTGG CTCTGCTTCAC ACTITCTCTC CATCACCAAA TCCTTACTCC 5640 

AAATCCAGAA GTCAGAGGCA ACTCCCATCT TGGTTCTGAC CCAAATOCTG CTCTGGACTC 5700 

TGGAGAGGAG ATTGAAATAT AATTGCACCC TCATACACAT ITAGGAAATG GTTAAGAAGT 5760 

GTAAACTGAA CCCTTATCCT TGTCTTCAAT CTTCCTCCCT GTAGACATCT ATCTTATTAT 5820 
GGTTATTATT CAGAAAACCC AGGGATACAG GTTTGTCTTC TTAOTTTGAT AACTCTTCTT 5880 
AGTTTAAAAT AATAATAATA ACACATCTTT GGTCATCTAT GTCACACAAA AATTTTCCTT 5940 
TGTTTGCGGG GGGCTGGGGA TGCAGTGTTT TTTGGGGGGT CTTGGTTTAT GCTCCCTGCC 6000 
CTTGAGCCCC TCAGCOGTTT GCCCTGCCCC CACCTCGGCT CCATGGTGGG AGGGGGCTCT 6060 
GGTCTTTTCT AAAGTGGGCG GTTTGTCTTT TGATCXTTCC CTTTTGGATG TGCGTGTGTG 6120 
TCTGCGTGTG CCATGTGCGT GGCACGCATA TGAGTGTGTG TGCGIGTGAA CGGCTTTGGG 6160 
TCCTGCTGGT TTTGCTGTGA GCTGCAGTGT TCTGTGGGTC TGTGGTATCT GACACTGTGQ 6240 
ACATTAATGT ACTTCTTGGA CATTTTAATA AATTTTTTAA CAGTTCAAAA AAAAAAAAAA 6300 
AAAAAAAAAA 



80 



Seg ID NO; 262 Protein sequence 
protein Accession #r NP_004377.i 



1 11 21 31 41 51 

I 1 I I i 1 

MGAPPVWALG liKLQMLDFV AGEQGTQDIT DASEEGLEMQ KLGSGSVCAA LASLVALPCL 

1053 



60 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 
20 



FTLQPRPSAA 
ARATIxbliGPI* 
ACRLSSAIIA 

LDQCDPGHUV 
PTSQHGDkET 
EKQBSQQTLS 
GRFKGLNGRY 
HADI»TNEVDM 
DLFWSPLEAT 
GEATATAPPS 
HSSSSFWPSV 
PWPSVYSKGL 
SG1WEFGSOV 
PEDQVETQGT 
PVTIjGIEDFE 
GTCNANGTMY 

ENTW IGIJTDR 
NYHLFYVCKK 
NGKWDRFQIV 
c 



RDAPRIKWTX 
RASDSGLYRC 
APRHLQAAFE 
VYCFARELGG 
DGSVRYPIQT 
P8SGDEGEIL 
PTPGDPKUlS 
PQQQEPEPGL 



VSAPSPAPWB 
PAAETKVYSL 
NRNVAVGFVP 
DASSPSAPLQ 



SGASVPPHQS 
LEV1AGSPGV 
GCSCDQSFA3 
GWHKFQGHCY 
IVERDFQWTD 
GTVLCGPPPA 
CTKPRRSHRM 



VRTASGQRQD 
QWRGIBDBQ 
DGFDNCDAGW 
EVPYVGPARft 
PRRRCGGPAP 
SAEGPFVREL 
WPTGEVWLST 
QGGMBASAQP 
SPBPWLttPPT 
AFPVATSPXJL 
PI»SI»TPTGCjG 
TBTATEPTGL 
SPGVFTiVPJCV 
PQVALDTSIV 
SPLQKPAVPP 
BSFWEEVASG 
KHCBIDrDtJC 



HTGLQFENWR 
VKNASLIGAR 



LP1LVAKDNV 
DLVPLEVTGV 
LSDRTVRYPI 
I/niAGARAQC 
GVRTVYRFAN 
EPTLEHEEW 
VAPSPSDMGA 
PTSEAAVHQM 
MVPPfilSGHS 
PMMAMLRGPK 
GEAMPTTPBS 
RGIPG8ESGV 
TPNLEPWVAT 
TPLTTLEQGD 
GTPTAAeVGE 
EEPAliPQTPM 
LCSPCBWGGT 
DAEKDCZRRHS 
ENQPDNFPAG 
KAKNNVHATV 



VRVAKSWQGR 
VFHVRSARDR 
TQSRPGCYGD 
RRQGAAIiASV 
RTGFPSPAER 
TPDFQEPLVS 
GTAASSHTEV 
EPPLAMAVTE 
RAPVLELEKA 
EWMLPHPTPI 
PRADFRETGE 
FDTAESPTEG 
DPGPTVNPMD 
KVGVPAMSTL 
SASVSSGEPT 
HAGAEEVHSO 
CIDEVKGFVC 
GHLTSVHSPE 
GEDCWMVAH 
RYQCNEGPAQ 



VSLPSYPRRR 
YALTFAEAQE 
RSSbPGVRSY 
GQlrHLAWHBG 
FDAYCFRAHH 
SGKEETLILE 
APTDPHPRRR 
MLGSGQSRBP 
EGPSARPATP 
STKANRVEAH 
TSPAQVNKAE 
LQATVDEVQD 
STVTPAPSDA 



VPHDPSSTL1> 
PCEHNPOiHG 
LCLPSYGGSF 



HHWTXRCR9 



120 
180 
240 
300 
360 
420 
480 
S40 
600 
€60 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID NO t 263 DMA sequence 
25 Nucleic Acid Accession ft> Eos sequence 
Coding sequence: 7.. 2 085 

1 11 21 31 41 51 

1H 1 1 1 1 1 1 

■3U GCCGCGATGG CCAGCACCAG QAQTATCGAG CTGGAGCACT TTGAGGAACG GGACAAAAGG 60 

CCGCGGCCGG GGTCGCGGAG AGGGGCCCCC AGCTCCTCCG GGGGCAGCAG CAGCTCOGQC 120 

CCCAAGGGGA ACGGGCTCAT CCCCAGTCCG GCGCACAGTG CCCACTGCAG CTTCTACCGC 180 

ACGCGGACCC TGOAGGCCCT CAGCTCGGAG AAGAAGGCCA AGAAGGCGCG CTTCTACCGG 240 

AACGGGGACC GCTACTTCAA GGGCCTGGTG TTTGCCATCT CCAGCGACCG CTTCCGGTOC 300 

35 TTDGATQCGC TCCTCATAGA GCTCACCCGC TCCCTGTCGG ACAACGTGAA CCTGCCCCAG 360 

GGTGTCCGCA CTATCTACAC CATCGACGGC AGCOGGAAGG TCACCAGCCT GGACGAGCTG 420 

CTQGAAGGTG AGAGTTACGT GTGTGCATCC AATGAACCAT TTCGTAAAGT CGATTACACC 480 

AAAAATATTA ATCCAAACTQ GTCTGTGAAC ATCAAGGGTG GGACATCCOG AGCGCTGGCT 540 

GCTQCCTCCT CTGTGAAAAG TGAAGTAAAA GAAAGTAAAG ATTTCATCAA ACCCAAGTTA 600 

40 GTGACTGTGA TTCGAAGTGG AGTGAAGCCT AGAAAAGCCG TGOOGATCCT TCTGAATAAA 660 

AAOACTGCTC ATTOCTTTGA ACAAGTCTTA ACAOATATCA CCGAAGCCAT TAAACNAGCC 720 

TCAGGAGTCG TCAAGAGGCT CTGCACCCTG GATGGAAAGC AGGTGAGAGr TACGTGTGTG 780 

CATCTGCCAG ACTTTTTTGG TGATGACGAT GTTTTTATTG CATGTGGACC AGAAAAATTT B40 

CGTTATGCCC AAGATGACTT TGTCCTGGAT CATAGTGAAT GTCGTGTCCT GAAGTCATCT 900 

45 TATTCTCGAT CCTCAnCTGT TAAGTATTCT GGATCCAAAA GCCCTGGGCC CTCTOGAOGC SSO 

AGCCAGATTT CTGCTCATGG CAGATCTTCT TCCAATGTAA ACGGTGGACC TGAQCTTGAC 1020 

CGTTGCATAA GTCCTGAAGG tgtgaatgga aacagatgct ctgaatcatc AACTCTTCXT 10 bo 

GAGAAATACA AAATTGGAAA GGTCATTGGT GATGGCAATT TTGCAGTAGT CAAAGAGTGT 1140 

ATAGACAGGT CCACTGGAAA GGAGTTTGOC CTAAAGATTA TAGACAAAGC CAAATGTTGT 1200 

50 GGAAAGGAAC ACCTGATTGA GAATGAAGTO TCAATACIGC GOCGAGTGAA ACATOCCAAT 1260 

ATCATTATGC TGGTCGAGGA GATGGAAACA GCAACTGAGC TCTTTCTGGT GATGGAATTG 1320 

GTCAAAGGTG GAGATCTCTT TGATGCAATT ACTTCGTCGA CCAAQTACAC TGAGAGAGAT 13 BO 

GGCAGTGCCA TGGTGTACAA CTTAGCCAAT GCCCTC3U3GI ATCTCCATGG OCTCAGCATC 1440 

QTGCACAGAG ACATCAAACC ABAGAATCXC TTGGTGTGTG AATATCCTGA TGGAACGAAG 1500 

55 TCTTTGAAAC TGGGAGACTT TGGGCTTGCG ACTGTGGTAG AAGGOCCTTT ATACACAGTC 1560 

TGTGGCACAC CCACTTATGT GGCTCCARAA ATCATTGCTG AAACTGGGTA TGGCCTGAAG 1620 

GTGGACATTT GGGCAGCTG6 TGTGATCACA TACATACXXC TGTGTGGATT OCCACCATTC 1680 

CGAAGTGAGA ACAATCTCCA GGAAGATCTC TTCGACCAGA TCTTDGCTGG GAAGCTGGAG 1740 

_ _ TTTCCGGCCC CCTACTGGGA TAACATCACG GACICTGGCA AGGAATTAAT CAGTCAAATG 1800 

60 CITC3U3GTAA ATGTTGAAGC TCGGTGTACC GCGGGACAAA TCCTGAGTCA CCCCTGGOTG I860 

TCAGATGATG CCTCCCAGGA GAATAACATG CAAGCTGAGG TGACAGGTAA ACTAAAACAG 192 0 

CAE7TTAA1A ATGCGCTCCC CAAACAGAAC AGCACTACCA CCGGGGTCTC CGTCATCATG 1980 

GTGAGTGGAA GQCGGCAGGT CTGGGCTGAC TGCGGAGCCG GCCTTGAAGT TTTTGAATTA 2040 

. _ GGTAGCCGGG AGCTGCCCTC ACATGGAAGT TGGTGCCTTC CGTAGTCCTA TTTCATATGA 2100 

65 AGATTGGCTT GGCATGTOGA GGGCACTCAT TCGGCAACTC CCAGGCXTTG GGCACTGTGT 2160 

GGAGGGGCTT GTGTAGGGAC CAGCAGGCCT GGTG^GAGGG GTCCAGGOGT CAAGGAGCTC 2220 

CTGGCTGGGC CCTCTGGX3CA GCTGCTTCCA CTCTTGTCTC TGCCTTCXCA TCTAGAGAGA 2280 

CTCCCAAGCC CTGGAGGGGT GTOTTGTGTT AGGAATTAAC TCCCTGCCTA CGCCAAGGCC 2340 

TCAGAAATAG ATT AT T AGAG ATGTGAATTA TTCTTTGAGA CTTGGGATAA GAAACAGCCA 2400 

70 AAGCTAAACA TATTTCAGTT TTAAAAAATC AGTGTTTTAT AAAACACAGT TTGGGGCTTT 2460 

TAAAGGTACA TAATCAAGGA AAAAAATATA TATTCATTTT TCAGGGTTGG TAACATTTTA 2520 

TGAGATGTCA GTGACAACGA TGGCCTTATT TTTTTCAGCXr TTTTCTTCTT CCAAAATGTT 2580 

TCTTAAGGCA ACTCTCCTAA ATACATAAAC ACAACAAATT AAAATOAAAA GTGACATGAG 2640 

AGTAAATGAA TCAAAAGGAA AAAACATTGA AOCAGAGGTG AGGGCAGCAC ACCCaCAGCA 2700 

75 GCTOTCCAGG CCTGAGCCAA TGCAACCCXG GGCGGGAAGG CCAGCTCAJDC GTGAGCAGGT 2760 

AGAAGCCAGC CAGOCACCCA GGCAGGGACC TTGGTTCTCC CCACACACTC CCAGGAGCAG 2820 

GQAACAGGGG TGGAGTGGCC TTTCCCAGAG CTGGAGTTGG CTGCAGCAGC TTTCGAATCA 2880 

GACCTGCCAA GGTGATGGGC GTCTGAGTTT CACATCTGGG COCCCCOTGA COCCACTGAG 2940 

or _ TCCTGACAGC TAAGQATGQG CCACCTCCAC AGCTCCGTCA CTCGTACTTG GGACAGGCCT 3000 

80 CTCATCCTCT GGGAAGGTCC TCCTTGTTTC CTACCCAACT AGAAGGGAAA CAGTGGCATA 3060 

TTCTCATOGr ACATGGTTGT CTGAAAGCCT TACCTAGGAA GACGCAGGGT CTAGATAGAA 3120 

GCTATAAGGA AGCCACACAC ATAACCCACA TCCCCACACC CCCAACATCG COCACACTCC 3180 
CCACACCCCC CACACCCCCC ACATCCCCAC CATAATTACC CCCACCTCCA AATATCTCAT 
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Seq XD NO; 264 Protein sequence 
Protein Accession #: Eos sequence 
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i 
I 

MASTRSIELE 
TLdQAIiSSEKK 
RTIYTIDGSR 



WKRI»CTLiDG 
RSSAVKYSG8 
YKIC5KVIGDG 
MLVEEMETAT 
RDI KPEHTj EjV 
IWAAGVITYI 
VNVKARCTAG 
GRRQVWPDCG 



11 
I 

HFEERDKRPH 
AKKARPTraiJG 
KVTSLDELiE 
KDFIKPKLVT 
KQVRVTCVHLi 
KSPGPSRRSQ 
NFAWKBCID 
ELF LVME LrVK 
CEYPDGTKSL. 
LLCGFPPFRS 
QILSHPWVSD 
AGLEVFBLG3 



21 
I 

PGSRRGAPSG 
DHYFKGLVFA 
GESYVCASNE 
VIRSGVKPRK 
PDF FGDDDVF 
ISAHGRSSSN 
RSTGKEFALK 
GGDliFDAITS 
KLGDFG1ATV 
ENNTjQHDIiFD 
UASQKNNMOA 
REIiPSHGSWC 



31 
1 

SGGSSSSGPK 
IS80RFRSFD 
PFSKVDYTKN 
AVHILLNKKT 
IACGPEKFRY 
VNGGPELDRC 
IIDKAKCCGK 



VEGPLYTVCG 
QXIiAGKZiEFP 
EVTGKIiKQKF 
LP 



41 
I 

GNGLXP9PAH 
AIiLIHLTRSIj 
INFNUSVNIK 
AHSFFQVEiTD 
AQDDFVLDHS 
XSFEGVHGNR 
EHLIELlKVSI 
AMVYNXANAL 
TPTYVAPXII 
APYKDNITDS 
MMALPKQW8T 



51 
1 

SAHCSFYRTR 
gDNVNXrPQGV 
GQTSRAIAAA 
ITEAIKXASG 

CSBSSTLliEK 
LRRVKHPN3I 
RYLHGt*SIVH 
AETGYGUWD 
AKKLISQMLQ 
TTGV8VIMVS 



60 
120 
ISO 
240 
3 DO 
360 
420 
460 
540 
600 
660 



Seq ID NO: 265 DNA sequence 

Nticleic Acid Accession tf: AB0206B4-1 

Coding sequence t 1 . - 1744 

1 11 21 31 41 51 

I I I I I I 

CCOOCTTGTC ATTAATACAT TAAAAAGATT CAATCTTTAC CCTGAGGTAA TTTTGGCCAG 
TTGGTACCGG ATTTATACCA AAATAATGGA CTTOATTGGT ATTCAAACCA AGATATGTTQ 
GAOGGTTACC AQAGGAQAAG GACTOAGTCC TATTGAAAGC TGTGAAGGAT TGGGAGATCC 
TGCTTGCTTT TATGTTGCTG TAATTTTTAT TTTAAATOGA CTAATGATGG CATTATTCTT 
CATATATGGC ACATATTTAA GTGGCAGCCG ATTAGGAGGC CTGGTTACAG TGTTGTGCTT 
CTTTTTCAAT CATGGAGAGT GTACCCGTBT AATGTGGACA CCACCTCTCC OTGAAAGCTT 
CTCATATCCA TTTCTTGTTC TTCAGATGTT GCTAGTGACT CATATTCTCA GGGCTACAAA 
ACTTTATAGA GGAAGCTTGA TTGCACTCTG CATTTCCAAT GTATTTTTCA TGCTTCCTTG 
GCAQTTTGCT CAGTTTGTAC SXCTTACTCA GATTGCATCA TTATTTGCAG TATATGTTGT 
CGGGTACATT GATATATGTA AATTACGGAA GATCATTTAT ATACACATGA TTTCTCTTGC 
ACTTTGTTTT GTTTTGAlGT TTGGGAACTC AATGTTATTA ACT T Cm 1 ATT ATGCTTCTTC 
TTTGGTAATT ATTTGGGGTA TTCTGOCAAT GAAACCACAT TTCCTGAAAA TAAATGTATC 
TGAACTTAGT TTATGGGTTA TTCAAGGATG TTTTTGGTTA TTTGGAACTG TCATACTTAA 
ATACTTGACA TCTAAAATTT TrOGTATTQC AGATGACGCT CATATTGGCA ACTTACTAAC 
ATCAAAATTC TTTAGTTATA AGGATTTTGA TACTTTATTG TATAOCTGTG CAGCGGAGTT 
TGACTTTATG GAAAAAGAGA CTCCRCTGAG ATACACAAAG ACATTATTGC TTCCAGTTGT 
TCTTGTAaTG TTTGTTGCTA TTGTTAGAAA GATTATTAGT GATATGTGGG GTGTCTTAGC 
TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATCAT GGAGAGCTGG TTTACCATGC 
ATTGCAATTG TTAGCATEATA CflGCCCTTGG TATTTTAATT ATGAGACTAA AACTCTTCTT 
GACACCACAC ATGTGTGTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGQATGGCT 
CTTTTGCAAA GTACATCCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 
AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTGTAGGG GAGTTCAGCA ATTTGCCCCA 
AGAAGAACET ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGCGGGTGC 
CATGCCCACG ATGGCAAGTG TTAAGCTCTC TGCACTTGGG CCCATTGTGA ATCATCCACA 
TTATGAAGAC GCAGGCTTGA GAGCCAGAAC AAAAATAGTA TACTCAATGT ATAGTCGGAA 
AGCAGCCGAA GAAGTGAAGC GAGAACTGAT AAAGTTAAAA GTGAACTATT ACATTCTAGA 
AGAGTCATGG TGTGTAAGAA GATCCAAGCC TGGTTGCAGT ATGCCTGAAA TTTGGGATGT 
AGAAGATCCT GCCAATGCTG GGAAAACTCC CTTATGTAAC CTCTTGGTGA AGGATTCCAA 
ACCTCACTTC ACCACTGTAT TCCAGAACAG TGTTTACAAA GTCCTAGAAG TTGTAAAAGA 
ATGACTGCTA CATGACCTGC TGCCTACGGA GAACTACATC TGTAATGGTT TTAATGTTTT 
GCTAAGTCAT GTGTTGTTCA TATCCGAAAA ACTTTTATAG GTAACTGTTT TCAAATAGAA 
AAC3TTTTAT TTGGTCAATT TGAATGTCAT TCTAATTATA AAAATGACTT ACACCTTTAT 
CAATTGGTTA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTTG 
TACTTGACTT TAATATTTGT GCTAAAGTGA GCAAAGCTAC CTGTATAAAG AAAACACAGT 
GGGTTGTGAC AAGGATGACA TGAAAATACA GGACAATTCT GACAATGTAG GGGCTGATTT 
TATAGTGTAA GAACTATTAA TGCCCCTTGC TTCTTTTCTC TGCCXCTXGC TCTTGTCTTT 
TGGACATTTC AGTGATTGTA AGTTCTTOGG TCATOTCRGC CCCTGTCATC AACTTGAGTT 
ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTATCrGTTT GTTTTACTTC 
CTTGTGCGCT TTTTGTTCTC TGTTCTCTTQ TTAATGAAGC TTTTCCTGOC CATTATTAAT 
CCAA&CTCTT GGACCTTGTG GTTAGGAAAT TGCCTTAACT TCCAGGCATA TGGCATTATC 
GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT CTTCOCC3VTA TTTTCTGTCA 
AATAAGTACT GTTTACTCAT TTAGTTGCTT ATCAAGTACT TATTCXTGGT TTTAAAAAAA 
ATTAATGGTA ACTGTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT ATTTTAATAC 
CTTTAAACCA CTTTAAAGTT TTTTCATGTT TAATTATAGT TTTAJVGAAAA ACTATTTTGA 
ACAACCCCAA ATATAGTGCA TCTAGAARCT AATGTATATT TGATTAGACA TCATTTATAG 
TGGAACAOTA GAGTGTAGTA CATGGTEAATT TTTCTTTTAC TATTAftQATA CAATAAAACA 
TGACTAATTT TGCTGTCAAA AATGTAAAGA ATARTGATAA ATGGAGTTTT TTATATTTTA 
CTTTTAAGAT TGCCSGTCTT TAATAAGACA AAGCCTTAAG CCTTATGTEA TAATTTTGGT 
TCTAAAAACC ATCATTTCAG TATAAGGAAT AAGTATATTT OGTCCTCCTC TTTAGTTTTT 
TTCTTCX3AT TTATTTTTAT TTTGAAAAAT TTCTACACCT TCTTTGAATT CCTTGTATGA 
ATTITTGTTT CTTAGAAGTT AATTTGTGTG AAATGAGATT CTTCAARAGG ATGAAAOCTC 
ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA AATTCTAAGC AAAGCGTGAC TATGGCTGAC 
AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT AACCTCATTG 
TGOATTGTCC TTCAGAOCXT AGTOCTCAGG CATGGTTTCT GGTQCCX3LCT CCTGGAAGCC 
GCTGTTCCCT TTCTACCTTC TTACCAGAGC CCAAGGGCAG GCCTGGTCCC GOOaAftGCAG 
CAGCTTGCTG ACATAAGTCA GCTGCAAAGG CIGAGGAGTG TGC CCTCAGA GAAGCACCGC 
CCCCCTjGTCT TGTGCCAGCG CCTAGAGCCG CAGCTCCCAG GGATGCTCCT TCCCTGGftGG 
CAGCCCAGGA GAGGGACTCT GGCAGOGTTC TTC3WSATTTG TGGCCACTGT TTCTCATTTQ 
CTGOTTGACT GTTTXTATTT CTTAGGCTTT TGCTAGTTTT AGAAAATAGG GAAGCAGOCC 
TTGATTTGTG GATTAAAAGC AACATTTGAG CGATGATGCA CAACAGTCCA GGAAAATGGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
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19B0 
2040 
2100 
2160 
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2280 
2340 
2400 
2460 
252 0 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
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3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
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CGGTGGACAC 
TGCTTATCXG 
ATTTT1AAAA 
TCCAAAGAAG 
CTATTATAAC 
GAAGGATCCT 
TAGGGGAGCC 
TCCCAGCCCA 
TTCCATAGGC 
AATATTTTTG 
TTGCTACAAC 
AAAAATGCTT 
GAAAGTATGT 
ATGCCAGTTG 



TTGAGGCTGA 
TGATTGTTGC 
ATTATACTTT 
TTCACATGTA 
TGCTTCATTT 
TTTGTAGCAG 
AGTTTGGAGC 
AG CACTGT GG 
GTACAAAACA 
CTTTAGTATG 
ATTTTCGAAA 
GQCAAAAAAA 
ATCAGGAATA 
TTTACATGTA 



GGATGGGAGX 
TCACCTGAGT 
TACATTTATT 
ATAAGTAGAA 
GCTGGOAACC 
TGTTTATGAA 
AGAGGCCTGA 
AGCATCCACA 
GTATTAAAGC 
AGGAAAGTAA 
ACAAAGTTGG 
ATATAGTGTT 
AAGTGATATT 
CTATATATGT 



TGACATGAGC 
GTGGCTGATT 
TTATATTTTT 
ATTCIGTATA 
ATTAAAAGTA 
TGTAACCCCC 
AGGTCCCTGC 
CCTTTGATGG 



GGATGGGCAA 
GGCTGTATTT 
AAAATAGGCC 
GCATAGGAGT 
TAAATTAAAA 



AGGGAGAGGG 
GTGTACATCC 
CTCACCCCCA 
GGAAAAAAGC 
ATATAAATTA 
AGCAAAATAT 
TATGCAGCCG 
CAATGCAGAT 
GCA1ATTGTX 
AGAAGCGATC 
CTTTAAAAAG 
AGTGATATTA 



Seq ID WO: 266 Protein sequence 
Prohein Accession » = BAA74900.1 



1 
I 

PLVTNTIiKRF 
ACFYVAVIFI 
SYPFLVUQML 
GYIDICXLRK 
Et,StiWVIQGC 
DFMEKETPLR 
LQliiAYTALG 
GSANUQTQWN 
YEDAGLRA&T 
SDPANAGKTP 



11 
I 

NLYPEVILAS 
IiNOLMMAIiFP 
tiVTHItiRATK 
IIYIHMXSLA 
FTCiFGTVILK 
YTKlTiX.PW 
ILIHRBKLFL 
IVGEFSNLPQ 
XIVYSMYSRK 
LCNLLVKDSK 



21 
1 

WYRIYTKIMD 
IYGTYBSGSR 
IjYRGSIj I AliC 
LCFVLMFGUS 
YLTSKIFGIA 
LV7FVATVRK 
TPHMCVMASI* 
EELIBWIKYS 



PEFTTVFQNS 



3L 
I 

IjIQIQTKICW 
LGGLVTVLCF 
ISNVFFHLPW 
MKLTSYYASS 
DDAHIGNLLT 
IISDMWGVIiA 
ICSRQLFGWL 
TKPDAVFAQA 
KLKVNYYILE 
VYKVLEWKB 



AAAATCATGA 



41 
I 

TVTHGEGIjSP 
FFNHGECTRV 
QFAQFVIiliTG 
LVIIWGILAM 
£ KFFS YKDFD 
K3QQTHVRKHQ 
FCKVHPGAIV 
MPTMASVKIjS 
ESWCVRRSKP 



AGGTGCGCGC 
AGCAGTTACA 
GTAATTTCCT 
ATTAAAAATA 
GCTTTTTCCA 
GGCTATATAT 
TGGCCACAGC 
TGGTAGCAGG 
AGCATTTACA 
AAAATAGCTA 
ATARGCCTCT 
ATGAGAAAAT 
TATGAATTTT 
GAAATG 



51 
I 

IESCEGLGDP 
MWTPPI/RESF 
IASLFAVyW 
KPHFI1KIHVS 
TLLYTCAAEF 
FUHGELVYHA 
FAIIAAMSIQ 
AliRPIVNHFE 
GCSM&EIWDV 



Seq ID 2tf0: 267 UNA sequence 
Nucleic Acid Accession #: U26744.1 
Coding sequence: 59,. 1600 



1 
I 

CTTCAAAGAA 
GATTGAAGATT 
GATGAGGGCT 
TAGGTTTGTT 
ATTQCGGGAA 
AGAGGCTGTG 
AATCCATGTG 
GGAAGQCCAT 
AGGGAAGATC 
GATGGTTTAT 
TTTGGAGGGT 
GAAAAAAGTC 



TGAGTGTTCC 
TCACAATTAC 
CAAOCAGCAC 
TGCATTAAGC 
AGA7CAGCCT 
TGTAACCAGC 
TATTAOCAGG 
CAGGIATGCG 
TGCTCCTGAC 
GCTAGAAAAC 
ACAAGCTTCT 
CCGGCTCCTC 
COGGAGAGAG 
GAAGCAGGGA 
CGACCtGCGG 
AGAAAAAAGT 
ATTTAGCATT 
TTAGAAAAGG 
TTCAAACCAG 
CACAGTCACT 
TAACCTCTAC 
GTGAAACCTC 

ATAACTGACT 
GGTTOACAGT 



ATTQTQACTT 
CAGGGGGAGC 



11 
I 

TATAGATGGX 
AGTQOGAAAA 
CAAGATCTGG 
CAGAAGAAAT 
AATGCTCTGA 
CTCTCCACTA 
GAGCAGTCCA 
GGTAAAATTT 
ATGGACAAAT 
GGACGATATG 
OCTTCATTTG 
ACGTTAAATG 
TTGOCTCTTC 
TACTQCCACA 
C&GCTCTGTC 
CAAATGAAAG 
AAGTCCCTGA 
GAGAAGCCAC 
ATGAACGACA 
AGCATGCTTG 
GCAAGGCTGG 
ATCTCTTTCA 
AAGAACAGAG 
CAGGCCAGGC 
AGACAGCGCA 
CTAATGGTCC 
GTAAGTTATG 
TTTTCTCATT 
CATACTAATT 
TTTTATAACT 
GAACGAATTG 

TCTTAGC'rrr 

AGAQATACCC 
ACCCACTCAC 
TTTTTATAAA 
OCATGGCTCA 
ATTCACGTCC 
AATCTCTTTT 
GGAGACGAGA 
TATAGCCTAA 
AGCAAACTC 



21 
I 

TTTGAAAAGT 
GAGGAAATAC 
ATCGCATCCG 
GCAATTTGCA 
ACAACCTGGA 
TTTTTTACCA 



31 
I 

TCRTGCTGTC 
CATGGCAGAA 
ACTCTCCACC 
CCTGGTGGAC 
CCCAAACACT 
GCTCAACAAA 



41 

! 

CCTTCATTGA 



TACAGAACAG 
ATATGGAATG 
GAACTCAACG 
CGGATGCCAA 



CAGTATTTGC 
TAAGATATAT 
ACCAATTCCT 
GTTACACAGA 



TGCATCGACT 
GTGAGAGTAT 
AGGAClGCrr 
AGTACAGGTC 
GCIGTGCrrC 
TCAACTTGGC 



AGAGTTCAAA 

CAGCAGAGTC 

CCATOGATGC 

AAATCTTACA 

CAGAGAAGGC 

AAGATGAGCT 

AGTTGGAGGG 

TOCCCTACTG 
GCTTTTGCTC 

TGCTTCTTTT 
ATCACTACTA 
TCATTTATTG 
TCAAGTTGTT 
TGAGGTTCAT 
ACTGTGAGTA 
AATCAGGCAA 
CCAAAATGTG 
CATCTTTTTG 
TAAAAAGTAA 
GGGTCATTAC 
ACTGGAGCTG 



TGTCAAAATG 
TTTCTCAATG 
TCGGGAAGTT 
ACAGTCAGCC 
CACGCTTATG 
AGCAAATGTG 
GATGGGATTT 
CTGGAGGGGA 
ATGGAAATCA 
CAGCGG'TGAA 
TCACATCGXT 
CCACT CTGTT 
CCGGCTTGAT 
CTCTTOGTCT 
GAATAAGCAG 
GGAGATCCAG 
ACAQCAAAAC 
GGAACAGAGA 
TCTCATGAAG 
CAGGTCTTAA 
TAAT GTATGT 
TCAATGTAGT 
TCCACATCAA 



GATCAQACCC 
GTCATCCGAA 
TTCAGTTCGG 
TTAAATOOCT 
CTCRATTTTO 
GCTCTTCCCC 
CTCTCAGCTT 
ATACTTTTTT 
TCTQAACCTG 



GCl'lTAGGCA 
ATTTCTGACT 
CTCAAACTAG 
AGATCCTGTT 
TCAGATCCTC 
GAAAATGTCT 
GGCTACCGAT 
CATQCCQQTG 
CCTGCTAAGA 
OCTTTCJCAOC 
GATACTTGGC 
CCCTOCXCAG 
GAAGAACACA 
CAGCCACCTC 
CAAAGGCAGC 
AGACTTCGGC 
CCCACCCTGC 
ATGTCTGCTC 
CTACEAAAGG 
CTAACAGTGG 
TCATGCTTCA 
GCTTGAATTG 
AAGAAGAACT 
AGATCCCCAG 
TTCTCTTAAC 
AACCCACAGC 
TTTCATTTTA 
TTTCATCACA 
TGAJHAQAAAG 
AAAGCAGAAT 
TTTCCTTAGC 
TTTTTTCTGG 
TQ3TCAGGCT 



3660 
372 0 
3780 
3840 
3300 
3960 
4020 
4DB0 
4140 
4200 
4260 
4320 
4380 



51 
I 

ATTTTAGAAT 
TGTITGCAGA 
CATGCAAGCT 
TCATAGAAGC 

CCACTCACCA 
CGTTTGATCC 
CATTGTGTGG 
GCAGTGGGGT 
CCA0OGAAGT 
TCTGCCAACA 
CCCCGCAGTG 
TCCATCCGGT 
GCCAACAGTG 
GTTCTCATAG 
AGCTGACTAA 
CCATGTTCCC 
CTCCCAGACC 
GAAGTCCm 
GGCPAATTGC 
AGCAQAGAAG 
TGATTGCTGA 
TAGAGCATOA 
TGGCAGAACT 
TCCAGGAGAG 
AAGAAGAACT 
AGGGGGCTGC 
GTTTGGAAAG 
AGATATATAA 
ATGACATCTT 
AGGTATAAGf 



Seq ID NOt 268 Protein sequence 
Protein Accession #s AAC50426-1 



11 

1 



21 
I 



31 

1 



ACTCAGAAGC 
CTGAAAACCT 
CAATXATTGA 
ACTGTACTCC 
CCITACTGTT 
ACCAGAGCCT 
AAATAGGGGC 
CAAGAGCCAG 



51 



60 
120 
180 
240 
300 
360 
42 0 
480 
540 



60 
120 
180 
24D 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
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VU3GPSFGYT 
VKCSYCHSES 
NALSKSLSCA 
FITRSMLE39 
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9RRELMVQI*E 



TMAERRQLFA 
DPKTELNVSR 
AVKMAliATIiC 
EQSAHSCFSQ 
MMGFRYRCQQ 
SSRBPLHPMF 
NRIiDEEHRLI 
QEIQKLRLEH 
GLMKIiLKSEE 



BMRAQDIoDRI 
UEAVLSTIFY 
GGKlMDKliRY 
QKKVTTiNGFIj 
CHNYQLCQDC 
PDQPEKPU3L 
ARYAARIiflAE 
EQASQPTPBK 
IiRQGVSYVFY 



RliSTYRTACK 

qlnkrmptth 
ifsmisdssg 
dtlmsdpppq 
fwrgkaggsh 
ahivdtwppr 
ssssqppqqr 

CHS 



liRFVQKKCNL 
QIHVEQSISI* 
VMVYGRVDQP 
CLVWLPLLER 
8NQHQMKEYT 
PVTSMdDTLP 
SAPDTSFTID 
liRLLiRQRKDE 



Seq ID NOs 269 DNA sequence 

Nucleic Acid Accession #i HM_001276.1 

Coding sequences 127..127B 



1 
I 

AGTGGAGTGG 
TAGCTGGCAC 
GCCAGAATGG 
TGCTCTQCAT 
GGGJVGCTGCT 
GCCAATATAA 
ATGCTCAACA 
TGGAACTXTG 
TTCATCAAGT 
TGGCTCTACC 
GCCGAATTTA 
TCTGCGGGGA 
GATTTCATTA 
CACAGTCCCC 
TATGCTGTGG 
CCCACCTTCG 
TCAGOAC CGG 
ATCTGTGACT 
GCCACCAAGG 
CAGTACCTGA 
TTCC3VGGGCT 
GCACTCGCTG 

cccccrrcTGG 

GCCTCAGTCT 
GCCCTGGTGG 
GACTCGGGAT 
TGGCAAGGGA 
TGGCAAGCTC 
TACCCCCXGC 

AcrrccccTT 

CGCTTT6CTT 
TCTTCTGGGT 
ATGTT 



11 
I 

GACAGGTATA 
CAGGAGCOGT 
QTGTQAAGGC 
ACAAACTGGT 
TCCCAOATGC 
GCAACGATCA 



21 
I 

TAAAGGAACTT 
GOGCAAGGGA 
GTCTCAAACA 
CTGCTACTAC 
CCTTGACCGC 
CATCGACACC 



31 



GGTCTCAAAO 
CAflTACCGCC 
CTGGACGGAa 
TAAAGGAAGC 
AGGTCACCAT 
GCATCATGAC 
TGTTCCGAGG 
GGTACATGTT 
GGAGGAGCTT 
GAATTCCAGG 
TCCTCCGCGG 
GCAACCAGTG 
AGOATAGGCA 
CCTTCTGCGG 
CAACGTAGCC 
CTCCAGCTGG 
CCCTCCCZTG 
GCAGAGAGGT 
TAGTACACAC 
ATTXCTTCAA 
TATCACCAAG 
AAAGCCAGCT 
CCTAATTCCA 
TGGTCTATCT 
T CCTT C CTCT 



AT TTT CCAAG 
ATTCCTQCGC 
AGACAAACAG 
CCAGCCAGGG 
TGACAGCAGC 
CTACGATTTT 
TCAGGAGGAT 
QAGGCTGGGG 
CACTCT3C3CT 

ccnorrCACC 

AGCCACAGTC 
GQTAGGATAC 
GCTGGCAGGC 
CCAGGATCTG 



CCGGGAGCCT 
GGGCCTATGC 
AGGQATGGGG 
TTGTTGATGA 
CTCCCTGCCC 
GAGCCAAACA 
TGAAACCTTC 
CAGCTGCTCA 
TTGAGCGCCC 
GAGCCTTGGG 



ACAGGGCCTO 
AGAGGCXZACA 
GGCTTTGTGG 
ACCAGCTGGr 
TTCCTCTGTA 
TGGGAGTGGA 
AACCTGAAGA 
ATAGCCTCCA 
ACCCATGGCT 
CATTTTACCA 
AAAAAGCAGC 
TATGACATTG 
CATGOAGCCT 
GCAAGTCCTG 
GCTCCTGCCA 
TCTTCTGAGA 
AAGGAGGCAG 
CATAGAACCC 
GACGACCAGG 
GCCATGGTAT 
CGCTTCCCTC 
CACACAGCAC 
GATCACCTGC 
AGAGGTOCAC 
CTGTGGGGAT 
TTAATGGAAA 
CCTAGCCCTC 
TCCTACARGA 
ACTTAGGAAC 
ATAAAGTACA 
ACTAOACCCA 
ACCCCTGAGC 



41 
I 

GGGAAGAGGC 
CCCTGCCCTG 



CCCAGTACCG 
CCCACATCAT 
ATGATGTGAC 
CTCTCTTGTC 
ACACCCAGAG 
TTGATGGGCT 
CCCTAATCAA 
TGCTGCTCACJ 
CCAAGATATC 



ACaGATTCAG 
GTAAGCTOGT 
CTGGTGTTGG 
GGACCCTTGC 
TCQGCCAGCA 
AAAGCGTCAA 



TCACCAATGC 
GGGGGCCAAG 
CCTGCTGAGT 
AACACACAGA 
AGTGAGGCAT 
TGTTTACAGA 
CTTATCAAftG 
CACAGTGACC 
GTAATCGTGT 
AGAGTTTAAC 
CTGGACTCAC 
TTGCAGAGAT 



HLVPXWNVIE 
LLNFliLiAAFD 
liREVLKLPTE 
UVHVEHVFHP 
SWK3PAKKLT 
SHSVPSSGSP 
AMKQQRQLXA 
LBQRHSALQH 



51 
I 

CCTQTCTAGG 
CTCTGCTGCA 
GCTCCAGTGC 
□GAAGGCGAT 
CTACAGCTTT 
GCTCTACGGC 
TGTCGGAGGA 
TCGCCGGACT 
GGACCTTGOC 
GGAAATGAAG 
CQCAGCACTG 
CCAACACCTG 
CACAGGCCAT 
CAACACTGAC 
GATGGGCATC 
AGCCCCAATC 
CTACTATGAG 
GGTCCCCTAT 
AAGCAAGGTG 
CCTGGATGAC 
CA1CAAGGAT 
GATGCCCCGT 



60 
120 
160 
240 
BOO 
360 
420 
4 BO 



TTTGAGCTCA 
CGCAATGTAA 
TCOCCAAGCC 
GACACCATTT 
ATACTAATTA 
CCCCTATCCT 
AGTGTGTTGG 
CTCCCCCATC 
GAAGGCOGCC 



Seq ID NO: 270 Protein .Begiience 
Protein Accesaioa #j NP_001267.1 

1 11 21 31 

\ 1 1 I 

MOVKASQTGF WLVLLQCC8 AXXLVCtfYTS KSQYREGDG6 
ISNDHIDTWB WHnvniYGML HTLKNKNPNJi KTLI^VGGWN 
KSVPPFDRTH GFDGLDLAWI* YPGRRDXQHF TTLIKH^KAE 
GKVTID8STO lAKISQffiuDF 1SIMTYDFRG AWRGTTGHH9 
VGYMLRLGAP ASKLVMGIPT FGRSFTLASS STGVQAPISG 
DFLRGATVHR TLGCQVPYAT KGNQWVGYDD QKgVKSKVQY 
G6FCGQDLRF PLTNAIKDAL AAT 

fieq ID KO: 271 UNA sequence 

Hucleic AG id Accession #: HM_006474-1 

Coding sequence i 1B1. .669 



41 
1 

CFPDAIiDRFXi 
FGSQRFSKIA 
FIKEAQPGKK 
PIiFRGQEDAB 
PGIPGWPTKH 
liKDRQLAGAM 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13B0 
1440 
15O0 
1560 
1620 
16B0 
1740 
1B00 
1860 
1920 



51 
I 

CTH1IY&FAN 
GOTQ8RRTFI 
QIiI3>SAAIj&A 
PDRFSNTDYA 
AGTLAYYEIC 
VWATjDLDDFQ 



GCTGCCTAGG 
TCCGGCCCCC 
TTCCCCCAGC 
ATGTGGAAGG 

GXXGCCATGC 
AAGICTGGCT 
GAGGATCTGC 
GCCTCAAACG 
GTTGAGAAAa 
GCCATCGGTT 
TCGGCCTAAA 
TTCTGACTCT 
CGGGCCCATT 



11 
I 

GTCTGGAAAG 
CCACCGTCGC 
TCAGAATCTT 



GCACAGGCCA 
CAGGTGCCGA 
TGACAACTCT 
CAACTTCAGA 
TGGCCACCAG 
ATGGTTTGTC 
TCATTGGTGG 
GAGCTGAAGQ 
GTGGCCCTGT 
CAGATTCCAC 



21 
I 

CTCGGGCACC 
GCTCCTCCAG 
GCTGCTCGGC 
GCTCTTCGTT 
GGCAGAAGAT 
AGATGATGTG 
OGTGOGAACA 
AAGCACAGTC 
TCACTCCACG 
AACAGTGACC 
AATCATCGTT 
GTTACGCCCT 
CCCTGAGCTC 
GGTGACTTTC 



31 
I 

CTCCCTCTCC 
GCTGGGCCTG 
CCCCAGOAGA 



QTGftCTCCAG 
AGTGTCAACA 
CACGCGCAAG 
GAGAAAGTGG 
CTGGTTGOAA 
GTGGTXATGC 
GCTTGCCAAC 
GTGGGGAGAA 
CGTTTGCCAA 



41 
I 

GGGGCTCCTG 
TGGCCGCGGT 
GCAACAACTC 
CGTCGCTCTG 
CTACAGGTTT 
GAACCAGCGA 
GTGTAACAC3G 
AACAAAGTCC 
ATGGAGACAC 
TCATAGTTGG 
GAAAAATGTC 
GTGCTTTAAA 
OATGACCCTG 
ATTAACCGAG 



60 
120 
1B0 
240 
300 
360 



51 
I 

creccACCCC 

GCTTTTAATT 
AACGGGAACG 
GGTCCTQGCA 
GGAAGGCGGC 
AGACCGCTAT 
CAT TCG CATC 
AAGCGCCACA 
ACAGACAACA 
GGTCTTACTA 
GGGAAGGTAC 
AAAAQACCGT 
GGAACATTTG 
GAAAGACCTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



1057 



WO 03/042661 



PCTAJS02/36810 



TCACCAGATT TGGTTCTTAA ACTTT 
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9eq ID NO j 272 Protein sequence 
Protein Accession #: NP_006465.1 

1 11 21 31 41 51 

I I i I I I 

MWKVSALLFV LGSASLWVLA EGASTGQPED DTETTGLEGG VAMPOAEDDV VTPGTSEDRY 
KSGLT TTjVAT SVN3VTGIRI EDL.PTSESTV HAQEQSP9AT ASNVATSHST EKVBGDTQTT 
VEKDGLSTVT LVGI I VGVLIi AIGFIGGIIV WMRKMSGRY SP 

Seq ID NO: 273 DMA sequence 

Nucleic Acid Accession «: CAT cluster 



60 
120 



GCGGCCGCCA 



GCGGGCACTG 
GGGGCCAOGA 
TCCGGGGCAG 
GAGAAGGGCG 
ACGGAGGCGG 
GTACTGCGGG 
GGAAACCTAC 
CCCTTGACTG 
GCCGCTGCGA 
CTTCTTTCCA 



11 
I 

GCTTGCAAAG 
CCGAGCCGGC 
GGGGCGCGGG 
AGGTCCCGCT 
GCGGCGGCGG 
CGGAGGCCGT 
CAGTCTTGCT 
GACCCCCGGA 
TQTACCCCGA 
CCCTCCGCGQ 
ATTTCTCCCG 
GACTGCGCTT 



21 
I 

CCGAAGTCTG 



AGGGGAOGTG 
GCGGGCAOGC 
GTG1GGCCCG 
GGAGTTCTTT 
TGCCGCCGAG 
GCTGGAGCCC 
GCCCTGGAGC 
CGGGTTGACC 
GCGGGGAGGA 
TGC 



31 
I 

GCCGCGCTCT 
GTGGCTGCGC 
GCAGGCCCCG 
AATCTGCCTC 
TCGGGGCCGG 
GAGCTGCTGG 
CCTCTCBACG 
GGCCTCTTTG 
GTCCCGGGCT 
TTGAACGAGC 
CGGGCCGGGC 



41 
I 

TOGACTCGCT 
CGGCGGCCGG 
CGGGGGCCAC 

ACGTGAGCTT 
GGCCCGACTA 
TGTTCCCCGC 
AGCCGCCSCC 
GCTCOCCGAC 
CCTTGAGCCC 
CATTTQGCTT 



51 
I 

GCGCCACGTC 
GCTAGGAGGT 
GGOGATCCCA 
CACGGAGOCG 
GGGOGACCTG 
CGGCGCCGGC 



GGCAGTGGTG 
CAAAAAGAGC 
CCTGTACCCC 
CTTTCGCCCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



Seq ID NO: 274 dna sequence 

Nucleic Acid Accession #i Boa sequence 



1 
I 

CAAAGAGGCC 
CGTCGAGAGG 
CCAGCAGCTC 
CCGGCCCCGA 
GAGGCAGATT 
CGGGGCCTGC 
GCGCAGCCAC 
AGAGCGCGGC 



11 
I 

GGGCTCCAGC 
CTCGGCGGGA 
AAAGAACTCC 
CGGGCCACAC 



CTCACCCCCC 
CAGACTTCQG 



21 
I 

TCOGGGGGTC 
AGCAAGACTG 
ACGGCCTCCG 

AGOGGGACCT 

GCOGGCTCGG 
CTTTGCAAGC 



31 
I 

CCCGCAGTAC 
CCGCCrCGBT 
CGCCCTTCTC 

TCCTGGOCCC 
CAGTGCCCGC 
CACCCCCGGG 
TGGCGGCOGC 



41 51 

I 1 

GGAGGCTCCG GCGGGGAACA 
GCOGGCGCCG TAGTOGGGCC 
CAGGTCGCCC AAGCTCACGT 
CGGCTOCGTG AAGAAGGACG 
TGGGATCGCC GTGGCCCOCG 
ACCTGCTAGC CCGGCCGCCG 



60 
120 
180 
240 
300 
360 
420 



Seq 3D NQi 275 DMA sequence 

nucleic Acid Accession nk_oohib.i 

Coding sequences 74 - - 1651 

1 11 21 31 41 51 

11)111 

AGCCCAGAGA CACATTGGGG CTGACCTGCC GCTGCTGTCA GTGGGAGGCC AGTGGTGCTG 60 

GCCAAGAAGT GTCATGGCTG GTGTCGTGCA CGTTTCCCTG GCTGCTCACT GCGGGGCCTG 120 

TCCGTGGGGC CGGGGCAGAC TCCGCAAAGG ACGCGCAGCC TGCAAGTCCG CGGCCCAGAG 1B0 

ACACATTGGG GCTGACCTGC CGCTGCTGTC AGTGGGAGGC CAGTGGTGCT GGCCAAGAAG 240 

TGTCATGGCT GGTGTCOTGC ACGTTTCCCT GGCTQCTCTC CTCCTGCTGC CTATGGCCCC 300 

TGCCATGCAT TCTGACTGCA TCTTCAA3AA GGAGCAAGCC ATGTGCCTGG AGAAGATCCA 360 

GAGGGCCAAT OAGCTGATGG GCTTCAATGA TTCCTCTCCA GGCTGTCCTG GGATGTGGGA 420 

CAACATCACG TGTTGGAAGC CCGCCCATOT GGGTGAGATG GTCCTGGTCA GCTGCCCTGA 480 

GCTCTTCCGA ATCTTCAACC CAGACCAAGT CTGGGAGACC GAAACCATTO GAQAGTCTGA 540 

TTTTGGTGAC AGTAACTCCT TAX3ATCXCTC AGAC&TGGGA GTGGTGAGCC GGAACTGCAC 6O0 

GQAOGATOGC TGGTCGGAAC OCTTCOCTCA TTACTTTGAT GCCTGTGGOT TTQATGAATA 660 

TGAATCTGAG ACTGGGGACC AGGATTATTA CTACCTGTCA GTGAAGGCCC TCTACAOGGT 720 

TGQCTACAGC ACATCCCTCG TCACCCTCAC CACTGCCATG GTCATCCTTT GTCGCTTCCG 780 

GAAGCTGCAC TGOICACGCA ACXTCATCCA CATGAACCTG TTTGTGTCGT TCATGCTGAG 840 

GGCGATCTOC GTCTTCATCA AAGACTGGAT TCTGTATGCG GAGCAGGACA GCAACCACTG 900 

CTTCATCTCC ACTGTGGAAT GTAAGGCCGT CATOGTTTTC TTCCACTACT GTGTTGTGTC 960 

CAACTACTTC TGGCTGTTCA TCOAGGGCCT GTACCTCTTC ACTCTGCTGG TGGAGACCTT 1020 

CTTCCCTGAA AGGAGATACT TCTACTGGTA CACCATCATT GGCTGGQGGA CCOCAACIGT 1080 

GTGTGTGACA GTGTGGGCTA CGCTGAGACT CT ACTTT GAT GACACAGGCT GCTGGGATAT 1140 

GAATGACAGC ACAGCTCTGT GGTGGGTGAT CAAAGGCCCT GTGGTTGGCT CTATCATGGT 1200 

7AACTTTGTG CTTTTTATTG GCATTATCGT CATCCTTGTG CAGAAACTTC AGTCTCCAGA 1260 

CATGGGAGGC AATGAGTCCA GCATCTACTT GOGACTGGCC CGGTCCACOC TGCTGCTCAT 1320 

CCCACTATTC GGAATOCACT ACACAGTATT TGCCTTCTCC CCAGAGAATG TCAGCAAAAG 1380 

GGAAAGACTC GTGTTTGAGC TGGGGCTGGG CTCCTTCCAG GGCTTTGTGG TGGCTGTTCT 1440 

CTACTGTTTT CTGAATGGTG AGGTACAAGC GGAGATCAAG CGAAAATGGC GAAGCTGGAA 1500 

GGTGAACCGT TACTTCGCTG TGGACTTCAA GCACCGACAC CCGTCTCTGG CCAGCAGTGO 1560 

GGTGAATGGG GGCACCCAGC TCTCCATCCT GAGCAAGAGC AGCTOCCAAA TCCGCATGTC 1620 
TGGCCTCCCT GCTGACAATC TGGCCACCTG AGCCATGCTC CCCT 

Seq ID NO* 276 Protein sequence 
Protein Accession #z NP_001109.1 

1 11 21 31 41 - 51 

I I 1 1 I I 

MAGWHVSLA AHCGACPWGR GRLRKGRAAC KSAAQRHIGA DLPLLSVGGO. WCWPHSVMAG 60 

1058 



WO 03/042661 



PCT/US02/36810 
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WHVSIiAALL LI^MAPAMHS DCIFKKEQAM CIiKKlQRANE U4GFNDS3PG CPGHWDNITC 
WKPAHVGEMV LVSCPBIjPRT ENPDQVWETE TIGESDFGDS NSIJDLSDMGV V3RNCTEDGW 
SEPFPHYFDA CG FD EYE S ET GDQDYYYLSV KALYTVGYST SLVTZ/TTAMV ILCRFRKLKC 
TRNFIHMNLF VSFMLRAISV FIKDWIIjYAE QDSNHCFIST VECKAVMVFF HYCWSNYFW 
LFIEGLYI.Fr LLVETPFPER RYFYWYTIIG WGTPTVCVTV WATIiR1»YFDD TGCWDMNDST 
ALWWVI1CGPV VGSIMVNFVI> FIGHVlkVQ KLQSPDMGGN ESSIYLRIAR STIiLIPLFG 
IHYTVFAFSF ENVSKRERLV FELGLGSFQG FWAVLYCFL NGEUQAEIKR KWRSVIKVNRY 
PAVDFKHRHP SLASSGVNGG TQLS3XSKSS SQIRMSGLPA DNLAT 

ffeq ID NO i 2*77 DMA sequence 

Nucleic Acid Accession #: NM_004000.1 

Coding sequencer 36.. 1193 



1 
I 

AGAAGAAGCr 
GTGTAGTGGT 
CCAACTGGTC 
TCCTATGCTC 
AGGACAAGAG 
AACTGAAAAT 
TGGTGGATTC 
ACCATAACTT 
ATTTCACTGT 
CCAAGGAAAG 
GCTATCAAGT 
TCCATGGGTC 
GGCAGGACAG 
ATAAGGGAAT 
CACTGGCCTC 
CCATCACAGA 
CCAAGATCAC 
TGGGCTATC3A 
TGGQAGGAGC 
AGGGCCCTTA 
ACTTACAGAG 
CTCATG1GQG 
CTTCCTCTTA 
CCTGAAGTAC 



11 
1 

GGCCAAGOAT 
CTTGCTGCTT 
CCAGGACCGG 
TCATCTCATC 
TOAAQTQATG 
TCTCTTGTCC 
TTCTACATCA 
TGATGGACTG 
GCTGATTCAT 



TGAGAAACTG 
TTGGGAAAAG 
AGGGCCAAGC 
GCCATCAGAG 
TGCAGAAACC 
GTCTTCAGGC 
GCGCCTCCAG 
TGATGTOAAG 
CATGATGTGG 

AAGCAGGCAA 
ATTCCCCTTG 
GATCATAGAT 
AATAAAAAAA 



21 
1 

ATGGGAGCAA 
CTCCAGGGAG 
CAGQAACCAG 
TATTCATTCG 
CTCTACCAGA 
ATTGGAGGGT 
CGCTTGGAAT 
GATGTAAGCT 
GAC3TTAGCAG 
ACTGCGGGCG 
GCAAAAGATC 
CCCCTTATCA 
TCCTACTACA 
AAGGTGGTCA 
ACCGTGGGGG 
TTCCTGGCCT 
GATCAGCAGG 
AGTATGGAQA 
TCTATTGACA 
CPlAGCAGTCA 
GATGACCTTG 
CCAGGCTGGC 
TGGACCTGGT 
ATTCATTTTG 



31 
I 

CCACCATGGA 
GATCTGCCTA 
GAAAATTCAC 
CCAGCATCGA 
CCATCAACAG 
ACCTGTTTOG 
TCATTAACTC 
GGATCTACCC 
AAGCCTTTCA 
TATCTGCAGG 



41 
I 

CCAGAAGTCT 
CAAACTGGTT 
CCCTGAGAAT 
AAACAACAAG 
TCTCAAAACC 
TTCCAAAGGG 
CATAATCCTG 
AGATCAGAAA 
GAAGGACTTC 
GAGGCAAATG 



51 

I 

CTCTGGGCAG 
TGCTACT TTA 
ATTGACCCCT 
GTTATCATCA 
AAGAATCCCA 
TTCCACCCTA 
TTTCTGAGGA 
GAAAACACTC 
ACAAAATCCA 
ATTGATAACA 



CTGGCCACAA 
ATGTGGAATA 
TGGGCATCCC 
CCCCTGCCXC 
ATTATGAGAT 
TTCCCTAOGC 
CCAAGGTTCA 
TGGATGACTT 
AGAGAAGCCT 
CTGCCTGGGG 
CTTTGGATCT 
TTTGTXTTCC 
CTCCAGTA 



CAGCCCTCTG 
TGCTGTGGGG 
CACATATGGG 
TGGCCCTGGA 
CTGCCAGTTC 
AGTCAAGGGG 
GTTCTTAAAG 
CACTGGCAAA 
TGGCTCCTTG 
CCTGCTCTCT 
CTCTTOCAAG 
TGCAGCTGTT 



AGCAAGGGGT 
TACTGGATAC 
CACTCCTTCA 



CTGAAAGGAG 
AACGAGTGGG 
AATTTAAACC 
TCCTGCAACC 
TGAAGGATTA 
CCCAQOAATT 
CC TTTCC IGA 
GACTTGTTGC 



Seg ID NOs 27B Protein sequence 
Protein Accossion #s NP 003991.1 



1 

! 

MDQKSLKAGV 
IKfJNKVI IKD 
HSIILPLHNH 
AGRQMIDNSY 
EYAVOYWIHK 
EICQFIiKGAK 



11 21 31 41 51 

I I ! I 1 

WUJCLQGGS AYKXiVCYFTK HSQDRQEFGX PTPENTPPFIi CSHMYSFAS 
KSEVMLYQTI NSIiKTKNPKL KILLSIGGYL FGSKGFHPMV DfiSTSRLBFI 
NFDGLDVSWI YPDQKEHTHF TVLIHELAEA FQKDFTKSTK ERLLLTAGVS 
QVEKLAKDLD FZNLLSFDFH G5WBKPLITG HNSPLSKGWQ DRGPSSYYNV 
GMPSEKWMG IPTYGHfiFTL ASAETTVQAP ASGPOAA0P1 TESSQPXAYY 
ITRIiQDQQVP yAVKSNQWVG YDDVKSKETK VQFLKNLNLG GAMIWSIDMD 
PTPLVQAVKR SLGSL 



Seq ID NOi 279 DNA sequence 

Nucleic Acid Accession #■ NM_0l5l66.l 

Coding sequence x 116-* 1249 



1 
I 

TGCTGGAAGT 
AGAACTGGTG 
CCAGGAGCCA 
GCAAGACCCC 
ACTGCCCCCC 
CCTGGTGACC 
CTTGCGCTGT 
GAGGAACGCC 
CACTADGTGT 
CAACTTCAAC 
CGCTGCACGG 
CAACATTCTQ 
GGTAATCGCA 
AGTTTCAGGC 
TGCCATTGCC 
TGCCATAAGC 
CATCATGAGA 



11 
I 

CCCTCACCCA 
ACACGTGGCT 
TTCAGAQAGQ 
GCCAGCTATG 
TGCTTCAGCC 
TCGGGGTTTT 



AATGTGATTC 
TTAATTTGGT 
CTCATCCTGC 
TCCAGCBAGO 
OACGAAGTGG 
GGCATCTCTQ 
CCACACCTCT 
AGTCATGTOG 
AGCCTCACGT 
ATCGTGGAGA 



CGOCATCCAH 
CCAGAACGGC 
CGACaAGGAG 

ggaaaccggg 
ccctgagcta 

TGGGGTTTTC 
GOCTCAGTGA 



TGCGTGCGCT 
CCGCAGGAGC 
AAAGCCTGGA 
TGGCAGOGCC 
CTGCTTCTAA 
ACTTCCTAGG 
GATCAGTCAG 



21 
1 

GAGACCAGTG 
GTACA.TTCAG 
AGCTGGCCTA 
CCCCAGACGC 
ACAAQACGTG 
CGCTGTACCT 
CTTGCATCCG 
CCAACTTTCA 
TTGGATGCAA 
TGCTCCTGCT 
AGGACTGCAA 
CATTTCCTGC 



CAGTGACGTT 
CAGCAGAOTG 
CTCCGCTGCT 
TGTTTAAGGA 
TGCTAGTGCT 
TCAAGGTCAG 
GCCTGGCTGG 
QAGGCGTCGT 
CAGCCTGGCC 
CACCTCTTTT 
GTAGTTTAAT 
CTCCGAGTGG 




GGGGAAOGTG 
CnCGQCAATT 
GATATTGTTT 
ACTAGTCCTG 
GGAGCTGCTC 
GAAAAAGAAG 
TCGGGTCCTG 
GGGGATCATT 
CTTTTGGATC 
TCOCAGCAAG 
GTTCACAGCC 
TTACCCGCCA 
CCTGCTGCAQ 
TGCAAQGCTG 
GGAGGTGGCC 
GGTGCAAATG 
CCARGCATGG 
CCCTTGTGTQ 
TTTAAAATAG 
CTCCCGTGTC 



41 
I 

CAGAGCAGOG 
GTGTCCCCAA 
CCCACGCTGG 
GACCTGCAGC 
GTQCXGATGG 
TTCCCGGCTG 
GTGAGCTTCA 
GTTTCCACGT 
AACCCATCAG 
ATGGCGGCCA 
GGCTCCATGT 
AAATCrrACT 
GCCCTGAACG 
CTAGTGGCCT 
TGTCTGGTGG 



GCCATAAAAG 
GCCGGCCTCA 
CAGGGTGCAT 
AGOAQCCCCC 
GCCCAGTGAC 
AAACGCACAA 
AGGGCAAACC 
GOCAATGTTG 
GTAACAGCAG 



120 
1B0 
240 
300 
360 
420 
460 



60 
120 
1D0 
240 
3O0 
360 
420 
490 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
13 B0 



60 
120 
180 
240 
300 
360 



51 

\ 

GGGGAGATAA 



AGCGGGGOCG 
TGTCGAAGAG 
GGAGCTGCCT 
AGATGGATEA 
CCSGTCTCCAG 
TTGCIGMAC 
CAATAAACAT 
CGGTGATCAT 
CTGACAGCGC 
CAGTCGTCGA 
TGGATGACTC 
DCTTTCCRAG 
AGGTCCTGAT 
TGTCATTCAG 
CATCCTACGA 
ACACGQGCAC 
CCTGGGACAC 
TGAAGGAGTT 
CCCCAGACGC 
CCOCTAATCG 



GCTAGTCTGT 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



1059 



WO 03/042661 



PCT/US02/36810 
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GCAACTTCCC 
CTGGCCCCAG 
CTGACCAGAC 
GCTTAGGGtC 
GGTGCACCTG 
TTGGCGCCTC 
GCGACACAGG 
TGCCCTGTCT 
CCCAOSTCCT 
CGGGCACTGT 
TCATGCCCAG 
GGGCTGCATC 
GGGAAGCTGA 
GGTCAGAQCC 
TTGTCTTGAT 
GGGGTCTTAG 
GGTGTGAGAA 

GGGCGACTGT 
TTCACAATGG 
CGCCGTGGGG 
GTOaCCXGGG 
CGGCCTGCAG 
GAGTGACTCT 
CAACCATTGA 
CAOGCCCTCT 
CTTTGGCCAC 
CCGGATGCAG 
GaaOGGTCTO 
TCTTCTTGGG 
TTTTTQCTQ3 
ATGGGGTTTA 



AGGCCGAGGA 



TGTGGGTCCC 
TGTTCCCGQGT 
CCCCAGGCTC 
TGTCCTGGCC 
CTGCOGCTGC 
CCCTACTTAG 
GGGTGCTGTC 
CATGCCCAGC 
CTTGGAGTGC 
CAAGGCTTTG 
GCCCCTCCCT 
CAGGAATTCT 
CCTCACTGAG 
GGTACAGCCC 
TAACACTGGC 
GAGGGCGCCT 
GAAACTGGAA 
TTTTGTTTCT 
TGTATTGACT 
GCCATCAAGA 
ACCCTGCCAG 
GGCTGCCAAG 
AGGGGGATGC 
CTGCAGTCCT 
AGCCCCAAAC 
ACCCCACCCT 
CTCCTGGGAA 
GGCAGACCAG 
CTTGCACTTT 
AAAGAACTGT 
TTGCAGACGT 



AGGGCOCCCG 
TGCCTCTCTC 
TGCGTCTCCT 
ACTCAGTCAG 
AGGCTGCCCA 
CGGGAGCCGC 
TCTCCCAGGT 
ACCCAGTCAT 
ATGCCCAGCT 
TTGGAGTGCC 
CACGTGGCCG 
TAAACOGGCT 
CCTGTGTTTG 
CACAGGTTCA 
AAGGCCCCAG 
AGGCGGTCAC 
TTTGGGTAGT 



TGCTGTTACT 
TTCCTTCTGA 
GGTCCCCCAT 
TCAAAGAACC 
GGGGTTTGAG 
ATACCTTCAT 
AGGCAAGACA 
CGGCAGCTCC 
AAGCGCCACC 
GTCTGGAOGC 
TGGGGTGGGA 
AGAGCTCAAG 
CTTATTTTCT 
CTCTTTCAGT 
G 



GCTCGGCCTC 
ATGGCTTGGG 
GCCCACTCTG 
CCCAGTGGGA 
GGGGCTCXTC 
AGGGGCCCCT 
GAAATCCACA 
TCTAGAGGGA 
TGGAGTGCCA 
ACATGGCCGC 
CTGCTBTGAC 
GGACCACGTC 
CTCCCATTAC 
CCCAGCGCCC 
GGCAGCGQTC 
TGCCCACCTG 
GCCATGGCCA 
AGCTGACGGC 
TTGATAATTA 
TCTGAGAAGA 
GGGCTTQCTGG 
AGGAGGCCTG 
GACCCTCCCA 
GGTGTTCATG 
CCTTCCCAGC 
GGAAGGACAC 
GTGGGAGAGG 

GCCACAGGGA 
TTTCAGAGTC 
TATTTTAGAG 
CTACATTTTT 



TTGAGAGCOC 
CTGGAGTGGG 
ACCGGGCTTC 
TCTTACCCAC 
CTGGACAGTG 
OCTCCAGAGC 
CCAGTCCACG 
TCCAOCGCCA 
CGTGGCCGCT 
TCCCCACGTC 
AGGGA&TGTT 
TCCCTGGCCC 
TCAAAATGCA. 
TCTACCTCCT 
TTCTCCATCT 
CCAGGCTGCA 
GQAGTGGGTT 
AGTAAATCCA 
CTTTOCAGCA. 
CATGAACGTT 
AAAGGCCCGG 
GGAGATGCAG 
GGTTTCCCAC 
ACAAGTGGAA 
TGCTCCTAGA 
AGTCAGGGGC 
AGAGGCTGCT 
TCCCTGCAGC 
CGACCCGGGG 
AGAATTAGGC 
CGCTTAAAAA 
GTTTAATAGG 



CACCCCTGAA 
CTCTCTGGAC 
CTCCCTCCAC 
TTCCCTGCAA 
AGAGCAGGGC 
CTGGGCGCAA 
CCGGGTCGCC 
CACTGGCCGG 
GCCCACGTCC 
CCGGGCACTG 
CTTGGGGOTG 
CAGTGACCGG 
GGACAGATCA 
AGCAAGTACT 
CCQ CTGTTTT 
GGGACAGTTG 
TCCCTQCGTC 
CAGTGAGTTG 
GGTGTTTTCC 
TTCTCXTCAC 
AGATGCATCT 
CTGGATGGGG 



TCATTATTTT 
GGGGACAAGC 
OGGGCAAACA 
GTCACTGGTA 
TTTGAGGCTG 
CGGGCTGATG 
ACTTOGAACG 
ATCGGGAAAA 
CTTGAGCAAT 



1560 
1620 
Iff SO 
1740 
1800 
I860 
192D 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2S80 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



Sec. ID MOs 280 Pirate In. sequence 
Protein Accession #i np_O55901.1 



1 11 21 

1 1 1 

MTQBPFRKEL AYDRMPTLEH GRQPPASYAP 
CLLVTSGFSIi YLGNVFPAEH DTTLRCAAGSC 
VTTTCLIWPG CKLVLNPfiAl NINFNLXLI/L 
SAHTLDEVPF PARVMCSYSV VEVIAGISAV 
PSAIASHVAA ECPSKCLVBV LIAISSUTSP 
YDVLliIaLIrLI* VLLXiQAGLNT GTAIQCVRFK 
EFDKEKAWHA VWQMAQ 



31 41 51 

1 I I 

UAXPSDLQLS KRIiFPCFHHK. TWVFgVLWGS 
IPSATVSFTV SRRNANVIPN FQILFVSTFA 
liDELIiMRATV IIAARggSED CKKKKG9MSD 
LGGIIALNVD DSVSGPHLSV TFFWILVACF 
I^LFTASGYl^S FSIMRIVEMF XDYPPAIKPS 
VSARLOGASW DTQKGPQKEUj AGEVARSPIJC 



Seq ID NOs 281 DMA sequence 

Nucleic Acid Accession «: NMJ>04S18.i 

Coding sequence: 43.. 2577 



1 
1 

GCTGAGOCTG 
AACGGCGGCG 
CTGGACCCCG 
GCCCCCAAGC 
CCCCCCAAGC 
CGGCCGCGOG 



TACATCCTGG 
GCCGGAGGCT 



OGQATGATCC 
GCCCACAGCA 
TGGTTCCTGQ 
GCACTCTGGT 



11 
1 

AGCCCGACCG 
TAT AC 

GCGCGOCCGA 
GCGGCAGCAT 
GCAACGCCTT 
GCTGGGCGTT 
CTGTGTTTTC 
AAATCGTGAC 
GCTGCTGCXX3 
TGAT TGACAT 
ACGTCTTTGC 
GCATGGACCG 
AGGAGCTGGT 
TGTACTTGGC 
GGGGCCTGAT 



CTGCCTGCAG 
AAGCACTTTG 
TACGCCACCA 
GTCACCGTGC 
CTCAAGAGTA 
CAGAAGGTCA 
AAGGGGTCCC 
GACAGCCCCA 
GCTTTCOGCA 
GAGGACATTG 
GGCCTCAAAG 
TTCAAGGAGA 
CACCTGGACA 
CGGGGCCCAG 
GAGQACCCCA 



GCATCTTGGG 
AGAAGAGGCG 
AOCTCTCGOG 
CCATGTACAG 
AATCTGGACT 
GT1TGAAAGA 
CGCAGGCCCA 
GCAAGGTGCC 
TCAAGGGTGC 
TGGATGACAA 
TCAGCATCAG 
GCCTGCGGCC 
TGCTGTCCCG 
CGATCAGGGA 
GCATGATGGG 



21 
I 

CCCGAGCGGG 
CTCCACCCGG 
CCTCAGCAAA 
CTACCGCAAG 
CATCTACCAC 
CACCATCAAG 
TATOGTGGTG 
GTACCGTGGC 
CATGGTGCTC 
CAjCATCTGCG 
GOGGGGAGGC 
CACTGCCTGG 
AGAGAAGGGG 
CACGCIGACC 
TGCGGCAACC 
GTCTGGGTTT 
GAACCOGGCA 
CACAGACCTG 
ACTTATCXZCC 
CGCTTTCAGG 
TCGTGTCTTC 
GACTGTGAGG 
CAAGABCTGG 



31 
1 

CCGGCAGGCA 
GAGAAGAAGC 
GACGGGGCGC 
CCTCGCGCGG 
CTGCAGAATT 
GCCTACGTGT 
GAGTATGAGA 
TTTGGCQTGG 
TGGAGGGGGC 
ATCGGCCCCA 
CTOCGGAGCC 
ACCTGGAAGC 
TACATCGGCT 
GAGAACGACC 
ACCATTGGCT 
TTCACCCTCA 
GCCCTGAAGG 
GCAQGCCTGA 
CACTCCACGT 
CCQCTOAACC 
AAGGACCGCC 
TCCAGCCCCC 
CGGTCACCGA 
AGCTTCBOOG 



41 51 

1 I 

CCATGGTGCA QAAGTCQOQC 
TGAAGGTGGG CTTCGTGGGG 
TGCTGATCGC CGGCTGCGAG 
GCGGCGCGGG CGGCGGGAAG 
TCCTCTACAA cgtgctggag 



GAGCTGOCCC 
AGCCGTGTGT 
CTACGACGTG 
AATTAAGAGC 
CAAGGACCGC 
ACGGCTCGGG 



TOCGAGTTTG 
GTCATGCGGT 
ATGGACGTCA 
CTGCAGTCCA 
ACCAAGGGCC 
AAGGTGGAGA 



AGAGCTCGGA 
A0TACTTOGT 
GGCTCAAGTT 
TTGCGOTGCT 
TGCGCTTCCT 
TGCTGGGCTC 
TCCTTTGTCT 
ACTTTGACAC 
ACGGGGACAA 
TOGGTGTCTC 
TTGAGGAGCA 
TCCAGTOGGC 
GGCAGrACTA 
AGCTGGAGCT 
CGCCGGAGOC 
OAGGCGTGGC 
GCGCCGACCA 
ACCQCAGCCG 
AAGAAGCAAG 
•TGACCGAGGA 
TCCTGOTGTC 
TCGAGCAGTA 
GAGTGOACCA 
CGGCCGAGGC 
AGCAGGXCTT 



GGGGGCCCTC 
GCGGATCTGG 
TGCCCGGAAA 



60 
120 
180 
240 
300 
360 



GCAGATTCTG 
TGTQGTCTAT 
CATCCTGGCC 
CTACGCQGAT 
GTAOCCCCAG 
CTTCTTCGCX3 
GCACAGGCAG 
CTGGAGATTC 
CGAGCGAACG 
GCTGAGGAAC 
GTCTCCAAGC 
TGCCAAGGGG 
GAGOCTCGAG 
OGCACGCCAG 
CCTCCCCGGA 
CCTGACOCCG 
CAAGCGGAAG 
CTCAGCCOGC 
GATCGTGGGG 
GGAGCTGCCC 
OTCCATGGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
1560 
1620 
1600 
1740 
1800 
1860 



1060 



WO 03/042661 



PCT/US02/36810 



AAOAAGCTGG ACTTCCTGGT GAATA.TCTAC ATGCAGCGGA TGGGCATCCC CCCGACAGAG 1920 
ACCGAGGCCT ACTTTGGGGC CAAAGAGOCG GAGCCGGCGC CGCCGTACCA CAGCCCGGAA 1900 
GACAGCCGGG AGCATGTCOA CAGGCACGQC TGCATTGTCA AGATCGTQCG CTCCAGCAGC 2040 
TCCACGGGCC AGAAGAACTT CTCGGCGCCC CCGGCCGCGC CCCCTGXCCA GTGTCCGCCC 2100 
5 TCCRCCTCCT GGCAGCCACA GAGCCACCCG CGCCAGGGCC ACGGCACCTC CCCOGTGGGG 2160 

GACCACGGCT CCCTGGTGCG CATCCCGCCG CCGCCTGCCC ACGAGCGGTC GCTGTCCGCC 2220 
TACGGCGOGG GCAACCGCGC CAGCATGGAG TTCCTGCGOC AGGAGGACAC CCOGGGCTGC 22 BO 
AGGCCCCCCG AGGGGAOCCT GCGGGACAGC GACACGTCCA TCTCCATCCC GTCCGTGGAC 234 D 
CACGAGGAGC TGGAGCGTTC CTTCAGCGGC TTCRQCATCI CCCAGTCCAA GGAGAACCTG 2400 
10 GATGCTCTCA AC3W3CTGCTA CGCGGCCGTG GCGCCTTGTG CCAAAGTCAG GCCCTACATT 2460 

GCGGAGGGAG AGTCAGACAC CGACTCCGAC CTCTGTACCC CGTGCGGGOC CCCGOCACQC 2S2D 
TCGGCCACCG GCGAGOGTCC CTTTGGTOAC GTGGGCTGGG OCGGGCCCAG GAAGTGAGGC 2580 
GGCGCTGGGC CAGTGGACCC GCCCGCGGCC CTCCTCAGCA CGGTGCCTCC GAGGTTTTGA 2640 
GGCGGGAACC CTCTGGGGCC CTTTTCTTAC AGTAACTGAG TGTGGCGGGA AGGGTGGGCC 2700 
15 CTGGAGGGGC CCATGTGGGC TGAAGGATGG GGGCTCCTGG CAGTGACCTT TTACAAAAGT 2*760 

TATTTTCGAA CAGGGCACTC CCAGGCCCTG TCGCCATTGA GGTGCCTCCG CTGGGCTGTC 2820 
TCCTCACCCC TCCCTGTGCT GGAGCCTGTC CCAAAAAGGT GCCAACTGGG AGGCCTCGQA 2880 
AGOCACTOTC CAGGCTCCCA CTGOCTOTCT GCTCTGTTCC CAAAGGCAGC GTGXGTGGCC 2940 
TCGGGCCCTG CGGTGGCATG AAGCATCCCT TCTGGTGTGG GCATCGCTAC GTGTTTTGGG 3000 
20 GGCAGCGTTT CACGGCGGTG CCCTTGCTGT CTCCCTTGGG CTGGCTCGAG CCTGGGGTCC 3060 

ATGTCCCTTT GCGGTCCOGT CAIGGGGCAG GGAATCCATA GOGGGGCCCA CAGGCAGGGG 3120 
TATGAGTGCG TCCCACCCAA CGCAGCACCA GCOCCGGCCA CCGCTCCCCG TGTCCCCAGT 3180 
TCCGTCTCAG CTACCTGOAC TCCAGGACCC TOGAGAAGGG AGACCXGGCA GTGGAGGGAG 3240 
GCTGTGCrrar GTGTCGCCCT GCAGGTGTGA CCCCGCCTGC TCTTTCCTCC CCCGCCAGGr 3300 
25 GTGGCCCCGC CTGCTGTTTC CTCXTCCCACC AQTATGGCCC CACCTGCTCT TTCCTHCCCC 3360 

CCCAAGOTGT GGCCCCACCT GTTCTTTCCT CCCCTGCCGA GGTGTGACCC CACCTGCTCT 3420 
TTCCTCCCTC CCRGTATGGC CCCACCTGCT CTTTCCTCCC CCGAGGTGAG GCCCCGCCTG 3480 
CTCTTTCCTC CCATGGGAGC OSCTGAGGCG TGCGCACCTG GGCACAGGTT GGGGCTCTGC 3S40 
AGGATGAGOA AGACAGGCCA ATCCCTTCCC TCCCAGAAQC TGGCCGCCCA GCAGGAGGGA 3600 
30 CTGAGGCCAG ACTCATGTCC AGCAAGGAAC GTGTGGTGTG TCCCCTGGGA AGTCTCTGGG 3660 
CCCTGGGAAQ AGGGAAGGTG CACOTCCTGG GATGGTTGCG GGGCCCTGTT TTGGGAGACA 372tl 
AAGGGGTAGA GGGTCTGTCT TGKGCCCCCC CAGACTCTAG OCCGAGCAGT GCAGCCACCT 37B0 
ACTGCCCCAC CTCAGAlGAAG TGCAGCX3GGA AGGAGGCTGG AGGTGGTGCG GCQCTGCCTC 3840 
GGGTGTCTGC GTGAATGAGC GTGGOCAAGG ACCAGTGCCA CCTCATGGCA AAGAGCTCCC 3 900 
35 GCAGTGTTTG TTAGAGTGCA CATCCTACGT GCCCACTGGC ACACACACGT GCTCACATAC 396D 

ANGTCHGMGT ACAGGNGTAC ACATGCANGC TTGCACACAT GCACACAGAC CACATAGCAC 4020 
ACATGTGCAC TGACCACACC TGTATAGACC ATGCACAGTA CACATACGTG CATACACATG 4080 
CCTGCATACA GGCATACACA TGCACGCTTA CATGTACACG TGCACRGATC ACACACATQC 4140 
ACACACGTGT AGCTCACACA CAGTATACAC ATACACRAQT GCACAGACCA CACACAGCAC 4200 
40 TAACACATGC ACACACAAAG TGCATAGGCC ACACAGCACA TGCACACAGG TGCACAGACC 4260 
ACACAGCACA CACAAGTGCA CAGAGCRCAC TGCACACATG CACACACACA CGCOTGCATG 4320 
CACACTCCTC GCACTTCCAB CCTTGGAGCC CTTCTGTCTC TGGTCTTTCT CTTTGACCCT 4380 
GCTGAGTGTA AGOTGCCTGG GGAGGGGCTA CAAGGAGTAA TTGTGGCTTT AGSGGTCGTG 4440 
GTGATGCTGG AATGTCAAGC GCCGTCGTGG GGTATCCGAC TGTCCGOGCT CCTGGTCCGC 4500 
45 AGTGGCAGAG CGCCAGGCAG AGCCAATCAG GGTCTCGTGC TGCCCTTCCC CCCCACAGCC 4560 
TGGCAGCCAT CCAGAGGAGG GGCTCTACCA GATGCCAAGG TGCCCCGGTG TCTGTATGGG 4620 
TGTCCGGTTG OGTCCTGTGT TTGGTCTGCC CTGGAGGTGG CTQOGCCCTC CTGGGATGGG 4680 
TGGCTCAGCC TCGAATCOCA QGCCCCAGCC CAGGCAGGTG CTGCTGCCTG TTGTGGTTTC 4740 
. CTGGCCCAGC TTCTCCTTCT CCCTCTGCAT AAAAT CACAO TCCGTGAGTC TTCCAGCTGC 4800 

50 CACCACGGCT GGGACACGCT GGGGGAGGGC TCCTCCCATG CCTCCTGCAC ACAGCCGTCT 4860 
GAGCAGGGCA GGTGCCAACA CCCCCCACCG GAGACACGCT GC<X:CTCAGC GATGOCCCTA 4920 
CCTTTTGGGG GGCCTCGTCT CAAGCCCCCC CTTGGAGGCT GAAATCACCC CAGGCACTGT 4980 
GAGGGCTTCT CCAGGGGGAC ACCCTTTGAG CTGTGGGTCT GATCACCCCA AGTCCCGCAC 504O 
ACGGAGGAGA GGCACAQCCA GGGCGTGTGG TTTAATGTTT GCCCCTTCGG GGCTGQAGGT 5100 
55 CTCAGTGTTT CTAGATTCCA GACCCTGCTG CCAGAGAGAC CTGCTGCCGG AGAGAAGGGG 5160 
AGGAGGACTC CAGCTGGGCT CGGTCCCCCA CAGTCAGGGA CCCCCATAAA GGACACCCCC 5220 
TTCTCTCTAG AAAGAGCTGG GCTCTCAGCT ATTTCTAGTT GCTTCCCAGA AGCCGAGGAG 52 B0 
CAGAAGGAGC TGTGAGAGCT TTGCAGAAAC GCCCTTGTCC CCGCCCTCCT GAGCTATGAA 5340 
TGCCGTACAG AGCAGAGGCT GGGGCATTGG CAAQATCACA GGTTGATGCT GCACAGCCCC 5400 
60 ATTGACACAA ACCCTCAAAG CAGADGTGAG AGGGACGGTT CACAAA GCTT GGACCTGCCG S460 
TGGAGGGTGC CCGGCAOACG TGGCGTGAGA GGGACGGCTC ACGAGGCTTG GACCTGCTGT 552 0 
GGAGGQTGCC CAGCAGACGT GGTGTGAGAG GAACGGCTCA CGAGACTTGG ACCTGGTGGA 5580 
GGGTGCCCAG CAQACGTGGT GTGAGAGGGA CGGCTCACAG GGCTTGGACC GGAGAGAGAT 5640 
C3GCTCATGAG ACTTGGACCT GCCGTGGAGG GTGCCCAGCA GACGTGGTAT GAGAGGGATG 5700 
65 GCTCAGGAGG CTTGGACCTG GTGGAGGGTG CCCGGCAGAC GTGTGAGAGG GACGGTTCAC 5760 
AAGGCTTGGA CCTGCCATQG AGGGTGCCCA GCAGACGTGG TGTGAGAGGG ACAGCTCACG 5820 
AGGCTTGGAC CTGCCGTQGA GGGTGCCCAG CAGGGGGCTG AGCTCTGAGG GGTGGGTGCT 5B80 
CAGTGCACGG GTGCCCCCAG TOTCCTCTGA TCCTGTOCGG TGCCTCCCCC AACCCCCACA . 5540 
CCCATGCAGA ACTCOCAGGT CACATGCACG TATGTCCAGG GCATGGGGGT GGCGTGAAGA 6000 
70 GGCCTGGTCA GGGCCTTTAG GGGCTGCAGG ACGGAATGGC CACCTGGGGA GCCTGTGTGG 6060 
CTGTGCOGGG CAGCCATCCT GCATTCCCAC CCfiGCGCGCA GTCTCClACCX CGGCCCCAGC 6120 
AAAGCGCTAA GCAGCCGGAG AGACAGCCAG GGCGGCTTCC TGAAGGATGT GGGATGGTGG 6180 
ACTCCGGGGT CGAGGGAATA OGCAGGTTCC TGTCCTCCGG GAGACCTAGA GAAGCTGCAC 6240 
ACCCAGGAOC TTTCCATGAC CCGGGAGCAT GAGTGAATGG GGGGTTCCAG TTTOCTGAAC 6300 
75 TTTGCTGTCT TGTAAGGGTG GGGGCTGACG GCCGACCCTG GGAGGAQGTG ACACCGCAGG 6360 
GGGAGGTTGT GGGCAACGGT GGAGGAGQAG AGACGGGAGG GGACCATTTG GOATGGAGGG 6420 
GCCTCTTCAG AGTTTTAAAA GGCGTTTGTG GGGTGGAGTT GAGTGTGCTC TGGGCTTGGA 6480 
CACTTGCCGT GGTGCCCCTG GCTGGCCX3AG GAGACTGGCT CTGGCCAGGG CCCCGTCCTG 6540 
AGAGGTCCTC AGOGTCTGAC TCTCfflGCCAG GCGCCAGCAA GGAGGGGCCG GTCCCCGOGG 6600 
80 CTACCAGGCA GGCACGTQCA CATCGCCATC GCCACACGCC AACTCCGCCT GGGTTTTACA 6660 

AA GTQGTT GC CTTAATGCAT GTGGACAGGA ACTCCCTGAG QTCGCCCCAT GGCCCCTGGC 6720 
TGTGCCAGGT ACGGACGCCC TGGACX3CTGC GAACAGGTGG GGCGGGCGAG GGGCCCAAGG 67B0 
GACGGGCICC AGAGACACGC GCAGGGCAGG AGGGGTCTCA CGGAGGGGTC TCGCACTGAG 6640 
CTGGTGGTCC OGCTGGACGC CATCCCTCTG CCCGGOATCC ACACGGCCCA 6900 

1061 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CGTGTGCCOG 
GTGGCTGCAG 
CCAAAQAACA 
TCCTTGTGCC 
CTCAACTTTG 
TCTAACAGAT 
AGAATGCGCC 
CCGTGQAAGQ 
TGTGCACTCA 



CCATGCCCGC 
CTTCCCCATT 
TTTCATAATG 
GTAGTGACCA 
TGCACCGTCC 
TAGCTGAGAT 
AGTGCGGGTT 
TTCTGGGGAT 
GTAAAACGCA 



GCCCCAOGCC 
TGCGCOGTTG 
CCTTCAGTAC 
GGCCTTTTTT 
ATTTTCTAGG 
ATTAACTTAC 
TGCQGGGGAC 
CCCTGCTGCC 
GTGATTCAAA 



ATTGCAGTCT 
CCTCTGGCTG 



TGGTGGATGT 
GATAGACGCC 
TCACACGGAC 
TTCGGGTGTG 
ACGGGGACGA 
AAAAAAAAAA 



TCCATOCTCT 
TCTGCACTTT 
TTCTGACCAT 
GTTACCCCGC 
CAGGGAATGA 
AGGTTGATGC 
GGGTCCTGCG 
GTTCGOACGC 



GGCCGTGAGB 
TGTTCATGCT 
TTTGTATGTG 
ACACTTCAAT 
ACTCTAGTTT 
CAGAGCCGTA 
GCCDCGATGG 
CAGGTGGACC 



6960 
7020 
70B0 
7140 
7200 
7260 
7320 
7380 



seq ID NO: 282 Protein sequence 
ProteixL Accession Its NP0045O9- 1 



1 
I 

MVQKSRNGGV 
GAGAGKPPKR 
SSEGAbYILE 
AVXiAAGSQGN 
LCLIIaASFIiV 
GVSFFALPAG 
QYYERTVTVP 
GVAAKGKGfiP 
BASIsPGEDIV 
EQYSAGHLDM 
QVLSMEKKLD 
ZVRS688TGQ 
ER9LSLAYGGG 
QSKENUJAIiN 
GPRK 



XI 
I 

YPGPSGEKKI* 
UAPYRKLQHF 
IVTIWFGVE 
VFATSALRSIi 
TLAEKGENDH 
ILGSGFAMCV 
MYRLIPPLNQ 
QAQTVRRSPS 
DDKSCPCEFV 
LSRIKSLQSR 
FLVWIYKQRM 
KNFSAPPAAP 
KRASMEFLRQ 
SCYAAVAPCA 



21 
1 

KVGFVGLDPG 
ItYMVLERPRG 
YFVRIWAAGC 
RFIiQILRMXB 
PDTYADALWW 
QEQHRQKHFE 

ADQBLEDSPS 
TEDLiTPOLKV 
VDQIVGRGPA 
GIPPTBTEAY 
PVQCPPSTSK 
EDTPGCRPPE 
KVHPYXAEGB 



31 
1 

APDSTRDGAL 
WAFIYHAYVF 
CCRYRGWRGR 
HDERGGTWKIj 
GLITLTTIGY 
TCRRNPAAGLI 
SGLAFRKDPP 
KVPXSKSFGD 
SIRAVCVMRF 
ITEKDRTKGP 
FGAKEPEPAP 



41 
1 

LIAGSEAPKR 
U^VFSCLVLS 
IiKPARKPFCV 
LGSWYAHfiX 
GDKYPOTW23G 
OSAWRP YATN 
PKPSPSQKVS 
ESRARQAFRl 
LVSKRKFKES 
ABAELiPEDPS 
PYHBPKD3RE 
OTSPVGDEGS 



GTLHDSDTSI 
CDTDSDIiCTP 



51 
I 

GSIliSKPRAG 
VF6TIKEYEK 
IDIMVMASI 
ELVTAWY1GF 
RLIiAATFTIil 
LSRTDLHSTW 
LKDEVF6SPR 
KGAASRQNSE 
lAPYDVMDVI 
MMGRLGKVEK 
HVDSHGCIVK 
t-VRIPPPPAK 
ERSFSGFSIS 
EGPFQDVGWA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
700 
840 



Seq ID NO: 2838 DMA sequence 
Nucleic Acid Accession #i AF152496.1 
Coding sequence: 1..2391 



ATGGAGGCGG 
CTGGGAGGGT 
AA0GGCTTTT 
GOGAGGGGGG 
ACAGGTGATT 
CCATGCATAC 
CTCOGTATCA 
AAAATCCTAG 
GATGTGGGAA 
CTCACTCGCA 
GATCGGGAGG 
OCTCGGTCTG 
GAATTTGCAC 
ATTGTCACTG 
TTTTTTCATG 
ATGCAACTDG 
GCCAAGGATG 
AACGACAACC 
GGAGAGACTG 
GTGATGTGTT 
TACACCCTAG 
ATCACTATCA 
GTCTCCGACG 
CGCQAGAACA 
GGCAOCAAGG 
TCTTCCCTGG 
TACGAGGCCC 
TTGAGCAGCG 
QTGCTGT AG C 
GAGC0GGGCT 



11 
I 

GAGGAGAGCG 
CTCTGGCTGG 
TAATAGCCAA 
CCCAAGTTGT 
TGCTCCTGAA 
TACATTTT CA 
TAGATGTAAA 
AAAGCACTCT 
GAAACAGCCT 
GTCGTAGGGA 



GOACAGCCCA 
AGCCGCTCTA 
TCTOQGCTTC 
CTTCTGAAGA 



GCGGAGGCCT 
CACCGGAACT 
TACTGGCXGT 
CCATTGAGAA 
TGTCAGAAGG 
CTGACCTGG3 
TCAATGACAA 
ACAGCCCCGC 
CCCAGGTAAC 
TCTCCATCAA 
TGCAGGCGTT 
AOGCGCTGGT 
OGCTGCAGAA 
ACCTGGTGAC 



CTCCTGGTGG 
GCCCAGGCCG 
TTOCTCTTTT 



TGCGCACCGC 
TCAAGGACAA 
ACGGCTTCTC 
ACTTGCTCAC 




GGCACDGGGA 
ACAAATGAGT 
AGGGTTAGCG 



21 
I 

ATTTCTTAGA 
GTOCGAGTC& 
CCTAG CAAAG 
GTCCAAAGGG 
TGAGAAATTG 
GATATTACTG 
TGACCATTCT 
GCCAGGAACA 
CCAAAACTAC 
CGGAAGGAAG 
ACTCAGCTTA 
GATAAACATC 
TGAGGTTGCA 
TGACTTAGAT 
AATTCGCAAA 
GAATTTTGAA 
ATOCGGAAAG 
GACCTTGTCT 
TTTCAGTGTT 
CAATCTC2CCC 
CGOGCTGGAC 

gacacccagg 
cgcccccqcc 
cctgcacatc 

CTACTCGCTG 
CGCGGACAAC 
CGAGTTOGGC 
GCGCGTGCTG 
CGGCTCCGCG 
CAAGGTGGTG 
CAAGGCCACG 
CAGGCXGCTG 
TGGCGAGCCT 

ccagccctac 
cgtctacctg 

GTTCGTGGCG 

gcccgagggc 
gagctagcag 
gaagccaatt 

CAOTTTCAGG 



31 
I 

CAAAGGCAAG 
AGA06CTATT 
GATCTGGGAC 
AACAAACAGC 
GACOGGGAGG 
CAAAACCCTT 
CCGGTATCTCT 
GTAATTCCTT 
ACTATCACTC 
TACCGGGAAC 

acgctcaccg 

CAGGTCITAQ 
GTTCTAGAGA 
ACAGGAAGTT 
ACTTTTCAGC 



41 
I 

CTGTGGCTGA 
TAAGGGTAGA 
ATTTTCAGCT 
AGCTATGCGG 
TGCAATTCGT 

ttgaaaatga 
tgggaaatgc 

CGAATTCCCA 
TAGTACTGGA 



TCTACAGTCA 

tcagtaaaca 
tCttgatctag 

TTCTTCCTGA 
AGAGAGACCA 
CTGAAAACCA 
TTCAOCCAAA 
GGCAGTGTCA 
CTGCCGCCCC 
GGCCACCTGT 
GTGGGCGCCA 
GTGCTGGACG 
CCCTOCACOG 
GCGGTGGACG 



ATATAAACGA 
ATACCCCCGT 
TTGGGACAAT 
TAAATCGAAT 
GTTATGAAGT 
TAGTCCAGGT 
GCCCTATTCC 
ACTCTGGAGA 
AACCATCTGT 



SI 
I 

GGAAAAAGAG 
GGAACTGGCC 
CAGTCATCAG 
CCCCACAGAA 
TACAAACGAG 
AATGCATCTG 
TGAGQACTTG 
CTTCCACGTA 
TAAAGCGCTC 
CGGCTCTCCC 
CAATGCACCA 
TAACTCTGTC 
ATCATA1GCA 
HACTGGTGAT 



AGTACAACAT 
TCTCCTACAC 
GCGCCACAGA 
AGGACCCGCA 
TTGCCCTCAG 
CAGACCGTGG 
CCAACGACAA 
AGCTGGTGCd 
GCGACXCGGG 



AGCGAGGGCG 
OCGOGCTOGG 
CTOCCrCTCC 
GTGGTGGCAT 




ACGCGGCCAA 
CCACGGCCAC 
OGGAGGCGGC 
TGGCCTCGGT 
GCAGGAGGAG 
GGCAGATGGT 
GTCTGACTGG 
TCGTTGCTCA 
AATTCAGTTA 



GGTTGATGTC 
TGAGAACTCG 
CAACGGAAGA 
AGAGAATXTT 
CAACATTACC 
AACCGTGCTG 
CCTGTTC3GTC 
CAGAGACTCA 
CCTGCCCCTC 
GTCGCTGGAC 
CTCCCOGGCT 
CTGGOCCTTC 
CCGGGCGGCT 
CCAQAAOGCC 
GTGGGCGCAC 
GCACAGGCTG 
GCTGCATGTG 
ACCGGOCCAG 
GTCTTCGCTC 
CAGGGOGGCC 
GGAGGTGAGC 
AGGCTCCGGG 
GGGTGCAGAG 
A 



60 
120 
100 
240 
300 
360 
420 
4 B0 
540 
600 
660 
720 
780 
840 
500 
960 
1020 

xoao 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1B00 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



Seq ID NO; 284 Protein, sequence 
Protein Accession 41* AAD43757.1 

1 11 21 31 

I I 1 I 

MBAGGERFliR QRQVLLLFVF LGGS1AGSE3 



41 51 

I I 
KQFTiIANIiAK DLGkFVEEIiA 



60 



1062 



WO 03/042661 



PCT/US02/36810 



ARGAQWSKG 
tRIIDVNDHS 



39KQHFQLSHQ 
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EFAQPLYEVA 
MQLVKVXNFE 
GETVliAVFSV 
ITITDUGTPR 
GTMA(JVTY3I> 
IrSSEALVRVL 
WLSYQI.LXAT 
LLVDGFSQPY 
SVORCSVPEO 
RVSEANPSFR 



YPELVLDKAD 
VLENTFVNSV 
AINSYEVDIE 
SDIiDSGDNGR 
LKTKYNITVL 
LPPQDPHIjPI, 
VLDANDHSPF 
EPGIiFOVKAH 
liPLPEAAPAQ 
PFPGQMVDVS 



TGDLlLIjNEKL 
KILESTLPGT 
CREEQPELSL 
rVTVSASDCLD 
AKDGGGLSGK 
VMCSIEWNLP 
VSDVNDNAPA 
SSLVSINADN 
VLYPTX3NGSA 
NGEVRTARLL 
AQADLLTVYIi 
GTGTLSQSYQ 



DRESLCQPTE 
VIPLGNAEDL 
TLiTALDGGSP 
TG9PGTISYA 
STVIVQWDV 
PFLEZPSVEKF 
FTQISYTXiFV 
GHLFALRSU5 
PCTEL.VPRAA 
SERDAAKHRL 
WAIASVSSL 
YEVCLTGGfiG 



PClLSFQIIi 
DVGRNSLQNY 
PRSGTAQINI 



NDNPPELTLS 
YTUVSEGALD 
RENNSPAtHI 
YEALOAFEFR 
EPGYIiVTKW 
WLVKDMGEP 
FLFSVLLFVA 
TKEFKFLKPI 



ONPLQFVTNE 
TITPW3HFHV 
QVLDUIDNAF 
TFQLNPITGD 
SVNSPIPENS 
RETRSEYHIT 
GSVSATDRDS 

vgatd rg Spa 

AVDGDSGQNA 
PR SAT AT LHV 
VSLCRRSRAA 
IPNFVAQQAE 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Esq ID NO: 285 DHA gegugngg 

Nucleic Acid Accession #t HM_001794.2 

Coding sequence : IS . .2765 

1 11 21 31 41 SI 

I. 1 I I I 1 

CGCCGGCGGG GAAGATGACC GCGGGCGCCG GCGTGCTCCT TCTGCTGCTC TCGCTCTCCG 
GCGC33CTCCG GGCCCATAAT GAGOATCTTA CAACTAGAGA GACCTGCAAG GCTGGGTTCT 
CTGAAGATGA TTACACGGCA TTAATCTCCC AAAATATTCTT AGAAGGGGAA AAGCTACTTC 
AAGTCAAGTT CAGCAQCTGT GTGGGGACCA AGGGGACACA ATATGAGACC AACAGCATQG 
ACTTCAAAGT TGGGGCAGAT GGGACAGTCT TCGCCACCCG GGAGCTGCAG GTCCCCTCCG 
AGCAGGTGGC OTICACGGXG ACTGCATGGG ACAGCCAGAC AGCAGAGAAA TGQGACBCCG 
TGGTGCGGTT GCTGGTGGCC CAGAOCTOOT CCCCGCACTC TGGACACAAG CCGCAGAAAG 
GAAAGAAGGT CGTGGCTCTG GACCOCTCTC CGCCTCOGAA GGACACCCTG CTGCCGTGGC 
CCCAGCACCA GAACOCCAAC oaGCTGAGGC GGCGGAAACG GGACTGGGTC ATCCCGCCCA 
TCAACGTGCC CGAGAACTCG CGCQGGCCCT TCCOGCAGCA GCTCGTGAGG A7/CCGGTCCG 
ACAAAGACAA TOACATCCCC ATCCGGTACA GCATCACGGG AGTGGGCGCC GACCAGCCCC 
CCATGGAGGT CTTCAGCATT GACTCCATGT CCGGCCGGAT GTACGTCACA AGGCCCATGG 
ACCGGGAGGA GCACGCCTCT TAGCACCTCC GAGCCCACGC TGTGGACATG AATGGCAACA 
AGGTGGAGAA CCCCATCGAC CTGTACATCT ACGTCATCGA CATQAATGAC AACCGCCCTG 
AQTTCATCAA CCAGGTCTAC AACGGCICCG TGGACGAGGG CTCCAAGCCA GGCACCTACG 
TGATGACCGT CACGGCCAAC GATGCTGACG ACAGCACCAC GGCCAACGGG AYGGTGCGGT 
ACOGGATCGT GACCCAQACC CGACAGAGCC CGTGCCAGAA TATGTTCACC ATCAACAGCG 
AGACTGGAGA TATCGTCACA GTGGCGGCTG GCCTGGACCG AQAGAAAGTT CAGCAGTACA 
CAGTCATCGT TCA3GCCACA GATATGGAAG GAAATCTCAA CTATGGCCTC TCAAACACAG 
CCACAGCCAT CATCACGGTG ACAGATGTGA ATGACAACCC GCCAGAATTT ACCGCCAGCA 

O BTTTO Caag ggaggtcccc gaaaaccgcg tggagaccgt ggtcgcaaac ctcacggtga 

TGGACCGAGA TCAGCCCCAC TCTCCAAACT GOAATQCCGT TTACCGCATC ATCAGTGGGG 
ATCCATCCQG QCACTTCAGC GTCOGCACAG ACCCCGTAAC CAACGAGGGC ATGGTCACCG 
TGGTGAAGGC AGTCGACTAC GAGCTCAACA GAGCTTTCAT QCTOACAGTG ATGGTMCCA 
ACCAGGCGCC CCTGGCCAGC GGAATCCAGA TOTCCTTCCA GTCCACGGCA GGGGTGACCA 
TCTCCATCAT GGACATCAAC GAGGCTCCCT ACTTCCCCTC AAACCACAAG CTGATCCGCC 
TGGAGGAGGG CGTGCCCCCC GGCACCGTGC TGACCAJCGTT TTCAGCTGTG GAOOCTGACC 
GGTTCATGCA GCAGGCTGTG AGATACTCAA AGCTGTCAGA CCCAGCGAGC TGGCTOCACA 
TCAATGCCAC CAACGGCCAG ATCACCACGG CGGCAGTGCT GGACCGTGAG TCCCTCTACA 
CCAAAAACAA CGTCTACGAG GCCACCTTCC TGGCAGCTOA CAATGGGATA CCCCCGGCCA 
GOGGCACCGG GACCCTCCAG ATCTATCTCA TTGACATCAA CGACAAGGCC CCTGAGCTGC 
TGCCCAAGGA GGCGCAGATC TGCGAGAAGC CCAACCTQAA CGCCATCAAC ATCAGGGCGG 
CCGACGGTGA COTCOACCCC AACATCGGCC CCTACGTCTT CGAGCTGCCC TTTGTCCCGG 
CGGCCGTGCG GAAGAACTGG ACCATCACCC GOCTGAAOGG TGACTATGCC CAACTCAGCT 
TGOGCATCCT GTACCTGGAG GGCGGGATGT ATGACGTCCC CATCATCGTC ACA0ACTCTQ 
GAAACCCTOC CCTGTCCAAC ACGTCCATCA TCAAAGTCAA GGTGTGCCCA TGTGATGACA 
ACGGGGACTG CACCACCATT 0GCGCAGTGG CAGCGGCTGG TCTGGGCACC GGTGCCATCG 
TGGCCATCCT CATCTGCATC CTCATCCTGC TGACCATGGT CCltSCMXCTT GTCATGTGGA 
TGAAGCGGCG AGAQAAGGAG CGCCACACGA AGCAGCTGCT CATTGACCCC GAGGACGACG 
TCOGCGACAA CATCCTCAAG TATGACGAGG AAGGCGGTGG CGAGGAGGAC CAGGACTACG 
ACCTCAGCCA GCTGCAGCAG CCGGAAGCCA TGGGGCACGT GCCAAGCAAA GCCGCTGGGG 
TGCGTCQCBT GGATGAQCGG CCGQTGGGGG CTGAGCCCCA GTACCCGATC AGGCCCATGG 
TGCCGCACGC AGGCGACATC GGTGACTTCA TCAATGAGGG ACTCCGCGCT GCTGACAACG 
ACCOCACGGC AOCCCCCTAT GACTCCCTGC TGGTCTTCGA CTACX3AGGGG AGC33GCTCCA 
CCGCAGGCTC COTCAGCTCC CTGAACTCAT CCAGTTCCGG GGACCAAGAC TACGATTACC 
TCAACGACTG GGGGCCCAGA TTCAAGAAGC TGGCGGACAT GTATGQAGGT GGTGAAGAGG 
ATTGACTGAC CTCGCATCTT OGGACCGAAS TGAGAGCCGT GCTCGGACGC CGGAGGAGCA 
GGACTGAGCA GAGGGGGCCG GTCTTCCCGA CTCCCTGCGG CTGTGTCCTT AGTGCTGTTA 
QGAGGCCCCC CAATCCCCAC GTTGAQCTGT CTAGCATGAG CACCCACCCC CACAGCGCCC 
TK3CACCCGGC CGCTGCCCAG CAOCGCGCTG GCTGGCACTG AAGGACAGCA AGAGGCACTC 
TGTCTTCACT TGAATTTCCT AGAACAGAAG CACTGTTTTT AAAAAAAAAA AAAAAAAAAG 
AAG 
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S&q ID NO; 286 Protein sequence 
Pirotein Accession ft* MP_001785.2 

1 11 21 31 41 51 

I I I I I 1 

KEAGAG VT J iT i LLSLSGALRA HHEDLTTRET CKAQFSEDDY TAI>ISQNILE GEKLLGVKFS 60 
SCVGTKGTQY ETNSMDFKVG ABGTVFAXRB LQVPSEQVAF TVTAWDSQTA EKWDAWHIiIj 12 0 
VAQTSSPHSG HKPQKGKKW ALDPSPPPKD TLLPWPQHQN ANGLRRRKRD WVIPPBOVPE 180 
NSRGPFPQQL VRIR8DKDND IPIRYSITGV GADQPEMEVF SIDSMSGRMY VTRPMDREEH 240 
AGYHLRAHAV DMMGHKVENP IDLYIYVIDM NHSRPEFINQ VYNGSVPEGS KPGTYVMTVT 300 
ANDADDSTTA MGMVRYRIVT OTPOSPSQNM FTINSETGDI VTVAAGLDRE 3CVQQYTVXVQ 360 

1063 



WO 03/042661 



PCT/US02/36810 



ATDMEGNLNlf 
PKSPHWMftVY 
ASGIQMSFQS 
AVRYSKL.SDP 
LQI¥I>IDIND 
NWTITRLHGD 
TIGAVAAAGL 
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DIGDFINHSIj 
PRFKKkADMY 



GLSNTATAII 
EIISGDPSOH 
TAGVTISIMD 
ASWLKINATN 
NAPELLPKBA 
YAOLSWIIiY 
GTGAIVAII>I 
EDQDYDLSQIi 
RAADNDPTAP 



TVTDVNDNPP 
FSVRTPPVTN 
INEAPYFPSN 
GQITTAAVLD 
QICEKPNLNA 
LEAGMYDVPI 
CILlLliTMVI, 
QQPEAMGKVP 
PYDSTJiVFDY 



EFTASTFAGE 
BGMVTWKAV 
HKblRLEEGV 
RESliYTKNNV 
INITAADADV 
rVTDSOJPPI, 
LPVMWMKRRE 
SKAPGVRKVD 



VPENRVETW 
DYHLNRAFML 
PPGTWTTFS 
YEATFliAADN 
DPKIGPYVPE 
SNTSlIKVKV 
KERHTKQZiLZ 
ERPVGAEPQY 



ANLTVMDRDQ 
TVMVSHQAPL 
AVDPDRFMQQ 
GIPPASGTGT 
liPPVPAAVHK 
CPCEfcDNGECT 
DPHDDVHDNT 
FIRPMVPHPG 
QDYDYLMDHG 



420 
4BD 
540 
600 
660 
720 
7B0 
840 
900 



Seq ID NO; 287 DNA sequence 

Nucleic Acid Accession ftj AF152495.1 

Coding sequences 1..2397 

1 11 21 31 41 51 

I I I i I I 

ATGGAGGCCG GAGAGGGGAA GGAGCGCGTT COGAAACAAA GGCAAGTCCT GATATTCTTT 
GTTTTGCTGG GCATAGCTCA GGCTAOTTGC CAGCCTAGGC ACT ATT CAGT GGCCGAGGAA 
ACGGAGAGTG GCTCCTTTGT GGCCAATTTG TTAAAAGACC TGGGGCTGGA GATAGGAGAA 
CTTGCTGTGA GGGGGGCCAG GGTCGTTTCC AAAGGAAAAA AAATGCATTT GCAGTTCGAT 
AGGCAGACCG GGGATTTGTT GTTAAATGAG AAATTGGACC GGGAGGAGCT GTGCGGCCCC 
ACAGAGCCCT GTGTCCTACC TTTCCAGGTG TTACTAGAAA ATCCCTTGCA GTTTTTTCAG 
GCGGAGCTAC GGATTAGGOA CGTAAATGAT CATTCCCCAfi TTTTCCTAGA CAAAGAAATA 
CTTTTGAAAA TTCCAGAAAG TATCACT C CT GGAACTACTT TCTTAATAGA ACGTGCCCAG 
GACTTGGATG TAGGAACCAA CAGTCTOCAA AATTACACAA TCAGTCCCAA TTTCC3WCTTT 
CATCTTAATT TACAAGACAG TCTCGATGGC ATAATATTAC CACAGCTGGT GCTGAACAGA 
GCCCTGGATC GCGAGGAGCA GCCTGAGATC AGGTTAACCC TCACAGCGCT AGATGGCGGG 
AGTCCACCCA GGTCOGGCAC GGCCCTGGTA CGGATTGAAG TTGTGGACAT CAATGACAAC 
GTCCCAGAGT TTGCAAAGCT GCTCTATGAG GTGCAGATCC CGGAGQACAG CCCCGTTGGA 
TCCCAGGTTG CCATCGTCTC TGCCAGGGAT TTAGACATTG GAACTAATGG AGAAATATCT 
TATGCATTTT CCCAAGCATC TGAAORCATT CGCAAAACGT TTCGATTAAG TGCAAAATCG 
GGAGAACIGC TTTTAAGACA GAAACTGGAT TTCGAATCCR TCCAGACATA CACAGTAAAT 
ATTCAGGCGA CAGATGGTGG GGGCCTATCT GGAACTTGTG TGGTATTTGT CCAAGTGATG 
GATTTGAATG ACAATCCTCC GGAACTAACT ATGTCGACAC TTATCAATCA GATCCCAGAA 

AACTTGCAGG ACAOCCTCAT tgctotattc agcgtttcag atcctgactc cggagacaac 
ggaaggatgg tgtgctocat ccaagatgat cttccttttt tcttgaaacc ttctgttgag 
aacttttaca ctctggtgat aagcacggcc ctggaccggg agaccagatc cgaatacaac 
atcaccatca ccgtcaccga cttcgggaca cccaggctga aaaccgagca caacataacc 
gtgctggtct ccgacgtcaa tgacaacgcc ccogccttca cccaaacctc ctacaccctg 
ttogtccgcg agaacaacag ccccgccctg cacatcggca gcgtcagcgc cacagacaga 
gactogggca ccaacgccca ggtcacctac tcgctgctgc cgccccagga cccgcacctg 
cccctcgcct occtggtctc catcaacgcg gacaacggcc acctgttcgc tctccagtcg 

CTGOACTACG AGGCCCTGCA GGCGTTCGAG TTCOGCGTGG GCGCCGCAGA CGGCGGCTCC 
CCGGCGTTGA GCAGCGAGGC GCTGQTGCGC GTGCTGGTGC TGOACGCCAA CGACAACTCG 
CCCTTCGTGC TGTACCOGCT GCAGAACGGC TOCGOGCCCT GCACCGAGCT GGTGCCCCGG 
GCGGCCGAGC CGGGCTACCT OGTGACCAAG GTGGTGGCGG TGGACGGCGA CTGGGGCCAG 
AACGCCTGGC TGTCGTACCA GCTGCTCAAG GCCACGGAGC CCGGGCTGTT CGGCGTGTGG 
GCGCACAATG GCGAGGTGCG CACCGCCAGG CTGCTGAGOQ AGGGGGAOGC TGCCAAGCAG 
AGGCTGGTGG TGCTGGTCAA GGACAATGGC GAGOCTCOGC GCTCGGCCAC CGCCACGCTG 
CACGTGCTCC TGGTGGACGG CTTCTCCCAG CCCTAOCTGC TGCTCCCGGA GGCGGCACCG 
GCGCAGGCCC AGGCCGACTT GCTCACCGTC TACCTGGTGG TGQOGTTGGC CTCGGTGTCT 
TCGCTCTTCC TCTTCTCGOT GCTCCTGTTC GTGGOTGTGC GGCTGTGCAG GAGOAGCAGG 
GCGGCCTOGG TGGGTCGCTG CTCGGTGOCC GAGGGCCCCT TTCCAGGGCA GATGGTGGAC 
GTGAGCGGCA CCGGGACCCT GTCCCAGAGC TACCAGTACG AGGTGTGTCT GACTGGAGGC 
TCCGGGACAA ATGAGTTCAA GTTCCTGAAG CCAATTATCC CCAACTTOGT TGCTCAGGGT 

gcagagaggg ttagcgaggc aaatcccagt ttcaogaaga GCTTTGAATT CACTTAA 

Seq id HO i 288 protein sequence 
Protein Accession #i AAD437S6.1 



LAVRGAKWS 

AELP.IPJJVND 
HLULQDSLDG 
VPHPAKLLYE 
GELLLRQKLD 
MliQDTIiIAVF 
ITITVTDFGT 
D3GTHAQVTY 
PALSSEAI»VR 
NAVJLSYQLUC 
HVl*IiVDGF8Q 
AASVGRCSVP 
AERVSEANPS 



11 
I 

PKQRQVLIFF 
XGKKKHLQFD 
HSPVFLDKEI 
IIIiPQIiVIiNR 
VQIPBDSPVG 
FESIQTYTVN 
SVSDPDSGDU 
PRLKTEHITIT 
SLLPPQDPHL 



ATEPGLFGVW 
PYLIiLPEAAP 
EGPFPGQMVD 
FRKSFEFT 



21 
\ 

VLLGIAQASC 
HQTGDLLEWB 
LUCTPESITP 
AIiDRHEQPEI 
6QVAIV3ARD 
IQATDGGGLS 
GRMVCSIQDD 
VLVSDVNDNA 
PliASLVSINA 
PFWYPDQSG 
AHNGEVRTAR 
AQAQADEJiTV 
VSGTGTLSQS 



31 
I 

QPRHYSVAEE 

kldreedcgp 
gttflieraq 

ELTUTAIiDGG 
LDIGTNGBIS 
GTCWFVQVM 
laPFFZiKPSVE 
PAFTQTSYTJ» 
DNGHIiPALQfi 
SAPCTELVPR 
IJaRBKDAAKQ 
YLWALAHVS 
YQYEVCLTGG 



41 
I 



TEPCVLPPQV 
DUOVGTNSIiQ 
SPPRSGTAIiV 
YAFSQASED1 
DLMDKPPELT 
HFYTLVT8TA 
FVRKHNSPAL 
LDYEALQAFE 
AAEPGYIiVTSt 
RLWLVKDNG 
SIiFLFSVLL-F 
SGTNSFKFUC 
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Seq ID NO: 289 UNA sequence 

Nucleic Acid Accession ft: KM_018674.1 

Coding sequence: 3 90.. 2 00 9 



11 
1 



21 
1 



31 
I 



41 



51 
I 

LKDLGLEIGE 
IiIfENPIiQFFQ 
NYTISPN3HF 
RIEWDINDN 
SXT27RLSAKS 
MSTLINOIPE 
URETRSEYM 

higsvsatdr 
frvgaadrgs 

WAVDGDSGQ 
EPPR9ATATL 
VAVRLCRRSR 
PIIPNFVAQG 
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1064 



WO 03/042661 PCT/US02/36810 



CGQAGCACAT GCTGAGCGGA GCGGCTGGGG CTGCGCGGCG TGGCGQAQCA GCGCTCGCTC 60 

CCTGGCTCAC TCGCT03CTC GCAGGGACAC ACaCAGGGGC TGACAGCTGT GCTGGTGCTG 120 

ATAAGGGAAG CCM2AAGGAG ACGATCGAGG AGAGAGACAA GCGGCAGCAG AGGCAGCAGC 180 

GGCAGAGGCA GCACCAGQGC TGCG6AGCTG CTGGSAGTGG GAGTGACTCC CCCACCTCGG 240 

5 GCCOCCACCC TGTCCCTGTC CTCTTCOOGC TTOCCCTGAG TTTAGAAGAG CAGCCGCTGC 300 

CACCACTGCC ACTCGGGAGG GCACCTlGGGO TGCTGGCTAG GGAGGGACAG GGCAGGGAGG 3 GO 

CTCTGGCCAG TCCCAGCAGC CGGGGACAGA TGCCGATCGA GATTGTGTGC AAAATCAAAT 420 

TTGCTQAGGA GGATGCGAAA CCCAAGGAGA AGOAGGCAGG GGATGAGCAG AGCCXCCTCG 4BQ 

A GGGCTGlTGC CCCTGGAGCA GCCCCCCGAG ACCTGGCCAC CTTTGOCAGC ACCAGCRCCC 540 

10 TOCATQGACT GGGCCGGGCC TGTGGCCCAG GCCCCCACGG ACTGCGCAGA ACCCTGTGGG 600 

CACTGGCCCT ACTCACCTCG CTGQCTGCCT TCCTGTACCA GGCGGCTGGC CTGGCCCGGG 660 

GCTACCTGAC COGGCCTCAC CTGGTGGCAA TGGACCOCGC TGCCCCAGCC TCAGTGGCGG 720 

GCTTCCCGGC TGTCACCCTC TGCAATATCA ACCGCTTGCG GCATTCGGCA CrCAGOGATG 780 

CCGACATCTT CCACCTGGCC AATCTGACAG GGCTGCCCGC GAAAGACCGG GATGGGCACC 840 

15 GTGCGGCTGG CCTGCGCTAC CCAGAGCCTG ACATGGTAGA CATCCTCAAC CGCACTGGCC 900 

ACCAGCTCGC CGACATGCTT AAGAGCTGCA ACTTCAGTGG GCATCACTGC TCCOCCAQCA 9G0 

ACTTCTCIGT GGTCTATACT CGCTATGGGA AGTGTTACAC CTTCAACGCG GACCCGCGGA 1020 

GCTCGCTGCC CAGCCGGGCA GGGGGCATGG GCAGTGGCCT GGAGATCATa CTGGACATCC 1080 

AGCAGGAGGA GTACCTGCCC ATCTGGAGGG AGACAAATGA GACGTCGTTT GAGGCAGGTA 1140 

20 TTCGGGTGCA GATCCACAGC CAGGAGGAGC CGCCCTACAT CCACCAGCTO GGGTTCGGGG 1200 

TGTCCCCAOG CTTCCAGACC TTTSTGTCCT GCCAGGAACA GCGGCTGACC TACCTGCCCC 1260 

AGCCCTGGGG CAACTGCCGC GCAGAGAGTG AGCTCAGGGA GCCTGAGCTT CAGGGCTACT 1320 

CGGCCTACAG TGTGTCTGCC TGCCGGCTGC GCTGTGAAAA GGAGGCCGTG CTTCAGCGCT 1380 

_ _ GCCACTGCCG GATGGTGCAC ATGCCAGGCA ATGAQACCAT CTQCOCACCA AATATCTACA 1440 

25 TCGAGTGXGC AGACCACACA CTGGACTCCC TGGGTGGGGG CCCTGAGGGC CCGTGCTTCT 1500 

GCCCCACCCC CTGCAACCTG ACACGCTATG GOAAAGAGAT CTCCATGGTC AGGATCCCCA 1560 

ACAGGGGCTC AGCCCGGTAC CTGGCGAGGA AGTACAACCG CAACGAGACC TACATACGGG 1620 

AGAACTTCCT GGTCCTAGAT GTCTTCTTTG AGGCCCTGAC CTCTGAAGGC ATGGAGCAGC 1680 

GAGCAGCCTA 1GGCCTGTCA GCOCTGCTGG GAGACCTCGG GGGACRGATQ GGCCTGTTCA 1740 

30 TTGGGGCCAG CATCCTCACG TTGCTGGAGA TCCTCGACTA CATCTATGAG GTGTCCTGGG 1800 

ATCGACTGAA GCGGGTATGG AGGCGTCCCA AQACCCCCCT GCGGAGCTCC ACTGGGGGCA I860 

TCTCCACTTT GGGGCTTCAG GAGCTGAAGG AACAGAGTCC CTGCCCGAGC CTGGGCCGAG 1920 

OGGAGGGTGG GGGGGTCAGC AGTCTGCTCC CCAATCACCA CCACCOCCAC GGTCCCOCRG 1980 

GAGGTCTCTT TGAAGATTTT GCTTGCTAGG ACGGTGCTGT GACTGAAAGG ACCCAGGAGT 2040 

35 CTGGGACCCC TCCTGGGATC CCCAGCACAT TCTCCTGCTC CTGOGAGAGG OCTGGGGGCG 2100 

GTGCTCACTG GGAGGGCCAG GACTCAGTTC CTGCTCTCAT CCTCCCCTGC CCTGATGTCA 2160 

GCTGCTTTGC ACAAAGGTCC TTCTTGTCCA CACCCCTTAT CCCCAGGCTG GTGCCOOGGG 2220 

AGGGCTGGAG ACCAGGCCAT GGGCCCTCAC GGAGAGGAAG GGAAGGAAGG AGAGGGAGGG 2280 

GGAGGATAGA GCCCATCCCA GCCGGGGAGG GGGAGCCCTC TGTACATTTG TAAATATTTA 2340 

40 GGGAAAGOCG GGTGGGQGGA GGGGATACAG ASGTAGAAGG TGGGTAGGGC TACAGGGGTG 2400 

GGTGATTTAG GGACAGCCAG GGTCCCAGCC CCAATGTCAQ CAGGATAGGG AGAGCCOCAG 2460 

GACTCAGGAQ TGCTGGGCTG GTGCTACTTC CTGCCCCTCT CCAGGCCCAG CTCCCCTCTT 2520 

GGCAGGGGGA GAGGATGGCC CAGCAGGCCT GGCCCAGCTC OMTTOCCC CTGCACCAGC 2580 

CCCACCCCTA GAGTCCCTTC TATAGGGAGG GGGCAGGAGA CXIXTCCAGAC TTCGGCTGAG 2640 

45 CTTGGAGGGT GGGAAGGGAG CCT TCTCAGT CCTCTCTCCC TCCAGTCTGA TTTTATAAAG 2700 



50 
55 
60 
65 
70 
75 
80 



Seq ID NDs 290 Protein gggagncg 
Protein Accession NP_061144.1 



1 11 21 

S 1 I 

MPIEXVCKIK FAEEDAKPKE KEAGDEQSIJj 
GPHGLHRTLW ALAliLTSIAA FLYQAAGIAR 
NRFEHSALSD ADIPHLAHLT QLPPKDRDGH 
NFSGHHCSAS NFSWYTRYG KCYTFNADPR 
ETNETSFEAG IRVQZHSQKE PPYTHQLGFG 
ELREPELQGY SAYSVSACRIj HCEKRAVLQR 
LGGGPEGPCP CPTPCNIiTRY QKEISMVRIP 
EALTSEAMEQ RAAYGLSALIi GDLGGQMGLF 
KTPLRTSTGG I STLGLQELK EO/SPCPSLGR 

Seq ID NO: 291 DUA setyni*Tire 
Nucleic Acid Accession #s Eo 
Coding sequence: 62.. 895 



31 
I 

GAVAPGAAPR 
GYLTRPHLVA 
RAAGLRYPEP 
SBLPSRAGGN 
V3PGPQTFVS 
CHCRMVHMPG 
MRGSARYIiAH 
IGASILTLLB 



41 
I 

DLATFA3T5T 
MDPAAPAPVA 
DMVDlINRTG 
GSGIiEIM&DI 
CQBQRLTYLP 
NBTICPPNIY 
KYNS1IETYIR 
ILDY1YEV3W 



51 
I 

LHGLGRACGP 
GFPAVTXCM1 
HQLADMLESC 
QQEEYItPIWR 
QPWGHCRAES 
IECADHTLD3 
EtJFLVLDVFF 
DRLKRVWRRP 



1 
I 

CACTGCTCTG 
TATGATCATC 
CAAGGATGGA 
AGAAGCACGG 
TGAAGGCGGC 
TGTCTGTGCT 
GCCCAACTGT 
TGAAAGATGG 
AGATCCAAAO 
CTGCTACTGG 
TGACCTTGAA 
TGATGTCCAT 
TACAGGAART 
CCAAATCAAA 
TACTACTTCT 



11 
I 

AGAATTTGTG 
TTAATITACT 
ATTTTTC3VTA 
TCTGGCAAAT 
CATCTCGCAA 
GCTGGATGGA 
GGATTTGGAA 
GATGCCTATT 
CAAATTT TTA 
CACATTAGAC 
GATGACCCAG 
GGCTTTGTGG 
GTCATGACCT 
TATGTTGCRA 
ACTGGAAATA 
TGATCAAAAC 



21 

\ 

AGCAGGCCCT 
TATTTCTCTT 
ACTOCATATG 
ACAAGCTCAC 
CTTACAAGCA 
TGGCTAAGGG 
AAACTGGCAT 
GCTACAAC CC 
AATCTCCAGG 
TCAAGTATGG 
GrTGCTTGGC 
GAAGATACTG 
TGAAGTTTCT 
TGGATCCIGT 

AAAAcrrrrr 

ACACAGTGTT 



31 
1 

AACAGGCTGT 
GCTATGGGAA 
GCTTGAACGA 
CTACGCAGAA 
GCTAQAGGCA 
CAGAGTTGGA 
TATTGAXTAT 
ACACGCAAAG 
CTTCCCAAAT 
TCAGCGTATT 
TGATTATGTT 
TGGAGATGAG 
AAGTGATGCT 
ATOC3UVATCC 
AGCTGGAAGA 
TATGTTGGAA 



41 51 

1 I 

TACTTCACTA CAACTGACGA 
GACACTCAAG GATGGGGATT 



GCTAAGGOGG 
GCCAGAAAAA 
TACOCCATTG 
GGAATCCGTC 
GAGTGTGGTG 
GAGTACGAAG 
CACCTGAGTT 
GAAATATATG 
CTTCCAGATG 
TCAQTGACAG 
AGTCAAGGAA 
TTTAGCCACT 
TCTTTTGGAA 



TGTGTGAATT 
TTGGATTTCA 
TGAAGCCAOG 
TCAATAGGAG 
GCGtrCZTTAC 
ATAACCAAAT 
TTTTAGATTT 
ACAGTTACGA 
ACATCATCA3 
CTGGAGGTTT 
AAAATACAAD 
TATAAAAAAA 
CTCCTTTGAT 



60 
120 
180 
240 
300 
360 
420 
4B0 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



1065 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 



CTCACTGTTA 
TAGQGAAAAT 
ACTGCATAGA 
TTTGTGGTAT 
TCTATGTACA 
TCATTGATTA 
TGTTTTATGC 
ATTGTTGCAA 



TTATCAACAT 
TGGAAAATAT 
AATAACAAGC 
ATGTATATAT 
GTTTTGTATT 
TTCTACAAAA 
ATTATTTAAG 
TAAATATCCT 



TTATTTATTA 
AGGAAACTTT 
GTTAACATTT 
GTACCTATAT 
ATACTTTTTA 
ACATGATTTT 
CCTGTCTCTA 
TGAACACACA 



TTTTTCTAAA 
AAACGAGAAA 
TCATATXTTT 
GTATTTGCAT 
AATCTTGAAC 
AAACAGCTOT 
TTGTTGGAAT 
AAAAAAAAAA 



TGTOAAAGCA 

atgaaacctc 
ttctttcrgt 

TTGAAATTTT 
TTTATAAACR 
AAAATATTCT 
TTCAGGTCAT 
AA 



ATACATAATT 1020 

TCATAATCCC 1080 

CATTTTTCTA 1140 

GGAATCCTGG 1200 

TTTTCTGAAA 1260 

ATGATATGAA 1320 

TTTCATAAAT 1380 



Sea; ID NO; 292 Protein sequence 
Protein Accession #r Eos sequence 



1 11 

1 1 
HI ILI YLFLIj LWEDTQGWGP 
EGGHLATYKQ LEAARKIGFH 
BRWDAYCYHP HAKBCGGVFT 
DLEDDPGCLA DYVEIYD3YD 
QIXYVAMDFV SKSSQGKNTB 



21 31 41 51 

I 1 1 I 

KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAXAVCEF 
VCAAGWMAKG KVGYPIVKPG PNCGFGKTGI I D YG I KLNRS 
DPKQIFKSPG PFNEYEDNQI CYMHIRLKYG QRIHLSPLDP 
DVHGPVGRYC GDELPDDII8 TGHVMTLKFI, SDAEVTAGGF 
TTSTGNXNFI* AGRFSHLi 



Seq ID HOj 293 DNA SgOjjgnge 

Hucleic Acid Accession #i NM00711S. 

Coding sequence? 69.. 902 



1 

1 

GAATTCGCAC 
CTGAOGATAT 
GGGGATTCAA 
AOCACAGAGA 
GTGAATTTGA 
GATTTCATGT 
AGCCAGGGCC 
ATAGGAGTGA 
TCTTTACAGA 
ACCAAATCTG 
TAGATTTTGA 



TCATCftGTAC 
GAGGTTTCCA 
ATACAAGTAC 
AAAAAAAAAA 
TTGATCTCAC 
TAATTTAGGG 
ATCCCACTGC 
TTGTATTTGT 
CCTGCTCTAT 



ATGAATGTTT 
TAAATATTGT 



11 
1 

TGCTCTGAGA 

gat cat err A 

GGATGGAATT 
AGCAOGGTCT 
AGGCGGCCAT 
CTGTGCTGCT 
CAACTGATGA 
AAGATGGGAT 
TCCAAAGCGA 
CTACTGGCAC 
CCTTGAAGAT 
TGTCCATGGC 
AGGAAATGTC 
AATCAAATAT 
TACTTCTACT 
AAGGATGATC 
TGTTATTATT 
AAAATTGGAA 
ATAGAAATAA 
GGTATATGTA 
GTACAGTTTT 
BATTATTCTA 
TATGCATTAT 
TGCAATAAAT 



21 
I 

ATTTGTGAGC 
ATTTACTTAT 
TTTCATAACT 
GGCAAATACA 
CTCGCAACTT 
GGATGGATGG 



GCCTATTGCT 
ATTTTTAAAT 
ATTAGACTCA 
GACCCAGGTT 



ATGACCTTGA 
GTTGCAATGG 
GGAAATAAAA 
AAAACACACA 
AACATTTATT 
AATATAGGAA 
CAAGCGTTAA 
TATATGTACC 
GTATTATACT 
CAAAAACATG 
TTAAGCCTGT 
ATCCTTCGGA 



31 
I 

AGCCCCTAAC 
TTCTCTTGCT 
CCATATGGCT 
AGCTC2ACCTA 
ACAAGCAGCT 
CTAAGGGCAG 
CTGGCATTAT 
ACAACCCACA 
CTCCAGGCTT 
AGTATGGTCA 
GCTTGGCTGA 
GATACTGTGG 
AGTTTCTAAG 
ATCCTGTATC 
ACTTTTTAGC 
GTGTTTATGT 
TATTATTTTT 
ACTTTAAACG 
CATTTT CATA 
TATATGTATT 
TTTTAAATCT 
ATTTTAAACA 
CrCTATTGTT 
ATTC 



41 
I 

AGGCTGTTAC 
ATGOGAAGAC 
TGAACOAGCA 
CGCAGAAGCT 
AOAGGCAGCC 
AGTTGGATAC 
TGATTATGGA 
CGCAAAGGAG 
CCCAAATGAG 
GCGTATTCAC 
TTATGTTGAA 



CAAATCCAGT 
TGGAAGATTT 
TGGAATCTTT 
CTAAATGTGA 
AGAAAATGAA 
TTTTTTXCTT 
TGCATTTGAA 
TGAACTTTAT 
GCTGTAAAAT 
GGAATTTCAG 



51 
i 

TTCACTACAA 
ACTCAAGGAT 



AAGGC3GGTGT 
AGAAAAATTG 
CCCATTGTGA 
ATCC&TCTCA 



TACGAAGATA 
CTGAGTTITT 
ATATATGACA 
CCAGATGACA 
GTGACAGCTG 
CAAGGAAAAA 
AGCCACTTAT 
TGGAACXCCT 
AAGAAATACA 
ACCTCTCATA 
TCAGTCATTT 
ATTTTGGAAT 
GAACATTTTC 
ATTCTATGAT 
GTCATTTTCA 



60 
120 
180 
240 



60 
120 
180 
240 
300 
3 60 
420 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
12 00 
1260 
1320 
1380 



Beg ID NO* 294. Protein sequence 
Protein Accession #■ MP_O09046.1 

1 11 21 31 41 51 

I 1 1 1 I I 

MIJLlYLFIiL LWEDTCGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG KVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 
BRWDAYCYHP HAKECGGVFT DPXRIFKSPG FPNSYEDNGI CYHffiCtLLKYG QRIHLBFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGPVGRYC GDELPDDIIS TGNVHTLKPIi 3DASVTAGGF 
QIJCYVAMDFV eKSfiQGKMTfl TTSTGNKUFL AGRFSHL 



60 
120 
180 
240 



Seq ID MO i 295 DNA sequence 
65 Mticleic Acid Accession #i NH00121B.2 
Coding sequence i 116.. 1180 

1 11 21 31 41 51 

H<\ I I * 1 1 1 

7U GTACTCGCCA CGGCACCCAG GCTGCGCGCA CGCGGTCCCG GTGTGCAGCT GGAGAGCGAG 60 

CGOCCACCGG GAGCCCCCGG CACAGCCCGC GCCCGCCCCG CAGGAGCCCG CGAAGATGCC 120 

OCGGCGCAGC CTGCACGCGG CGGCCGTGCT CCTGCTGGTG ATCTTAAAGG AACAGCCTTC 180 

CAGCCCGGCC CCAGTGAACG GTTCCAAGTG GACTTATTTT GGTCCTGATG GGGAGAATAG 240 

CTGGTCCAAG AAGTACCCGT CGTGTGGGGG CCXGCTGCAG TCCCCCATAG ACCTGCACAG 300 

75 TGACATCCTC CAGXATGACG CCAGCCTCAC GCCCCTCQAG XTCCAAGGCT ACAATCTGTC 360 

TGCCAACAAG CAGTTTCTCC TGACCAACAA TGGCCATTCA GTGAAGCTGA ACCTGCCCTC 420 

GGACATGCAC ATCCAGGGCC TCCAGTCTCG CTACAGTGCC ACGCAGCTGC ACCTGCACTG 480 

GGGGAACCCG AATGACCCGC AOGGCTCIGA GCACACCGTC AGCGOACAGC ACXTCGCCGC 540 

_ _ CGAGCTGCAC ATTGTCCATT ATAACTCAGA CCTTTATCCT GACGCCAGCA CTOCCAGCAA 600 

80 CAAGTCAGAA GGCCTCGCTG TCCTGGCTGT TCTCATTGAQ ATGGGCTCCI TCAATCCGTC 660 

CTATGACAAG ATCTTCAGTC ACCTTCAACA TGTAAAGTAC AAAGGCCAGG AAGCATTCGT 720 

CCCGGGATTC AACATTGAAG AGCTGCTTCC GGAGAQGACC GCTGAATATT ACCGCTACCG 780 

GGGGTCCCTG ACCACACCCC CTTGCAACCC CACTGTGCTC TGGACAQTTT TCOGAAACCC B40 

CGTGCAAATT TCCCAGGAGC AGCTGCTGGC TTTGGAGACA GCCCTGTACT GCACACACAT 900 



1066 



WO 03/042661 



PCT/US02/36810 



GGACGACCCT TCCCCCAGAG AAATGATCAA CAACTTCCGG CAGGTCCAGA AGTTCGATGA 960 

GAGGCTGGTA TACACCTCCT TCTCCCAAGT GCAAGTCTQT ACTGCGGCAG GACTGAGTCT 1020 

GGGCATCATC CTCTGACTGG CCCTGGCTGG CATTCTTGGC ATCTGTATTG TGGTGGTGGTT 1080 

QTCCAT^IGG CTTTTCAGAA GGAAGAGTAT CAAAAAAGGT GATAACAAGG GAGTCATTTA 1140 

5 CAAGCCAGCC ACCAAGATGG AGACTGAGGC CCACGCTTGA GGTCCCCGGA GCTCCOGGGC 1200 

ACATCCAGGA AGGACCTTGC TTTGGACCCT ACACACTTCG GCTCTCTGGA CACTTGCGAC 1260 

ACCTCAAGGT GTTCTCTGTA GCTCAATCTG CAAACATGCC AGGCCTCAGG GATCCTCTGC 1320 

TGGGTGCCTC CTTGCdTGG GACCATGGCC ACCCCAGAGC CATCCGATOG ATGGATGGGA 1380 

TGCACTCTCA GACCAAGCAG CAGGAATTCA AAGCTGCTTG CTGTAACTGT GTGAGATTGT 1440 

10 GAAGTGGTCT GAATTCTGGA ATCACAAACC AAGCCATGCT GGTGGGCCAT TAATGGTTGG 150 0 

AAAACACTTT CATCCGGGGC TTTGCCAGAG CGTGCTTTCA AGTGTCCTGG AAAGTCTGCT 1560 

GCTTCTCCAA GCTTTCAGAC AAGAATGTGC ACTCTCTGCT TAGGTTTTGC TTGGGAAACT 1620 

CAACrrCTTT CCTCTGGAGA CGGGGCATCT CCCTCTGATT TCCTTCTGCT ATGACAAAAC 1680 

CTTTAATCTG CACCTTACAA CTCGGGGACA AATGGGGACA GGAAGGATCA AGTTGTAGAG 1740 

15 AGAAAAAAGA AAACAAGAGA TATACATTGT GATATATTAG GGACACTTTC ACAGTCCTGT IB 00 

OCTCTGGATC ACAGACACTG CACAGACCTT AGGGAATGGC AGGTTCAAQT TCCACTTCTT I860 

GGTGGGGATG AGAAGGGAGA GAGAGCTAGA GGGACAAAGA GAATGAGAAG ACATGGATGA 1920 

TCTGGGAGAG tctcacxttg gaatcagaat TGGAATCACA TTCTGTTTAT CAAGCCATAA 19 bo 

TOTAAGGACA GAATAATAGA ATATTAAGTC CAAATCCAAC CTCCTGTCAG TGGAGCAGTX 2040 

20 ATGTTTTATA CTCTACAGAT 1 TTACAAATA ATGAGGCTGT TCCTTGAAAA TGTGTTGTTG 2100 

CTGTGTCCTG GAGGAGACAT GAGTTCCGAG ATGACCCAAT CTGCCTTTGA ATCTGGAGGA 2160 

AATAGGCAGA AACAAAATGA CTGTAOAACT TATTCTCTGT AGGCCAAATT TCATTTCAGC 22 20 

CACTTCTGCA GGATCCCTAC TGCCAACCTG GAATGGAGAC TTTTATCXAC TTCTCTCTCT 22 BO 

CTGAAGATGT CAAATCGTGG TTTAGATCAA ATATATTTCA AGCTATAAAA GCAGGAGGTT 23 40 

25 ATCTGTGCAG GGGGCTOGCA TCATGTATXT AGGGGCAAGT AATAATGGAA TGCTAGTAAG 2400 

ATACTCCATA TTCTTCCCCG AATCACACAG ACAGTTTCTG ACAGGCGCAA CTCCTCCATT 2460 

TTCCTCCCGC AGGTGAGAAC CCTGTGGAGA TGAGTCAGTQ CCATGACTGA GAAGGAACCG 2520 

ACCCCTAGTT GAGAGCACCT TGCAGTTCCC CGAGAACTTT CTGATTCACA GTCTCATTTT 2S80 

GACAGCATGA AATGTCCTCT TGAAGCATAG CTTTTTAAAT ATCTTTTTCC TTCTACTCCT 2640 

30 CCCTCTGACT CTAAGAATTC TCTCTTCTGG AATCGCTTGA ACCCAGGAGG OGGAQSTTGC 2700 

AGTAAGCCAA GGTCATGCCA CTGCACTCIA GOCTGGOTGA CAGAGCQAOA CTCCATCTCA 2760 
AAAAAAAAAA AAAAA 

Seq ID HO; 296 Protein sequence 
35 Protein Accession #: WP_001209.1 

1 11 21 31 41 SI 

111111 

MPRRSLHAAA VLLliVILKEQ PSSPAFVNGS KHTYFGPDGE NSWSKKYPSC GGLLQSPIDX. 60 
40 HSDIIiQYDAS LTPI^FQGYH I»BAMRQPLLT WMGHSVXLN1. PSDMHIQGLQ SRYSATQLItL 120 
HWGNPNDPHG 9EHTVSGQHF AAHLBIVHYN 6DLYPDA3TA SNKSEGItAVL AVLIEMGSFN 180 
PSYDKIFSHL QEVKYRGQHA FVPGEWIEEIi LFERTAEYYR YRGSLTTPPC NPTOLWTVFR 240 
NPVQISQEQL DALiETALYCT HMDDPSPRH4 INNFRQVQKF DEEOOTTSPS QVOVGTAAGX. 300 
SLGllLSLAIi AGILGICIW WfilWIiFSSK SIKKGDNKGV lYKPATKMHT KAHA 

45 

Seq ID MOs 297 DNA gegggngg 

NUCl^iC Acid Accession. #; NK_006 632.1 

coding sequence; 377.«15B2 

50 1 11 21 31 41 51 

] I ! I I I 

ACGCGTCCGC OCAGGCGTCC GCCCACGCGT CGGGTCGGGG GCAGAGCGCA QGTGTACCTG 60 

GCGGCCGTGC TGGAGCACCT GACOGCCGAG ATCCTOOAGC TGGCTGGCAA CGCGGCCCGC 120 

GACAAGAAGA CCCQCATCAT CCTGCGCCAC CTGTAGCTGG CCATTCGCAA OGGCGAGGAG 180 

55 CXTAACAAGG TGCTGGGCGA AGTCAOCATC GCGCAGGGCG GTGTCCTGCC CAACATTCAG 240 

GGCGTGCTTC TGCCCCAGAA GACCAAGAGC CACCACAAGG CCAAGGGTGA AAACCATTCA 300 

CTAOGAGAGG AGAAACACAA TGGCCACCAA GACAGAGTTG AGTCCCACAG CAAGGGAGAG 360 

CAAGAACGCA CAAGATATGC AAGTGGATGA GACACTGATC CCCAGGAAAG GTCCAAGTTT 420 

ATGTTCTGCr CGCTATGGAA TAGCCCTCGT CTTACATTTC TGCAATTTCA CAACGATAGC 480 

60 ACAAAATGTC ATCATGAACA TCACCATGGT AGCCATGGTC AACAGCACAA GCCCXCAATC 540 

CCAGCTCAAT GATTCCTCTG AGGTGCTGCC TOTTGACTCA TTTGGXGGCC TAAGTAAAGC S00 

CCCAAAGAGT CTTCCTGCAA AGTCCTCAAT ACTTGGGGGTf CAGTTTGCAA TTTGGGAAAA 660 

GTGGGGCCCT CCACAAGAAC GAAGCAGACT CIGCAGCATT GCTTTATCAG GAATGTTACT 720 

gggatgcttt actgccatcc tcrtaggtcg cttcatiagt gaaacocttg GGTGGCCCTT 7 bo 

65 TGTCTTCTAT ATCTTTGGAG GTGTTGGCTG TGTCTGCTGC CTTCTCTGGT TTGTTGTGAT 840 

TTATGATGAC OCCTTTTCCT ATCCATGGAT AAGCACCTCA GAAAAAGAAT ACATCATATC 900 

CTCCTTGAAA CAACAGGTCG GGTCTTCTAA GCAGCCTCTT OCCATCARAG CTATGCTCAG 960 

ATCTCTACCC ATTTGGTCCA TATCTXTAGG CTGTTTCAGC CATCAATGGT TAGTTAGCAC 1020 

AATGGTTGTA TACATACCAA CTTACATCAG CTCTGTGTAC CATGTTAACA TCAQAGACAA 1080 

70 TGGACTTCTA TCTGCCCTTC CTTTTATflGT TGCCTGGGTC ATAGGCATGG 1GGGAGGCTA 1140 

TC1GGCAGAT TTCCTXCTAA OCJAAAAAGTT TAGACTCATC ACTGTGAGGA AAATTGCCAG 1200 

AATTTTAGGA AGTCTCCCCT CTTCAI3CACT CATTGTGTCT CTGCCTTACC TCAATTCCGO 1260 

CTATATCACA GCAACTGOCT TGCTGACGCT CTCTTGCGGA TTAAGCACAT TGTGTCftGTC 1320 

AGGGAT5TAT ATCAATGTCT TAGATATTGC TOCAAGGTAT TCCAQTTTTC TCATGGGAGC 13 BO 

75 ATCAAGAGGA TTTTOGAGCA TAGCACCTGT CATTGTACCC AJCTGTCAGCG GATTTCTTCT 1440 

TAGTCAGGAC CCTGAGTTTG GGTGGAGGAA TGTCTTCXTC TTGCTGTTTG OCGTTAACCT 1500 

GTTAGGACXA CTCTTCTACC 1TCATATTTGG AGAAGCAGAT GTCCAAGAAT GGQCTAAAGA IS 60 

GAGAAAACTC ACTCGTTTAT GAAGTTATCC CAOCTTGGAT GGAAAAGTCA TTAGGCACCG 1620 

TATTGCATAA AATAGAAGGC TTCCGTGATG AAAATACCAG TGAAAAGATT TTTTTTTCCT 1680 

80 GTGGCTCTTT TCAATTATGA GATCAGTTCA TTATTTTATT CAGACTTTTT TTTGAGAGAA 1740 
ATGTAAGATG AATAAAAATT CAAATAAAAT GATAACTAAG AAAAAAAAAA AAAAA 

Seq ID KOi 298 Protein Bequence 
Protein Accession ft: WP_006623 .1 



1067 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



1 

! 

MQVDETIilPR 
SEVLPVDSFG 
IIjIGGFISBT 
VGSSKQPLPI 
LrPFIVAWVlG 
ALLTIiSCGLS 
FGWRNVFFLL 



11 
I 

KGPSLCSARY 
GLSKAPKSIiP 
LGWPFVFYlF 
KAKLRSLPJW 
MVGQYIADFIi 
TLCQSGIYXN 
FAVNIiLQIiLF 



21 

i 

GIALVLBFCN 
AKSSILGGQF 
GGVGCVCCli 
SICIX3CFSHQ 
I»TKKFEIiITV 
VLDIAPRYSS 
Y1*IFGEABVQ 



31 
I 

FTTJAQNVIM 
AIWEKHGPPQ 
WFWIYDDFF 
W1>VSTKWYI 
RKIATILGSI* 
FLMGASRGFS 



41 
I 

NITMVAMVNS 
ERSRIiCSIAI. 
SYPWISTSEK 
PTYISSVYHV 
FSSAI«IVSXiP 
SIAPVIVPTV 
L 



51 
I 

TSPQSQLNDS 
SGMLLGCFTA 
EYIISSLKQQ 
NIRDNGIiLSA 
YLN SGY ITAT 
SGFLLSQDPE 



GGQTTTGTGC TQAGCTGGCT 
TCGGGCCCTC CIBCC7GCM3 
GGGGAGTTTC ACTTTTTOCA 
TTCGCGCCCA TCTACGTGGG 



Seq ID NO: 299 DMA sequence 
Nucleic Acid Accession #: NMJ)030S8,1 
Coding sequence; 145 - .1012 

1 11 21 31 41 SI 

1 1 I I I I 

TCACTTGCAG AQGTAAACTC 

GCAGCCGCTC TCAGCCTCGC TCOGGGCACG TCGGGCAGCC 
GATCATGCCC ACCACCGTQG AC6ATGTCCT GGAGCATGGA 
GAAGCAAATG TTTTTCCTCT TGGCTCTGCT CTCGGCTACC 
CATDGTCTTC CTGGGCTTCA CCOCTGACCA CCGCTGCCGG 
GAGTCTGCGC TGCGGCTGGA GTCCTGCAGA GGAACTGAAC 
TACACGGTGC CGGGCCCAGG AOCTGCQOGC GAAGCCICCC CAAGACAGTG TAGGCGCTAC 
GAOQTGGACT GGAACCAGAG CACCTTTGAC TGOGTGGACC OCCTGGGCAG CCTGGACACC 
AACAGGAGCC GCCTGCCACT GGGCCCCTGC CGGGACGGCX GGGTGTACGA GACGCCTQGC 
TCGTCCATCG TCACCOAGTT TAACCTGGTA TGTGCCAACT CCTGGATGTT GG&CCTATTC 
CAGTCATCAG TGAATGTAGG ATTCTTTATT GGCTCTATGA GTATCGGCTA CATAGCAGAC 

AGGrrrGGCC gtaagctctg cctcctaact acaqtcctca taaatgctgc agctggagtt 

CTCATGGCCA TTTCCCCAAC CTATACGTGG ATGTTAATTT TTCGCTTAAT CCAAGGACTG 
GTCAGCAAAG CAGGCTGGTT AATAGGCTAC ATCCTGATTA CAGAATTTOT TGQQCQQAGA 
TATCQGAGAA CAGTGGGGAT TTTTTACCAA GTTGCCTATA CAGTTGGGCT CCTGGTGCTA 
GCIGGGGTGG CTTACGCACT TCCTCACTGG AGGTGGTTGC AGTTCACAGT TGCTCTGCCC 
AACTTCTTCT TCTTGCTCTA TTACTGGTGC ATACCTGAGT CTCCCAGGTG GCTGATCTCC 
CAGAATAAGA ATGCTGAAGC CATGAGAATC ATTAAGCACA TCGCAAAOAA AAATGGAAAA 
TCTCTACCCG CCTCCCTTCA GCGCCTGAGA CTTGAAGAGG AAACTGGCAA GAAATTGAAC 
CCTTCATTTC TTGACTTOGT CAGAACTCCT CAGATAAGGA AACATACTAT GATATTQATG 
TACAACTGGT TCACGAGCTC TGTGCTCTAC CAGGGCCTCA TCATGCACAT GGGCGTTGCA 
GGTGACAATA TCTACCTGGA TTTCTTCTAC TCTGCCCTGO TTGAATTOCC AGCTGOCTTC 
ATOATCATCC TCACCATCGA CCGCATCGGA CGCCGTTACC CTTGGGCTGC ATCAAATATG 
GTTGCAGGGG CAGCCTGTCT GGCCTCAGTT TTTATACCTQ GTGATCTACA ATGGCTAAAA 
ATTATTATCT CATGCTTGGG AAGAATGGGG ATCACAATGG CCTATGAGAT AGTCTGCCTG 
GTCAATGCTG AGCTGTACCC CACATTCATT AGOAATCTTG QCGTCCACAT CTGTTCCTCA 
ATGTGTGACA TTGGTGGCAT CATCACGCCA TTCCTGGTCT ACCGGCTCAC TAACATCTGG 
CTlGBfiCTCC CQCTGATGGT TTTCGGCGTA CTTGGCTTGG TTGCTGGAGG TCTGGTGCTG 
TTGCTTCCAG AAACTAAAGG GAAAGCTTTG GCTGAGAGCA TCGAGGAAGC CGAAAATATG 
CAAAGACCAA GAAAAAATAA AGAAAAQATG ATTTACCTCC AAGTTCAQAA ACTAGACATT 
CCATTGAACT AAGAAGAGAG ACCGTTGCTG CTGTCATGAC CTAGCTTTGA TGGCAGCAAG 
ACCAAAAGTA GAAATCCCTG CACTCATCAC AAAGCCCATA CAACTCAACC AAACTTAOCC 
CTGAGCCCTA TCAAOCTAGG TCTACAGCCA GTGGAGTCTA TTGTACACTG TGGAAAAATA 
CCCATGGGAC CAGATCCTGC CAAATTCTTC CAGCTCACTT TATTCTCAQC ATTCCTAGGA 
CATTGGACAT TGGTTTTCTG GAGGGTTTTT TTTCCGATCT TTGTA3TTTT TTAAATTTGA 
TTCTTTTCTT TGCAATGCTA GCAACCAGAA TACATAGGGG 
AAATAGAAAA AGTGTOAAAA ACAGTAAAGT TGGGAGAGGA 
ATAAAACACC NRAAACAAAA AAAAAAAAAA AAAAAAA 



GCATCTATTT TCTTAAAGAA 



Seq ID NO: 300 Protein sequence 
Protein Access ion fli NF_C03Q49 . 1 



1 

! 

MPTTVBDVLE 
LRCGWSFAEE 
FCRDGWVYET 
LTTVLIKAAA 
YQVAYTVGLL 
RIIKHIAKKN 
IiTOJIflWHMG 
SVF1PGDCLOW 
TPFIiVYRIiTH 
KMXYIiQVOJKli 



11 
1 

HGGEFHFFQK 
LMYTVPGPGP 
PGSSIVTEFN 
GVLMAI3PTY 
VTAGVAYALP 
GKSliPASLQR 
LAGDNIYLDF 
LKIIISCLGR 
IWLELPXiHVF 



21 
I 

OMFFIiIiALLS 
AGEAaPRQCR 
LVCANSWMLO 
TWHLlFRIilD 
HWKWLQFTVA 
LRTiKKBTGKK 
FYSAliVEFPA 
WQITMAYEIV 
GVLGLVAGGI. 



31 
1 

ATFAP1YVG1 
RYEVPWNO;J3T 
ZrFQSSVNVGF 
Gr*V3KA0HI,X 
LPKFFFLLYY 
liNPSFJJIIiVR 
ATM I I LT IDS 
CLVHAELYPT 
VT.T.T.PSTKQK 



41 

I 

VFIjGFTFDHR 
FDCVDPLASL 
FXGSMSXGYI 
GYIIjITEFVG 
WGIPESPRWL 
TPQIBKHTHI 
LGRRYPWAAfi 
FIRNLGVHIC 



Seq ZD NO i 301 DMA sequence 

Nucleic Acid Accession #: KM_D12206.1 

Coding sequence: 52-- 1131 



1 
I 

GTXACCCAGC 
CAAGTGGTCA 
GTTGGTGGAG 
TCAATGTGCT 
ACCAATGGAA 
*TCAAGAAGGG 



11 
I 

ATTGTGAGTG 
TCTTAAGCCT 
AGGCAGGXCC 
GGAATAGAGG 
CCCACC3TCAC 
ATGTCTCTTT 



21 
I 

ACAGAGCCTG 
CAtCCTACAT 
ATCTOTCACA 
CTCATGTTCT 
CTATCGGAAG 
GACCATAGAA 



31 
I 

GATCTGAACG 
CTGGCAGATT 
CTACCCTGCC 
CTATTCACAT 
GACACACGCT 
AATACAGCTG 



60 
120 
ISO 
240 
300 
360 



GO 
120 

ieo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
19 B0 
2040 
2100 
2160 
2220 



51 
1 

CRSPOVAEIiS 
DTWRSRIxPLG 
ADRFGRKLCIi 
BHYRRTVGIF 
I8QNKNABAM 
liMYNKFTSSV 
NNVAGAAJCLA 
8SMCDIGGII 



60 
120 
180 
240 
300 
360 
420 
480 
540 



41 51 

1 I 

CTGATCCCAT AATGCATCCT 
CTGTAGCtGG TTCTGTAAAG 
ACTACAQTGG AGCTGTCACA 
GCCAAAATGG CATTGTCTGG 
AXAAGCTATT GGGGGACCTT 
TGTCTGACAG TGGCGTATAT 

1068 



60 
120 
LB0 
240 
300 
360 



WO 03/042661 PCT/US02/36810 



TOTTOCCGTG TTGAGCACOG TGGGTGGTTC AATGACATGA AAATCACCGT ATCATTGGAG 420 

ATTGTGCCAC CCAAGGTCAC G AC TACT CCA ATTGTCACAA CTGTTCCAAC OGTCACGACT 400 

GTTCGAACGA GCACCACTGT TCCAACGACA ACGACTGTTC CAACGACAAC TX3TTCCAACA S40 

ACAATGAGCA TTCCAACGAC AACGACTGTT CCGACGACAA TGACTGTTTC AACGACAACG 600 

5 AGCXJTTCCRA CGACAACGAG CATTCCAACA ACAACAAGTG TTCCAGTGAC AACAACGGTC 660 

TCTACCTTTG TTCCTCCAAT GCCTTTGCOC AGGCAGAACC ATGAACCAGT AGCCACTTCA 720 

CCATCTTCAC CTCAGCCAGC AGAAACCCAC CCTACGACAC TOCAGGGAGC AATAAGGAGA 7S0 

GAACCCACCA GCTCACCATT GTACTCTTAC ACAACAGATG GGAATGACAC CGTGACAGAfi 840 

TCTTCAGATG GCCTTTGGAA TAACaATCAA ACTCAACTGT TCCTAGAACA TAGTCTACTG 90 0 

10 ACGOC CARTA CCACTAAAGG AATCTATGCT GGAGTCTGTA TTTCTGTCTT GGTGCTTCTT 960 

GCTCTTTTGG GTGT CAT CAT TGCCAAAAAG TATTTCTTCA AAAAGGAGQT TCAACAACTA 1020 

AGTGTTTCAT TTAGCAGOCT TCAAATTAAA GCTTTGCAAA ATGCAGTTGA AAAGGAAGTC 10B0 

CAAGCAGAAG ACAATAT CT A CATTGAGAAT AGTCTTTATG OCACGGACTA AOAGCCAGTG 1140 

. GTGCTCTTTG AGAGTTTACG CC CATOACT □ CAGAAGACTG AACAGGTATC AGCACATCAG 1200 

15 ATGTCTTTTA GACTCCAAGA CAATTT TTCT GTTTCAOTTT CATCTGGCAT TCCAACATGT 1260 

CAGTQATACT GGGTAGAGTA ACTCTCCCAC TCCAAACTGT GTATAGTCAA CCTCATCATT 1320 

AATGTAGTCC TAATTTGTTT TGCTAAAACT GGCTCAATCC TTCTGATCAT TGCAQAGTTT 1390 
TCTCTCAAAE ATGAACACTT TAGAATTQTA TGTTCTCTTT ACACCCCATA AATCCTGTAT 

20 8eg ID KO= 302 Protein sequence 
Protein Accession #: NP_036338.1 

1 11 21 31 41 51 

ZD KHPQfWIIiSIi TliHLADSVAG SVKVGGEAGP SVTLPCHYSG AVTSMCWNRG 5CSLFTCQNG 60 

IVWTNGTHVT YKKDTRYKU* GDliSRRDVSL tientavsds gvyccrvekr gwpndmkitv 120 

SLEIVPEKVT TTPIVTTVPT VTTVRTSTTV STXTTVPTTT VFTTMSIFTT TTVPTTMTVS 1B0 

TTTSVPTTT6 IPTTT5VPVT TTVSTFVPPM PI»PHQHHEPV ATSPS8PQPA BTHPTTLQGA 240 

IRREPTSSEI* YSYTTDGNDT VTESSDQIjWN KNQTQLFXJEH SKLTANTTKG IYAGVCI3VI* 300 

30 VIiIAI*LGVII AKKYFFKKEV QQLSVSFSST* QIKALQHAVE KEVQAEDNITf lEUSLVATD 



Seq ID NO: 303 DMA aecpience 
Nucleic Acid Accession # - HI 
Coding sequences 129. .1991 



001044,1 



35 

1 11 21 31 41 51 

111(11 

ACCGCTCCGG AGCGGGAGGG GAGGCTTCGC GGAACGCTCT CGGCGCCAOG ACTCGCGTGC 60 

AAAGCCCAGG CCCGGGCGGC CAGACCAAGA GGGAAGAAGC ACAGAATTCC TCAACTCCCA 120 

40 GTGTGCCCAT GAGTAAGAGC AAATGCTCOG TGQOACTCAT GTCTTCCGTG GTGGOCCCGG 180 

CTAAGGAGCC CAATGCCGTG GGCCCQAAGG AGGTQGAGCT CATCCTTGTC AAGGAGCAGA 240 

ACGGAGTGCA GCTCACCAQC TCCACCCTCA CCAACCCGCG GCAGAGCCCC tJTQQBJSGQCC 300 

AGOATCGGGA GACCTGGGGC AAGAAQATCG ACTTTCTCCT GTCCGTCATT GGCTTTGCTG 360 

TGGACCTGGC CAACGTCTGG CGGTTCCCCT ACCTGTGCTA CAAAAATC3GT GGCGGTGCCT 420 

45 TCCTOaTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCCACTTTTC TACATGGAGC 490 

TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG CCGCTGSTGT CTGGAAGATC TGCCCCATAC 540 

TQAAAGQTGT GGGCTTCACG GTCATCCTCA TCTCACTGTA TGTO30CTTC TTCTACAACO 600 

TCATCATCGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT CACCACGGAQ CTCCCCTQGA 660 

TCCACTQCAA CAACTCCTGG AACAGCCCCA ACTGCTCGGA TGCCCATCCT GGTGACTCCA 720 

50 GTGGAGACAG CTCGGGCCTC AACGACACTT TTGGGACCAC ACCTGCTGCC GAGTACTTTG 790 

AACGTGGCGT GCTGCACCTC CACCAQAGCC ATGGCATCGA DGACCTGGGG OCTCCGCGOT 840 

GGCAGCTCAC AGCCTQCCTO GTGCTGGTCA TCGTGCTGCT CTACTTCW3C CTCTGGAAGG 900 

GCGTGAAGAC CTCAGGGAAG GTGGTATGGA TCACAGCCAC CATGCCATAC GTGGTCCTCA £60 

CTQCOC3QCT CCTGCGTGGG GTCACCCTCC CTGGAGCCAT AGACGGCATC AGAGCATACC 1020 

55 TGAGCGTTGA CTTCTACCGG CTCTQCGAGG CGTCTGTTTG GATTGACGCG GCCACCCAGG 1080 

TOXGCTTCTC CCTGGGOGTG GGGTTCGGGG TGCTGATCGC CTTCTCCAGC TACAACAAGT 1140 

TCACCAACAA CTGCTACAGG QACGCGATTG TCACCACCTC CATCAACTCC CTGAOOAGCT 1200 

TCTCCTCCQG CTTCGTCGTC TTCTOCTTCC TGGGGIACAT GGCACAGAAG CACAGTGTGC 1260 

CCATCGGGGA CGTGGCCAAQ GACGGGCCAG GGCTGATCTT CATCATCTAC CCGGAAGCCA 1320 

60 TCGCCACGCT CCCTCTGTCC TCAGCCTGGG CCGTGGTCTT CTTCATCATG CTGCTCACCC 1300 

TGGGTATCGA CAaCGCCATG GGTGGTATGG AGTCAGTGAT CACCGGGCTC ATCGATGAGT 1440 

TCCAGCTGCT GCACAGACAC CQTGAGCTCT TCACGCTCTT CATCGTCCTG GCGACCTTCC 1500 

TCCTGTCCCT GTTCTGCGTC ACCAA03GTG GCATCTAOGT CTTCACGCTC CTGGACCATT 1560 

TTGCAGCOGG CACGTCCATC CTCTTTGGAG TGCTCATCGA AGCCATCGGA GTGQCCTGGT 1620 

65 TCTATGOTGT TGGGCAGTTC AGCGAdOACA TCCAGCAGAT GACCSGGCAG CGGCCCAGCC 1680 

TGTACTGGCG GCTOGK3CTGG AAGCTGGTCA GCCCCTQCTT TCTCCTGTTC GTGGTCGTGG 1740 

TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA CATCTTCCCC GACTGGGCCA 1B00 

ACGCGCTGGG CTGGGTCATC GCCACATCCT CCATGGCCAT GGTGCCCATC TATG0GQCCT 1860 

ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAJSAAACT GGCCTACGCC AT TGCACCCG 1920 

70 AGAAGGACCG TGAGCTGGTG 0ACAGAGGGG AGGTGCGCCA GTTCACGCTC CGCCACTGC3C 1980 

TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT CATCCTGCAA TGGGAGAGAC 2040 

ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAQ GGCAACTTCT ACTCTTCAAC 2100 

CTCTACTGAA AACACAAACA AGAAAGCAGA AC3ACTCCTCT CTTCTGACTG TTTACACCTT 2160 

TCCGTGCCGG GAGCGCACCT CX3CG&TGTCT XGTGTTGCXQ TAATAACGAC GTAGAT CTGT 2220 

75 OCAGCGAGOT CCACCCCGTT GTTGTCCCTG CAGGGdAGAA AAACGTCTAA CTTCATGCTG 2280 

TCTGTGTGAG GCTOOCTCCC TCCCTGCTCC CTGCTCCCGa CTCTGAGGCT GCCCCAGGGG 2340 

CACTGTGTTC TCAGGCGGGG ATCACGATCC TT3TAGACGC ACCTGCTQAG AATCCCCGTG 2400 

CTCACA6TAG CTTCCTAGAC CATTTACITT GCCCATATTA AAAAGCCAAG TGTCCTQCTT 2460 

GGTTTAGCTG TGCAGAAGGT GAAATGGAGG AAACCACAAA TTCATGCAAA GTCCTTTCCC 2528 

80 GATjGCGTGGC TCCCAOCAOA GGCCGTAAAT TGAGOGTTCA GTTOACACAT TGCACACACA 2S80 

GTCTGTTCAG AGGCATTGGA GGATGGGGQT CCTGGTATGT CTCACCAGGA AATTCTGTTT 264 O 

ATGTTCTTGC AGCAGAOAGA AATAAAACTC CTTGAAACCA GCTCAGGCTA CTGCCACTCA 2700 

GGCAGCCTGT GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG AGGAGCGTGT CCTATCCCCa 2760 

GACGCATQCA GGGCCCOCAC AGGAGCGTGT CCTATCCCCQ GACGCATGCA GGGCCCCCAC 282 0 

1069 



WO 03/042661 



PCT/US02/3681© 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



AGGAGCATGT 
AACGCATGCA 
TGGAGCGTGT 
GACCGGACGC 
CCCACAGGAG 
CCCAGGACGC 
TGAGCCGTGA 
CATCAATAAC 
ATCAAAACAA 
TTTACAAGAA 
CACACTGCCC 
TOGGTGGGCC 
TCCCCTGCAA 
GCCTGTGAAC 
GGGAGGGACA 
GGGGCTTCCC 
GTTGTTGAAG 
AOCAAOCCAG 
TAAGCACAAT 



CCTATCCCTG 
GGGCCCCCAC 
ACTACCCCAG 
ATGCAGGGCC 
CGTGTACTAC 
ATGCAGGGCC 
CCTCCAGGAA 
AACAGTTTTT 
ATTCAAGAAT 
TAATTAGCAA 
TCTGCCACTG 
TGCCTACXSIG 
GTGGACGTGG 
TGCCAGGCAG 
CAOAGGACGG 
CATTGCCTTC 
ACAGCACAGA 



GACGCATGCA 
AGGAGCGTGT 
GACGCATGCA 
OCCACAGGAG 
CCCAGGATGC 
CCXZATGCAGG 
GGGACCCCAC 
ATGTTTGCGA 
GCAGTATCCG 
TACTGAGTGA 
ACAGGAAAGT 
CTGCCCGAGG 
GCTCCAGGGA 
CTGCAGTTAG 
CTTCCCCATC 
TGGGGAGGGA 
GAGCGGCTTC 



AAAAGACATC CACAATGGAA 



GGGCCCCCAC 
ACTACCCCAa 
GGGCCCCCAC 
CGTGTACTAC 
ATGCAGGGCC 
CAGCCTGCAG 
TGGAATTTTA 
ATGGCTTTTT 
CGAGCCTGCT 
AGGATGTTGG 
GGATGCCATA 
GCAGGGGCCG 
CTGGAGTGrA 
CACAGAGGAT 
GCCTTCTGGC 
CACAGAGGAC 
CCCATCGCCT 
CCAATCTCTA 
AAAAAAAAAG 



AGGAGCGTGT 
GACGCATGCA 
AGGAGCGTGT 
CCCAGGAOGC 
CCCACAGGAG 
ACCAACACTC 
TTTCTCTCAG 
AAAATCATAT 
TGCTGATATT 
CCAAAAGCTG 
GTTTGAATTC 
TGCAGGGCCA 
ATGCTCGGTG 
GGCTTCCCCA 
CGCTGCAGTC 
AGTTTCCCCA 
TCTGGGGAGG 
TTCAGCATCG 
GAATTC 



ACTACCCCAG 
GGGCCCCCAC 
CCTATCCCCG 
ATGCAGGGCC 
CGTGTACTAC 
TGCCTGGCCT 
GTGCGTGCCA 
TTACCTGTGA 
GCAGTTTTTG 
CTTTCCATGG 
ATGCCTCAAG 
GTCATGGCTG 
GGAGCCGTCA 
TTGCCTTCTG 
AGCACAGAGA 
TCGCCTTCTG 
GGCTCCGTGT 
TGTGGGTCCC 



Seq ID NO i 3 04 Protein sequence 
Protein Accession ft* NP 001035-1 



fcSKSKCSVGL 
ETVTGKKIDFL 
GQFNREGAAG 
NNSHNSPNCS 
T AdiVLV TVli 
DPYHIiCEASV 
GFWP5FLGY 
DSAMGGMESV 
GTGILFGVIiI 
VTFRPPHYGA 
RELVERGEVR 



11 
I 

MSSWAFAKE 
L8VIGPAV0L 
VWKICPILKO 



IiYFSliWKGVK 
WIDAATQVCF 
MAQKHSVPIG 
ITGLIDEFQL 
EAIGVAWFYQ 
YIFPDWANAL 
OFTliRHWLfKV 



21 
I 

PKAVGPKEVB 
ANVWRFPYLC 
VGPTVUilSL 
SSGLTJDTFGT 
TSGKWWITA 
SIiGVGFGVIiX 
DVAKDGPGLI 
LGRHRSLiFTI* 
VGQFSDDIQQ 
GWVIATSSMA 



31 
I 

LILVKBOWGV 
YKNGGGAFLV 
YVGFFYNVII 
TPAAEYFERG 
TMPYWLTAL 
AF355YMKFTN 
FIXYPEAlAT 
FIVIATPLIiS 
MTGQKPSLYW 
KVPIYAAYKF 



41 

1 

QLTSSTLTN? 
PYXJ.FMVTAG 
AWALHYLFSS 
VIiHIjHQSflai 
LTiRGVTLPGA 
NCYHDAIVTT 
LPLSSAWAW 
LPCVTNGGIY 
RLCWKLVSPC 



51 
I 

RQSPVEAQDR 
MPLFYKELAli 
FTTELPWIHC 
DDDGPPRWQIj 
IDGIHAYIjSV 
3INSLT3FSS 
PPIMLLTJjGI 
VPTUJQHFAA 
PIiLPWWSI 
IAYAIAPEKD 



Seq ID NO? 305 DWA sequence 

Nucleic Acid Accession #* NMJ> 01216.1 

Coding sequence i 43 - * 1422 



1 
1 

GCCCGTACAC 
AGCCCCTGGC 
CTGTCACTGC 
TCCCCCTTGG 
AGTGAAGAGG 
GAGGATCTAC 
TGCCTGAAGT 
AATAATGCCC 
CCGCCCTGGC 
CGCCCCCAGC 
CTCOCGCCGC 
CCTCCTGGGC 
CTGCACTGGG 
CCTGCCGAGA 
GGGCGCCCGG 
AGTGCCTATG 
CAGGTCCCAG 
TATGAGGGGT 
CAGACAGTGA 
GGTGACTCTC 
GAGGCCTCCT 
AATTCCTGCC 
ACCAGCGTCG 
GTGAGCTACC 
TGTGAGAAGC 
ATGCCACTTC 



11 
1 

ACCGTGTGCT 
TCCCTCTGTT 
TGCTTCTGAT 
GAGGAGGCTC 
ATTCACCCAG 
CTGGAGAGGA 
TAGAGGATCT 
ACAGGGACAA 
CCCGGGTGTC 
TCGCOGCCTT 
TCCCAGAACT 
TAGAQATGGC 
GGGCTGCAGG 
TCCACGTGGT 
GAGGCCTGGC 
AGCAGTTGCT 
GACTGGACAT 
CTCTGACTAC 
TGCTGAGTGC 
GGCTACAGCT 
TCCCTGCTOa 
TGGCTGCTGG 
CGTTCCTTGT 
GCCCAGCAGA 
CAGCCAGAGG 
CTTTTAACTG 



21 
I 

GOGACACCCC 
GATCCCGGCC 
GCCTGTCCAT 
TTCTGGGGAA 
AGAGGAGGAT 
GGATCTACCT 
ACCTACTGTT 
AGAAGGGGAT 
CCCAGCCTGC 
CTGCCCGGCC 
GCGCCTGCGC 
TCTGGGTCCC 
TCGTCCGGGC 
TCACCTCAGC 
CGTGTTGGCC 
GTCTCGCTTG 
ATCTGCACTC 
ACCGCCCTGT 
TAAGCAGCTC 
GAACTTCCGA 
AGTGGACAGC 
TGACATCCTA 
GCAGATGAGA 
GGTAGCCGAG 
CATCT0AGGG 
CCAAGAAATT 



31 
I 

ACAGTCAGCC 
CCTGCTCCAG 
CCCCAGAGGT 
GATGACCCAC 
CCACCCGGAG 
GAAGTTAAGC 
GAGGCTOCTG 
GACCAGAGTC 
GCGGGCCGCT 
CTGCGCCCCC 
AACAATGGCC 
GGGCGGGAGT 
TCGGAGCACA 
ACCGCCTTTG 
GCCTTTCTGG 
GAAGAAATCG 
CTGCCCTCTG 
GCCCAGGGTG 
CACACCCTCT 
GCGACGCAGC 
AGTCcrCGGG 
GCCCTGGTTT 
AGGCA0CACA 
ACTGGAGCCT 



TTTTAAAATA 



41 
I 

GCATGGCTCC 
GCCTCACTGT 
TGCCCCGGAT 
TGGGCGAGGA 
AGGAGGATCT 
CTAAATCAGA 
GAGATCCTCA 
ATTGGOGCTA 
TCCAGTCCCC 
TGGAACTCCT 
ACAGTGTGCA 
ACCGGGCTCT 
CTGTGGAAGG 
CCAGAGTTGA 
AGGAGGGCCC 
CTGAGGAAGG 
ACTTCAGCCG 
TCATCTGGAC 
CTGACACCCT 
CTTTGAATGG 
CTGCTGAGCC 
TTGGCCTCCT 
GAAGGGGAAC 
AGAGGCTGGA 
ACTGTOCTGT 
AATATTTATA 



2080 

2 940 
3000 

3D60 

3120 
3130 
3240 
3300 
3360 
3420 
3480 
3540 
360D 
3660 
3720 
3780 

3 840 
3900 



51 
I 

CCTGTGCCCC 
GCAACTGCTG 
GCAGGAGGAT 
GGATCTGCCC 
ACCTGGAGAG 
AGAAGAGGGC 
AGAACCCCAG 
TGGAGGCGAC 
GGTGGATATC 
GGGCTTCCAG 
ACTGACCCTG 
GCAGCTGCAT 
CCACCGTTTC 
CQAGGCCTTG 
GGAAGAAAAC 
CTCAGAGACT 
CTACTTCCAA 
TGTGTTTAAC 
GTGGGGACCT 
GCGAGTGATT 
AGTCCAGCTG 
TTTTGCTGTC 
CAAAGGGGGT 
TCTTGGAGAA 
CCTGCTCATT 
AT 



Seq ID NO* 306 Protein sequence 
Protein Accession #? NP_O012O7.1 



1 11 

I 1 
MAPLCPSPKD PLDIPAPAPG 



21 
I 

LTVQIALSLL 



31 41 

t I 

LLMPVEPQRIi PRMQED9PLVG 



DPQEPQNNAH RDKEGDDQSH 
EUJ3FQLPPL PEJjRLRNNGH 
VEGHRPPABI HWHLSTAFA 
BBG5ETQVPG LDISALI>PSD 
DTIiHGPGDSR LQLNFRATQP 
GLIiPAVTSVA FLVQMRRQHR 



WRYGGDPPWP 
SVQL/T LPP G£i 
RVPEAIiGRPB 
FSRYFQYEGS 
l^NGKVIEASF 
RGTKGGV3YR 



RVSPACAGRF QSPVD1RPQL 
EMALGPGREY RALQLHLHWG 
GIiAVLiAAFLiE EGPEENSAYE 
liTTPPCAQGV IWTVPNQTVW 
PAGVDfi SPRA ABPVQL>NSCIi 



60 
120 

ieo 

240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1X40 
1200 
1260 
1320 
1380 
1440 
1500 



EDLPTVEAPG 
AAFCPALRPL 
AAGRPGSEHT 
QIiIjSRIiEEIA 
LfSATCQLETLB 
AAQDILAIjVF 



60 
120 
180 
240 
300 
360 
420 



1070 



WO 03/042661 



PCTAJS02/36810 



Seq ID NO: 307 DMA Hecjuence 

Nucleic Acid Accession #: NM_00303 9.l 

Coding sequencer 76.. 1581 

5 1 11 21 31 41 51 

1 I I I I 1 

CITCTCTCTC CATTCAGTGC ACGQ3T TACT TTGGCTAAAA GGAGGTGAGC GGCACTCTGC GO 

CCTTCCAGAG CAAGCATGGA GCAACAGQAT CAGAOCATGA AGGAAGGGAG GCTGACGCIT 120 

GTGCTTGCCC TGGCAACCCT GATAGCTGCC TTTGGGTCAT CCTTCCAGTA TGGGTACAAC 160 

10 GrrGGCTGCTG TCAACTCCCC AGCACTGCTC ATGCAACAAT TTTACAATGA GACTTACTAT 240 

GGTAGGACCG GTGAATTCAT GGAAGACTTC CCCTTGACGT TGCTGTGGTC TGTAACCGTG 300 

TCCATGTTTC CATTTGGftGG GTTTATCGGA TCCCTCCTGG TCGGCCCCTT GGTGAATAAA 360 

TTTGGCAGAA AAGGGGCCTT GCTGTTCAAC AACAIATTTT CTATCGTGCC TGCGATCTTA 420 

ATGGGATGCA GCAGAGTCGC CACATCATTT GAGCTTATCA TTATTTCCAG ACTTTTGGTG 480 

15 GGAATATGTG CAGGTGTATC TTCCAACGTG GTOCCCATGT ACTTAGGGGA GCTGGCCCCT 540 

AAAAACCTGC GGGGGCCTCT CGQGGTGOTG CCCCAGCTCT TCATCACTGT TGGCATCCTT 600 

GTGGCCCAGA TCTTTGGTCT TCGGAATCTC CTTGCAAAOG TAGATGGCTG GCCGATOCTG 660 

CTGGGGCTGA CCGGGGTCCC CGCGGCGCTG CAGCTCCTTC TGCTGCCCTT CTTCCC03AG 720 

AGCCCCAGST ACCTGCTGAT TCAGAAGAAA GACGAAGCGG CCGCCAAGAA AGCCCTACAG 780 

20 ACGCTGCGCG GCTGGGACTG TGTGGACAGG GAOGTGGCCG AGATCCGGCA GGAGGATGAG 840 

GCAGAGAAGG COGCGOGCTT CATCTCCGTG CTGAAGCTGT TCOGGATGCG CTCGCTGCGC 900 

TGGCAGCTGC TGTCCATCAT CGTCCTCATG GGCGGCCAGC AGCTGTCGGG 0GTCAAOGCT 960 

ATCTACTACT ACZGCX3GACXZA GATCTACCTG AGCGCCGGC3 TGCCGGAGGA GCACGTGCAG 1020 

TACGTGAGGG CCGGCACCGG GGCOQTGAAC GTGGTCATGA CCTTCTGCGC CGTGTTCOTG 1080 

Z5 OT3QAGCTCC TGGGTGGGAG GCTGCTGCTG CTGCTGGGCT TCTCCATCTG CCTCATAGCC 1140 

TGCTGCGTGC TCACTGCAGC TCTOGCACTG CAGGACACAG TGTCCTGGAT GCCATACATC 1200 

AQCATCGTCT GTGTCATCTC CTACGTCATA GOACATGCCC TCGGGCCCAG TCCCATACCC 1260 

GOSCTGCTCA TCACTOAGAT CTTCCTGCAG TCCTCTOGGC CATCTGCCTT CATGGTGGGG 1320 

GGCAGTGTGC ACTGGCTCTC CAACTTCACC GTGGGCTTGA TCTTCCCDTT CATCCAGOAG 13 B0 

30 GGCCTCGGCC CGTACAGCTT CATTGTCTTC GCCGTGATCT GCCTCCTCAC CACCATCTAC 1440 

ATCTTCTTGA TTGTCCCGGA GACCAAGQCC AAGACGTTCA TAGAGATCAA CCAGATTTTC 1500 

ACCAAGATGA ATAAGGTGTC TGAAGTGTAC CCG3AAAM3G AGGAACTGAA AGAGCTTCCA 1560 

CCTGTCACTT CGGAACAGTG ACTCTGGAOA GGAAGCCAGT GGAGCTGGTC TGCCAGGGGC 1620 

TTCCCACTTT GGCTTATTTT TCTGACTTCT AGCTGTCTGT GAATATCCAG AAATAAAACA 1680 

35 ACTCTGATGT GGAATGCAGT CCTCATCTCC AGCCTCCCCA CCCCAGTGGG AACTQTGCAA 1740 

AGGGCTGCCT TGCTGTTCTT GAAGCTGGGC TGTCTCTCTC CATGTTGGCC TGTCACCAGA 1800 

CCGGAGTCAA TTAAACAGCT GGTCCTOCAC TTTGCTGGTT CAGCCTTCGT GTGGCTCCTG I860 

QTAACQTGGC TCCACCTTGA TGGGTCAACC TTTGTGTGGC TCCTGGTAAC ATAACAACAA 1920 

CAGTTACTAT AGTGGTGAGA TGGAAQGAAT CAAATTTTGC CAGAGAAACT AACTCGQTGG 19B0 

■40 CGCCAACABG TCTTCCGGGG CCATGGGCAT TTGTTTAGAQ CCAAATTCAT CCTCTTACCA 2040 

GATCCTTTTC CAGAAATACC TGTCTAGGAA GGTGTGATGT CAGAAACAAT QACATCCAGA 2100 

AAGCTOAGGA ACAGGTTCCT GTGGAGACAC TGAGTCAGAA TTCTTCA.TCC AAATTATTTT 2160 
GTTAGTGGAA AATGGAATTG CTTCTGTGTA GTCAATAAAA TGAACCTGAT CACTTTTC 

45 Seq id MO s 308 Protein sequence 
Protein Accession fl = NP_o 03030 .1 

1 11 21 31 41 51 

Kn 1 1 \ 1 I I 

JU MEQQDQ9MKE GRLTLVLALA TLIAAFGSSF QTCYNVAAVN SPADLMQQFY KETYYGRTGE 60 

FMSDPPLTLL WSVTVSMFPF GGFIGSIJWG PLVNKFGHKG ADLFWHIP3I VPAILMGCSH. 120 

VATSFELIII SRIiLVGICAG VS6NWPMYI> GKI>A.PKHLRG ADGWPQLFI TVGItiVAQIF 180 

GLBNUbANVD GWPILLGI»TG VPAALQLLLti PPFPBSPRYL LIQKKDEAAA KKALQTLRGW 240 

DSVDREVAEI RQEDEAEKAA GFIBVLKIiFS. MRSLEWQLIxS IIVLMGGQQL SGVNAIYYYA 300 

55 DQIYLSAGVP EEHWQYVTAQ TGAVNWMTF CAYFWEUjG RRIiLIoLLGFS ICLIACCVLT 360 

AALALQDTVS WMPYISIVCV ISYVIGHALG PSPIPALL2T EIFLQB6RPS AFMVGG9VHW 420 

LSNFTVGIiXF PPIQBOIJ3PY SFIVFAVICL LTTIYIFLIV PETKAKTPIE IMQIFTXMHK 480 
VSEVYPEKKE LKELPPVTSE Q 

60 Seq ID NO; 309 DKR. sequence 

Nucleic Acid Accession # : NW_001252.1 
Coding sequence; 138.. 719 



1 11 21 31 41 51 

65 1 | 1 | | | 

GGCTGGTCCC CTGACAGGTT GAAGCAAGTA GACGCCCAQG AGCCCCGGGA GGGGGCTGCA 60 

GTTTCCTTC5C TTCCTTCTCG GCAGCGCTCC GCGCCCCCAT CGCCCCTCCT GCGCTAGCGG 120 

AGGTGATCGC CGCGGCGATQ CCGGAGGAGG GTTCGGGCIG CTCGOTGCGG CGCAGGCCCT 180 

ATGGGTGCGT CCTGCQGGCT GCTTTGGTCC CATTGGTCGC GGGCTYGGTG ATCTGCCTCG 240 

70 TGGTGTGCAT CCAGCGCTTC GCACAGGCTC AGCAGCAGCT GCCGCTCGAG TCACTTGGGT 300 

GGGACGTAGC TGAflCTGCAG CTGAATCACA CAGGACCTCA GCAGGACCCC AGGCTATACT 360 

GGCAGGGGGG CCCAGCACTG GGCCGCTCCT TCCTGCATGG ACCAGAGCTG GACAAGGGGC 420 

AGCTACGTAT CCATCGTGAT GGCATCTACA TGGTACACAT CCAGGTGACG CTGGCCATCT 480 

_ _ GCTCCTCCAC GACGGCCTCC AGGCACCACC CCACCACCCT GGCCGTGGQA ATCTGCTCTC 540 

75 CCGCCTCCCG TAGCATC3U3C eiXSCTGOGTC TCSVQCTTCCA CCAAGGTTGT ACCATTGCCT 600 

CCCAOCGCCT GACGCCCCTG GCCCGAGGQG ACACACTCTG CACCAACCTC ACTnGGACAC 660 

TTTTGCCTTC CCGAAACACT GATGAGACCT TCTTTGGAGT GCfiGTGGGTG CGCCCCTGAC 720 

CACTGCTGCT GATTAGGGTT TTTTAAATTT TATTTTATTT TATTTAAGTT CAAGAGAAAA 780 

AGTGTACACA CAGGGGCCAC CCGGGGTTGG GGTGGGAGTG TGGTGGGGGG TAGTGGTGGC B40 

OO AGGACAAGAG AAGGCATTGA GCTTTTTCTT TCATTTTCCT ATTAAAAA 



Seq ID NO: 310 Protein sequence 
Protein Accession # f NP_001243 -1 



1071 



WO 03/042661 



PCT/US02/36810 



10 



1 11 21 31 41 51 

! S ! 1 | | 

MPEEGSGCSV RRRPYGCVLR AAXVPLVAGL VICLWCIQR FAQAQQQLPL ESLGWDVAEL GO 
QLNHTGPQQD PRL1TWQGGPA LGRSFLHGPE LDKGQLRIHR J3QIYMVHIQV TIiAICSSTTA 120 
SRHHPTTIAV GICSPASRSi SLLKUSFHQG CTIASQRLTP LARGDTLCTN LTGTLLPSRN 180 
TDE T FFGVQW VRP 

Seq ID NO: 311 UNA sequence 

Nucleic Acid Accession #: Bob sequence 

Coding sequence : 1 . . 3 9 78 



1 11 21 31 41 SI 

I I I I 1 I 

ATGGTGGGTG AAGGACCCTA CCTTATCTCA GATCTGGACC AGCGAGGCCG GOGGAGATCC 60 

1 J TTTGCAGAAA GATATGACCC CAGCCTGAAG ACCATGATCC CAGTGC3A.CC CTGTGCAAGG 120 

TTAGCACCCA ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT TTCCTGGCTC 180 

ACGCCGGTGA TGGTGAAAGG CTACCGGCAA AGGCTGACCG TAGACACCCT GCCCCCATTG 240 

TCGACATATG ACTCATCTGA CACCAATGCC AAAAGATTTC GAGTCCTTTG GGATGAAOAG 300 

rsp. GTAGCAAGGG TGGGTCCTGA GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 360 

ACACGCOTGT TGATGGACAT CGTGGCCAAC ATCCTGTGCA TCATCATGGC AGCCATAGGG 420 

CCGACAGTTC TCATTCACCA AATCCTCCAG CAGACTOAGA GGACCTCTGG GAAAGTCTGG 400 

GTTGGCATTG GACTGTGCAT AGCCCTTTTT GCCACCGAGT TTAOCAAAGT CTTCTTTTQG 540 

GCCCTTGCCT GGGCCATCAA CTACCGCACG GCCATOCGGT TSAAGGTGGC GCTCTCCACC 600 

TTGGTTTTTG AAAACCTAGT GTCCTTCAA3 ACATTGACCC ACATCTCTGT TGGCGAGGTG G60 

ZD CTCAATATAC TGTCAAGTGA TAGCTATTCT TTGTTTGAAG CTGCCTTGTT TTGTCCTTTG 720 

CCAGCCAOCA TCCCGATCCT AATGGTCTTT TGTQ03GCGT ACGCCTTTTT CATTCTGGGG 700 

CCCACAGCTC TCATCGGGAT ATCAGTGTAT GTCATATTCA TACCCGTCCA GATGTTTATG 840 

GCCAAGCTCA ATTCAGCTTT CCGAAGGTCA GCAATTTTGG TGACAGACAA GCGAGTTCAG 900 

ACAATGAATG AQTTTCTGAC CTGCATCAGG CTOATCAAAA TGTATGCCTG GGAGAAATCT 960 

J\) TTTACCAACA CTAtOCAAGA TATAAGAAGG AGGGAAAGAA AATTACTGGA AAAAGCTGGA 1020 

TTTGTCCAAA OTGGAAACTC TGCCCTGGCC CCCATCGTGT CCACCATAGC CATCGTGCT3 1080 

ACATTATCCT GCCACATCCT CCTGAGACCC AAACTCACOG CACCOSTGGC ATTTAGTGTG 1140 

ATTGCCATGT TTAATGTAAT GAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 1200 

ATGGCTGAAG CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAQA TAAAAGCCCC 1260 

CCATCTTACA TCACCCAACC AGAAGACCCA OATACTGTCT TGCTTTTAGC AAATGCCACC 1320 

XTGACATGGG AGCATGAAGC CAGCAGGAAA AGTACOCCAA AGAAATTGCA GAACCAGAAA 13B0 

AGGCATTTAT QCAAQAAACA GAGGTCAGAG GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440 

GGAGCCACTG GCCCAGAGGA GCAAAGTGAC AGCCTCAAAT OGGTTCTGCA CAGCATAAGC 1500 

. TTTGT GGTG A QAAAQTTATG TCGTTAtCCC GAAGCCCAGC TCCTGGCTTG GAGGTGGCCA 1560 

H\J GCAGTGTTTG TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTtC TGCTAAAGAC 1620 

AAGGATGAAT CTAGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 1680 

GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGEAAGTGG AAAGAGCTCC 1740 

CTCCTTGCAG CTCTCCTAGG AGAGATGCAG CTGCAGAAAG GGGTGGTGGC AGTCAATGGA 1800 

Ac ACTTTGGCCT AOGTTTCACA GCAGGCATGG ATCTTTCATG GAAATQTGAG AGAAAACATA I860 

4 J CTCTTTGGAG AAAAGTATGA TCACCAAAGG TATCAGCACA CAGTCCGCGT CTGTGGCCTC 1320 

CAGAAGGACC TGAGCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GGGGGGCCTC 1980 

AAOCTCTCTG GGGGGCAGAG GCAGAGGATT AGCCTGGCCC GOGCTGTCTA CTCCQACCGT 2040 

CAGCTCTACC TGCTGGAEOA CCCCCTGTCG GCCGTGQACQ CCCAOGTGGG GAAGCACGTC 2100 

cp. TTTGAGGAGT GCATTAAGAA GACGCTCAGG GGAAAGACAG TOGTCCTGGT GACCCACCAG 2160 

DU CTACAGTTCT TAGAGTCTTG TGATGAAGTT AT T TT ATT AG AAGATGGAGA GATTTGTGAA 2220 

AAGGGAACCC ACAAGGAGTT AAtGGAGGAG AtJAGGGCGCT ATGCAAAACT GATTCACAAC 2280 

CTGOGAGGAT TGCAGTTCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT GGTGGAAGCC 2340 

TTCRAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTTTCT 2400 

CTCTTCACTG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460 

CTGGGTCTCT GGTTGGACAA GGGCTCAOGG ATGACCTOTQ GGOCOCAGGG CAACAGGACC 2520 

ATGTGTGAGG TCGGCGCGGT GCTGGCAGAC ATGGGTCAGC ATGTGTACCA GTGGGTOTAC 2580 

ACTGCAAGCA TGGTGTTCAT GCTGGtGTTT GGCqTCACCA AAGGCTTCGT CTXCACCAAG 264Q 

ACCACACTQA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700 

Kf\ CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACCGTTT TTCCAAGGAT 2760 

OU ATGGAGGAGC TGGATGTGAG GCTGCCGTTT CACGCAGAGA ACTTTCTGCA GCAGTXTTTT 2820 

ATGGTQGTGT TTAT TCtCQT GATCTTGGCT GCTGTGTTTC CTQCXGTCCT TTTAGTCGTG 2880 

GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACCCATTT TCCACAGAGG AGTCCAGOAG 2940 

CTCAAGAAGG TGGAGAATGT CAGCOGOTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 3000 

£r CAGGGCCTGG GCATCATTCA CGOCTATGGC AAGAAGGAQA GCTGCATCAC CTATACITCA 3060 

OD TOCAAAGGCC TGTCATTGTC ATACATCATC CAGCTGAGCG GACTGCTCCA AGTGTGTGTG 3120 

CQAAEGGGAA CAGAGACGCA AGCCAAATTC AOCTCCOTGG AGCTGCTCAG GGAATACATT 31 SO 

TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG GGACCTGTCC CAAGGACXGG 3240 

CCCRGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATQA GATACAGAGA CAACACCCCC 3300 

CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAGTGGGC AGAjCAGTCGG GATTGTTGGA 3360 

/U AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT GGAGCCAGCC 3420 

AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGAOCTCAGA 34B0 

ACCAAGCTGA CTGTGATCCC ACAGGATCCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3540 

TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGOTTCTGGA GAGAACATXC 3600 

ATGAGAGACA CAATAATQAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660 

/D OAAAACTTCT CAGTAGGGGA ACQTCAGCTG CTTTGTGTGG CCCGAOCTCT TCTCCGTAAT 3720 

TCAAAQATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG ACTGCAAGAC TGACACCCTG 3780 

GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT OGCCCACCGC 3B40 

CTCAACACAG TTCTCAACTG CGATCACGTC CTGGTTATGG AAAATGGGAA GGTOATTGAG 3900 

TTTGACAAGC CTGAAGTOCT TGCAGAOAAG CCAGATTCTG CATTTQCOAT GTTACTAGCA 3960 
OLT QCAGAAGTCA GATTGTAG 

9eq ID NOs 312 Protein, sequence 
Protein Accession #i Bob seqnence 



1072 



WO 03/042661 



I 11 21 31 41 51 

1 I I 1 1 1 

MVGEGPYX^IS DLDQRGRRRS FAERYDPfiLK TMIPVRPCAR LAPNPVDDAQ LL S FATFSWb 60 

TPVMVKGYRQ RLTVDTliPPL STYDSSDTNA KRFRVLWDEE VARVGPEKAS L3HVVWKFQR 120 

TRVIjMDIVAN ILCIIMAAIG PTVtlHQTLQ QTERTSGKVW VGlGliCIADF ATEFTKVFFW 1B0 

AliAWAINYRT AIRLJCVALST LVFENIiVfiFX TLTHISVGBV UJILSSDSYS LFfiAAUPCPL 240 

PATIPILMVF CAAYAFPILG PTALIGISVY VIFIPVQMFM AKIjNSAFRRS AII»VTDKRVQ 300 

TMHEFMCIR IiIKMYAWKKS FTNTIQDIRS RERKLLEKftO FVQSGWSAIA PIVSTJAIVI* 360 

TLrfiCHlLLRR KlrTAPVAFSV IAMFtfVMKFS lAIXPFSlKA MAEANV5LRR KKKILIDKSP 420 

PSYITQPEDP DTVLLTiANAT LTWEHEASRK STPKKLQWQK RHLCKKQRSE AY SHR 9 PPAK 4B0 

GATGFEEQSD SLKSVLHSIS FWHKLCRYP RAQblAWRHP AVFVGRIIRG YRPHGFSAKD 540 

KDESRRLLTW PQEVDRTQRA AKYLOKIU3I CGHVGSGK88 IJLAALLGQMQ LQKGWAVNG 600 

TLAYVSQQAW IFHGNVRENI I»FGEKYDHQR YQHTVRVCGIi QKDLSSJI.PYG DLTEIGERGL 660 

NLSGGQRQSI 8IARAVYSDR QLYLLDDFLS AVEAEVGKEV FEECIKKTlxR GKTWkVTHO 720 

LQFLESCDEV ILLEDGEICE KGTHKELMEB RGR YAKXi I EN IiRGLQFKDPB HIiYJJAAMVEA 780 

FKESPAEREE DAGIIQYLIjS LFTVFIjFI>IiM IQSAAFSNWW LGLWLDKGSR MTCGPQGKRT 840 

MCEVGAVIAD IGQHVYQWVY TASMVFMLVF GVTKGFVFTK TTLMASSSLH DTVFDKILrKS 900 

PWSFFDTTPT GKLMNRFSKD MDELDVRIiPF HAENFLQQFF MWFIL.VILA AVFPAVLIiW 960 

AfiliAVGP F IL LRIFHRGVQE IiKKVEHVSRS PWFTHITSSM QGLGIIHAYG KKBSCITYT8 1020 

SKGLSLSYTI QLSGI/LQVCV RTGTETQAKF TSVKLLREYI 8TCVPECTHP LKVGTCPKDW 1080 

PSCGEITFRD YQMRYRDNTP IiVmSIJSIIjHI QSGQTVQIVQ RTGSGK88XG MALFRIiVEPA 1140 

SGTIFIDEVD rCILSLEDLR TKLTVIPQDP VLFVGTVRYN tDPFBSHTDE MLWQVLERTF 1200 

MRDTIKKLPE KLQAEVTENG ENFSVGBRQI* I/TVARAIlLRK SKIIIiLDEAT ASMD9KTDTL 1260 

VQNTITCDAFK GCTVKTIAHR LNTVU5CDHV l*VriENGKVIE FDKPEVlAEK PDSAFAMLIxA 1320 
AEVRL 

Seq ID NO: 313 DMA sequenca 
Nucleic Acifi Accession #t Z31560 
Coding sequences 1-966 

1 11 21 31 41 51 

111111 

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCGGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCGGACC GGBTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 1B0 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGQAG AAGCGGCCGT TCATOGACGA GGCTAAGCGG 300 

CTGCGAGCGC TGCACATGAA GGWGCACCCG GATTATAAAT ACCGGCCCDG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCOCGGCGGC 420 

AATAGCATGG CGAGCGGGGT OGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480 

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGGT ACCOGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGGCC 600 

ATGCACCGCT ACGACGTGAG CGCCCTGCAQ TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAQCATGTCC TACTCGCAGC AGGGCACOCC 1GGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTIACC 780 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCOT GCGCCGAGGT GCGGGAACCC GCCSCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGOC CCPCTCACAC 960 

ATGTGAGGGC OGQACAGGGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA AGAGCATOQA GAAAACCOGG TACTCTCRAA 1080 
AAAAA 

8eg ID KOx 314 Protein sequence 
Protein Accession £i CAA83435 

1 11 21 31 41 SI 

111111 

HSARMYKMME TELKPPQPQQ T8GGGGGN8T AAAAGGNOKBI SPDRVKRPMM AFMVKSRGQR 60 

RKMAQEMPKM HN8EISKRXG AEWKL*t*SETE KRPFIDKAKR 1KALEMKEHP DYKYRFRRKT 120 

KTLMKKDKYT LPGGUAFGG NSMASGVGVG AGOjGAGVWQR «DSYAHHHGW SNGSYSMHOD 1B0 

QLGYPQHPGL HAHGAftQMQP MHRYDVSALQ YHSMTSSQTY MNGSPTYSMS Y8QQGTPGP4A 240 

liGSMSSrVVKS EASfiSPPWT SSSHSHAFCQ AGDLRDMISK YLPGAEVPEP AAPSRU3MSQ 300 
HYQ8GPVPGT AINGTLPIjSS M 

8©g ID WO: 315 DMA sequence 
Nucleic Acid Accession #t upisia 
Coding sequence: 29 . .541 

1 11 21 31 41 51 

1 I i 1 I I 

CGGACTTTGGC TTGTTAQAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60 

CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120 

AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCA.Ca.TGT 180 

TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 

AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAOCTTGTT GGAAGAAGGA AACTTCCTAC 300 

TGCFTTAGAT QGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360 

TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG ATACTGGAAA 420 

TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 480 

GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTG 540 

AOAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCGCTTAAT TAAATATCAA 600 

ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CXCTTCTACA ATTGTGGTTT 660 

ATTGAATGTG TTTTTCXGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720 
TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT 
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Seq ID NO; 316 Protein sequence 
Protein Accession #: AAB5D564 

1 11 21 31 41 51 

1 I I I I I 

HMAGMKIQL.V CMLLLAF9SW SLCSDSEEBM KAI/EADFLTN MHTSKISKAH VPSWKMTIjLK 
VCSLVNNLNS PAEETGEVHE EEX.VARRKI/P TALDGFSLEA MLTIYQLiHKI CHSRAFQHWE 
ItfQEDIliDTG NDKNGKBEVI QLYEHKPRHP YIUCRDSYVY 

Seq ID NO: 317 DMA sequence 

nucleic Acid Accession tt= NM_0D6536.2 

Coding sequence! 109- .2940 



£0 
120 



1 11 21 31 41 

I I I I 1 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC 
ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAQGCTTC 
AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC 
□AACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA 
ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA 
ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA 
ATAAAGATTT TAATACCTGC CACATGGAAA G CTAATAAT A 
TOfkTATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG 
TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT 
TTCCTACTGA ATG ATAACT T AACAGCTGGC TACGGATCAC 
GAATGQGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA 
ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGT T CAT 
GTGTGTGAAA AAGGTCCTTG CCCOCAAGAA AACTGTATTA 
GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG 
AOTTTATCTT CTGTGOTTGA ATTTTGTAAT GCAAGTACCC 
CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG 
TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC 
GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGC70GATG 
GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT 
ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG 
CAfiCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT 
TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA 
AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG 
CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT 
CTGGGTTCAT CTGCAGCCCC AAATCTGGAQ GAA1TATCAC 
TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG 
TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG 
AAAGCTCACC ATCRATTGAA AAACACAGTG ACTGTGOATA 
ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA 
GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA 
TGGATFCCAG GAACRGCTAA GCCTGGGCAC TGGACTTACA 
TCTCTGCAAQ CCCTGAAAGT GACAGTGACC TCTCGCGCCT 
GCCACtGrGG AAGC C TTTGT GGAAAGAGAC AGCCTCCATT 
TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG 
GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG 
GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT 
TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA 
CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA 
GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGOGAA 
AGCTCAGGAG GCTOCTTTTC AGTGCTGGGA GTTCCAGCTG 
CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG 
TGOACAGCAC CTQGAGAAGA CTTTGATCAG GGCCAGGCTA 
AGTAAAAGTC TACAGAATAT CCftAOATGAC TTTAACAATG 
AAGOGAAATC CTCAGCAAQC TGGCATCAGG GAGATATTTA 
ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG 
GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG 
CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGOCA 
GGAGTTTTAA CAGCAAXT3GG TTTGATAGQA ATCATTTGCC 
CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA 
ATAAATATCC AAAGTCTCTT CCTTCTTAGA TATAAGACCC 
CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA 
ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT 
CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT 

GCAAAGGGAA gggtaaagtc ggaccagtgt caaggaaagt 

AATAGCCCGA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA 
TCATTTAGTT ACTTTGATTA ATTTTTCTJT TCIOCTTATC 
TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC 
CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT 
TTTCACTOTA AGAGGTAACC TTTAACAATA TGGGTATTAC 
TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT 
TTTCTAAGTT T ATtGC CT TG GGTTATTATG GAATGATAGT 
TACCTAGGAA A 

Seq ID NO: 318 Protein sequence 
Protein Accession #i np_d 0652 7.1 



51 



CATATTGAAA 
TCTACAACAT 
TCCTGGTTGC 

ATGGGTATAA 
TCICAAACAT 
GAAGAGTATT 
ACAGCAAAAT 
GGGCACATGG 
ACAT TCATTT 
GAGGCCGAGT 
ACAATGACAA 
CTGACATCAC 
TTAGTAAGCT 
CATCAATAAT 
ACAACCAAGA 
TAATCACAGA 
CTCCTCCCAC 
TGTCCAGCAA 
ATTTGATGCA 
GAGAGATCAG 
CATATCTGCC 
AAGGATTTGA 
TGACCAGOGG 
CS^ACAATTCA 
GTCTTACAGG 
ATX3CTTTCAG 
AAAGTAC3VGG 
ATACTGTGGG 
TTATATTATT 
CTTTTCGGAC 
CCCTGAACAA 
CCAACTCAGC 
TTCCTCATCC 
CCACTGTGAC 
ATGATGGAGC 
CCTTTGCTGC 
GCACCGCAGC 
ACGGTAATAT 
AGTGGGGCTT 
GCCCCCACCC 
AAGAGGAATI 
CAAGCTATGA 
CTATTT TAGT 
CGTTCTCACC 
AAAGCCACAG 
TATCTAACAT 
GAGATTATCT 
TTATTATAGT 
ATGGAACAAA 
ATGGCCTTCG 
ATGCATTGAG 
TTGGGGGTAG 
TTAAAGTAAT 
TTGTTTTATT 
TAACTGTCTG 
TGTGCAGTAC 
TAATGCAAAG 
GCTAATGCTC 
CTTTGTCTCT 
TCTACTCCC& 
TATAGCGCCN 



ACCTGACACA 
GACCCAAAGG 
CTTAAGITCA 
TGGRTTGCTC 
TAAGQAAATG 
TTTCAGAAAT 
AAAACAAGAA 
AGATGATCCA 
CACACCTAAT 
GTTTGTCCAT 
ACCTTTCTAC 
AGGCATTTTi 
TTTTAAAGAA 
GTTCATGCAA 
AGCACCAAAC 
CTCTGCTGAC 
ATTCTOGCTT 
GATGGCAGAG 
GATTGTTGAA 
AGCCCAGCTA 
CACCACTGTA 
GGTGGTTGAA 
AGATGATAAG 
CTCCATTGCC 
AGGTTTAAAG 
TAQAATTXCC 
TGAAAATGTC 
CAAC33ACACT 
TGATCCTGAT 
AGCTAGTCTT 
TACCCATCAT 
TOTGCCCCCA 
TGTGATGATT 
TGCCAC3VGTT 
AGGTGCTGAT 
AAATGGTAGA 
CCACTCTAIT 
TCAGATGAAT 
TAGCOGAGTC 
TGATGTGTTT 
GACCCTATCT 
AATAAGAATG 
AAATACATCA 
CCAGATTTCC 
AATTTATGTT 
TGCCCAGGCG 
TATATTGAAA 
TGTGACACAT 
ATTATTATAA 
ACTACAAAAA 
TTTTTGTACA 
ATTAGAAAAC 
GTGTTTAAAG 
GAGGTGGAAA 
TGTOAAGCAA 
AGGTTGCTTO 
CTCTTTACCT 
AGAGATCTTT 
TCATAGCGGT 
TCRAAGCAGC 
TATAATGCCT 



60 
120 

ieo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1508 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



1 11 21 31 41 51 

1 1 1 I I I 

MTQRSIAGPI CKLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLXtlAXNPQ VPENQNLISH 
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IKEMITEASF YIjFNATKRRV ffrhixilip ATWKANNNSK IKQESYEKAN VIVTDWYQAH 12D 

GDDPYTLQYR GCGKEGKYIH FTPUFLUIDN LTAGIGSRGR VPVHEtfAHEJl WGVFDEYNHD 180 

KFFYINGQNQ IKVTRCSSDl TGIFVCEKGP CPQENCIISK LFKBGCTFIY NSTQNATASI 240 

MFHQgiiSSW EFQfASTHNQ EAPNLQBTQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300 

5 TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKCJEI 360 

RAQLHQIWSN DDRKLLVSYIi FTTVSAKTDI SICSGIiXKGF EWKKLUGKA YGSVMIIiVXS 420 

GDDKLLGWCL PTVLSSGSTI H8IALGSSAA PNUEELSRLT GGLKFFVPDI SNSNSMIDAF 480 

SRIGSGTGDI FQQHIQIjEST GENVKPHHQL, KNTVTVDNTV GNDTMFkVTW QASGPPEIII. 540 

FDPDGRKYYT NNFITNLTFR TASLWIPGTA. KPGHWTYTLN HTHHSLQALK VTVTSRASNS 600 

10 AVPPATVEAF VERDSliHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFFSPA ANGRYSLKVH VNHSPSISTP AH9IFGSHAM YVPGYTAUGH 720 

IQMHAPRKSV GENEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTTiSWTAPGE DFDQGQATSY EIHMSKSLQN IQDDENHAIL VMTSKRNPQQ AGIREIFTF5 840 

PQI8TKGPEH QPNQETMEfiH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP JJfiDPVPARDY 900 

15 lilLKOVI/rAM GLTGI3CMI WTHHTLSRK KRADKKBNGT Kli 

Seq ID NO: 319 PNA. sequence 

Nucleic Acid Accession #t hm_000228.i 
coding sequences 82.. 3600 

20 

1 II 21 31 41 SI 

I I I 1 I I 

GCTTTCAGGC GATCTGOAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60 

GGATCACCCC ATTGGCTGAA GA7JGAGACCA TTCTTCCTCT TGTGTTTTOC CCTGCCTGGC 120 

25 CTCCTGCATG CCCAACAAGC CTGCTCCCOT GGGGOCTGCT ATCCACCTGT TGGGGACCTG 180 

CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240 

ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300 

CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCGGG CCCCATGCGC 360 

TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

30 TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGOCCA TGCCCGCCGG CATGCTGATT 480 

GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCOGACTGC 540 

ACCTCCACCT TGCCTCGGGT CCGCCAGGOT OGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600 

CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATBGGG GGAAGGTCCA ACTTAACCTT 660 

ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720 

35 ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780 

CCTCOCAGOG CCTACTATGC TGTGTCCCAG CICCGrCTGC AGGGGAGCTG CTTCTGTCAC 840 

GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900 

CAGGTCCACG ATGTCTBTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 960 

GCACCCTTCT ACAACAACCG GCCCTGGAGA COGGCGGAGG GCCAGGACGC CCATGAATGC 1020 

40 CAAAGGTGOG ACTGCAATGG GCACTCAGAG AC2VTGTCACT TTGACGCCGC TGTGTTTGCC 1080 

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCAT T CAG 1200 

GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260 

CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAOAGCGCTG TGACCTATOC 1320 

45 AAGCCGGGCT TCACTGGACT CACCTACGCC AACCOQCAGG GCTGGGACGG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGOGCG CTGCCTTTGT 1440 

CTGCCCAAGG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AOTGGOCAGG GCTGTGAACC GTGTGCCTGC GACCOQCACA ACTCCCCTCA GCOCACAGTG IS 60 

CAACCAGTTC ACAGGGCAGT GCCCTOTCGG GAAGGCTTTG GTGGOCTGAT GTGCAGCGCT 1620 

50 GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCOGAGCC 1680 

TGTQACTGTG ATTTCOGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCGGCTGC 1740 

CTC1GCCGCC CTGGCTTGAC CGGGCCCCGC 1GTGACCAGT GCCAGCGAGG CTACTGCAM? 1800 

CGCTACCCGG TGTGCGTGGC CTGOCACCCr TGCTTCCAGA OCTATGATGC GGACCTCCGG I860 

GAGCAGGCCC TGCGCTTTGG TAGACXCOGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920 

55 GGGCTGGAGG ACCGTGGOCT GGCCTCCCGG ATCCTAGATG CAAAOAGTAA GATTGAGCAG 19 B0 

ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040 

GCCATOCTCT CCCTCAGGOG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100 

GAGACGTTGT CCCTTCCGAG AGACCTOGAG AGTCTTQACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAGGAAGAG ©GAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220 

60 GCCTTCCGGA TGCTGAGCAC AGCCTAOGAG CAGTC3VGCCC AGGCIGCTCA GCAGGXCTCC 22BO 

GACAGCTCGC GCCTTTTGGA OCAGCTCAGG GACAGCOGGA GAGAGGCAGA GAGGCTGGTG 2340 

CGGCAOGC3GG GAGGAGGAGG AGGCAOOGGC AGCCCCAAGC TTGTGGCOCT GAGGCTGGAG 2400 

ATGTCTTGGT TGGCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2S20 

65 TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580 

CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640 

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGOGCTT GQAGACOCAG 2700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCOAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

70 GAGGCCGTGC TGGCCCTGXG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTQAGGCTG AGGAAGCCAG GAGCCGAGOC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT COCTTCGGCT TATCCAGGAC 3120 

75 AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 

AAGCftGClGG GTGACTTCTG GACACGGATG GAGGAGCTOC GCCACCAAGC CCGGCAGCAG 3240 

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGOGAGCA GGCATTGAGT 330O 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGIATGCTG AGTTGAAGGA OCGGTTGGGT 3360 

CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 

80 GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480 

CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCGGACC TGACAGGACT GGAGAAGCGT 3540 

GTGGAGCAGA TCCQTGACCA CATCAATGGG CGCGTGCTCT ACTATGCCAC CTGCAAGTOA 3600 

TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCOGCC TTTGCTTTTG GTTGGGGGCA 3660 

GATTGGGTTG GAATGCTTTC CATCTCCAG3 AGACTTTCAT GCAGCCTAAA GTACAOCCTG 3720 
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GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780 

GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3fl40 

CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TAXCCCCCGC CTTTAGTTCT CCACTGGGGA 3900 

GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG TAAAAATGAA AAGCCAAATA 3960 
AAAATCTTTG G 

Seq ID HOt 320 Protein sequence 
Protein Accession #x NP_000219-l 

1 11 21 31 41 51 

111)11 

MRPFFIrLCFA LPGLLHAQQA CSRGACYPPV GDLLVGRTRF LRAS8TCGLT KPETYCTQTO 60 

EWQMKCCKCD GRQFHNYYSH RVENVASSSG PMRWWQSQND VNPVSLQU3L DRRPQLQEVM 120 

MEFQGPMPAG MLIERSSDFG KTWRVYQYLA ADCTSTFPKV RQGRPQSKQD VRCQSLPQRF ISO 

BARMK3GKVQ LNLMDLVSGI PATQSQKIQE VGEITNLRVN FTRIAFVPQR GYHPPSAYYA 240 

VSQLR1QGSC PCHGHADRCA PKPGASAGPS TAVQVHDVCV CQHNTAGPKC ERCAPFYNNK 300 

PWRPAEGQDA HEOQRCDCNG HSETCHFDPA VFAA&QGAYG GVCDNCRDHT EGKNCERCQL 360 

HYFRNRRPGA SIQETCISCE CDPDGAVPGA PCDPVTGQCV CKEHVQGERC DtCKPQFTGL 420 

TYANPQGCHR CDCNILGSRR DMFCDEESGR CLCLPNWGP KCDQCAPYHW KLASGQGCEP 480 

CACDPHNSPO PTVOPVHRAV 5CHEGFGGLM CSAAAIHQCP DRTYGDVATG CRACDCDFRG 540 

TEGPGCDKAS GRCLCRPGLT GPRCDQCQRG YCHRYPVCVA CHPCFQTKDA DLHEQALRFG 600 

RLRNATASLW SGPGLEDRGIi ASRILDAKSK IEQIKAVLSS PAVTBQSVAQ VASAILSLRR 660 

TliQGLQLDLP LEEETLSXtPR DliESLD'HSFK GLLTHYQRKR EQFEK1S3AD FSGAFRMDST 720 

AYEQSAQAAQ QVSDSSRUjD QLRDSRREAE RLVRQAGGGG GTGSPKLVAL RLEMS3LEDL 780 

TPTFNKLCGN SRQMACTPIH CPGELCPQDN GTACGSRCRO VLFRAGGAFL MAGQVAEQLR 840 

GFNAQIjQRTR QMIHAAEE6A EQIQSSAQRL ETQVSASRSQ MEEDVRRTRI* LIQQWDKLT 900 

DPETDAATTQ EVSEAVXALW LPTDSATVLQ KMNEIQAIAA RLFNVDLVL8 QTKQPIARAR 960 

RLQAEABEAR SRAHAVEGQV KDWGNLRQG TVALQEAQDT KQGTSRSIiRIj IQDKVAEVQQ 1020 

VLRPAEKIiVT SHTKQLGDFW THMEELRHQA RQQGAEAVQA QQLAEGASEQ ALSAQEGFER 1080 

IKQKYAEIiKD RLGQSSHLGB QGARIQSVKT BAEELFGETM EMMDRMKDME LEltLRGSQAI 1140 
KLRSADIiTGI* EKRVBQIRDH HJGRVLYYAT CK 

Seq ID NO i 321 DMA sequence 

Nucleic Acid Accession #: HH_o 019 44.1 

Coding sequence * 64 . . 30 B3 

1 11 21 31 41 51 

I 1 I 1 I I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGOGGCTC ACTTGGACTT 60 

TTTCACCAGG gaaatcagag acaatgatgg GGCTGTTCCC cagaactaca GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GA5AATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCRGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGOCAAQA 300 

TTACTTCAGA TTACCAAGCA ACCCAOAAAA TCACCTACCG AATCTCTGGA GTGQGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTOGA CCGGGAGGAA ACTCCAAOCT TOCTGATCAC ATGTCGGGCT CTAAATGGGC 480 

AAGGACTAGA TGTAGAGAAA CCACXTATAC TAACGOTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCfCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGOAAC CAGCAGGCAC ACCCATGTTC CTGCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCRGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCGCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCQATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT GTTTAGCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AQGCATCGTG AAAGTGGTGA 1080 

AGOCTCTAGA TTATGAACAA CTACAAAQCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTGGAT ACCGAGTTCA GTCAA£?OCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTOCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATOCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGAOGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGACGT TCTGaCCATA GATGAATACA 150O 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT GCTCOAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCXXSTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTC3GGCC CTCCTCAGAG 1740 

CCCaGGAACA GATACCTCCT GGAGTATACC AQVFCTCCCr GGTACTTACA GACftGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CBCAGCTTGA CACTGGAAGT CTGTCAOTGT GACAACAGGG 1860 

GCATCTG1GG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGOCCCCCT TCIGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATG3CTCAG AAGGAACAAT TCATGAOTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTpTGT GCCTOCTGTA ACAGCCAATG 2160 

GAGCGGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCAGAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGAGCA CTAAQCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAAJCA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTOGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAQGACTAC QCTGATCGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 24 60 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAOCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGOGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 25 BO 

GTTGCAQTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACIT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTXGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCOOTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCGAGCAGA 2760 
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CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGd'TCTGCT TTGTCCGCCT 2B20 
CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTX3ACCCTCT GCAGCATGGT AACTATTTAG 2BB0 
TAACGGAGAC TTACTOGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 
CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 
5 CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 30 SO 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 
ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 31 8 0 
TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 
TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAOTAAA 3300 
10 TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 322 Protein sequence 
Protein Accession If: NP_00193S.l 

15 1 11 21 31 41 51 

111)11 

KMGIiFPRT TG ALAIFVWIli VHGSLRIETK GQYDEEEMTM QQAKSRQKEE WVKFAKPCRE 60 

GEDtffiKRNPI AKITSDYQAT QKITYRISGV GIDQPPFQIF WPKNTGDIN ITATVDREET 120 

PSPLITCRAIt MAQGLDVEKP LILiTVKILDI NDtfPPVFSQQ IFttGEIEENS AfiMSLVKIIiN 180 

20 ATDADEPttHL NSKIAFXIVS QEPAGTPHFIi LSRNTGEVRT JjTNSUDREQA SSYRLWSGA 240 

DKDGEGLSTQ CECtflKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTPN 300 

WLJWYFFT6G JflEGNWFElQT DPRTNEGILK WKALDYEQI* QOTiq^SIAVK NKAEFHQSVI 360 

SRYRVQSTPV TIQVINVREG XAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN DGGYLMIDSK. TAEIKFVKNM NRD3TFIVNK T ITAEVliA ID EYTGKTSTGr 480 

25 VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTLNNRY TGPYTFAI^D QFVKLPAVWS S40 

ITTLNATSAL LRAQEQIPPG VYHIfiLVLTD SQNNRCEMPR SIiTLEVCQCD NRGICGTEYP 600 

TTSPGTRYGR PKSGRIX3PAA IGLUiIiQLLIi LIiLAPLIiIiLT CDCGAGSTGG VTGGPIPVPD 660 

GSEGTIHQWG IEGAHPEBKE ITNICVPPVT AMGADFMESS EVCXNTYARG TAVBGT8GWE 720 

KTTKLGAATE 6GGAAGFATG TVSGAASGFG AATGVGICBS GQ9GTMRTRH STGGTNXDYA 780 

30 DGAISMMFU3 6YFSQKAFAC AEEDDGQEAN DOiblYDNEG ADATGSPVGS VGCCSFIADD 640 

LDDSFLDBLG PKPKKIAEIS LGVDGEGKEV QPPSKDSGYG IESCXHIPIEV QQTOFVKCQT 900 

IiSGSQGASAT. SASGSVQPAV SrPDPLQHGN YLVTETYSAS GSbVQPSTAG FDPLl/TQNVI 960 
VTKRVICPIS SVPGNUWSPT QLRG9HTMLC TEDPCSRLI 

35 Seq ID KOi 323 DMA sequence 

Nucleic Acid Accession, tt: XM_05B069.2 
coding sequence i 1«,„1413 

1 11 21 31 41 51 

40 i | 1 I I I 

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGQAGCTCT TCCCCTOAAC 60 

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120 

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTXAAIG 180 

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240 

45 ACATCTAOCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300 

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360 

TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420 

TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480 

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540 

50 CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600 

GAATTCTGGA CTAOACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660 

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC: 720 

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780 

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGGCAAATC CTGACAATTC AGAACCAGCT B40 

55 CTCK3TGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGQGAAATAA GATCTTTTTC 900 

TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 960 

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTOAAG CTGCTTATGA AATTGAAGCC 1020 

AGAAAUCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAAT TAGC AA TTTAAGACCA 10BO 

GAQOCAAATT ATGCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140 

60 GATGCAGCTG TTTTTAAOCC ACGTTTTTAT AGGACCTACT TCTTTGT7AGA TAACCAOTAT 1200 

TGGAGOTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GAT1ACCAAG 1260 

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCG AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380 
ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 
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Protein Accession H z P39900 



1 11 21 31 41 51 

70 | i i I \ I 

MKFI»riIIiljLQ ATASGALPUN SSTSLBKHNV IiFGERYIiEKF YGLBINKLPV TKMKYflGKLM 60 

KESQCQEHQHF IiGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPQGPVWR KHYITYRINN 120 

YTPDMNREUV DYAIRKAFQV WSNVTPLKFS KH5TGMADIL WFARGAHGD FHAFDGKGGI 1B0 

liAHAFGPGSG IGQOAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS EPKAVMFPTY 240 

75 KYVDIHTFHlt SADDIRGIQS IiYGDPKBNQR LPNPDNSEPA LCDPNBSFDA VTTVGNKIFP 300 

FKDRFFHIiKV SERFICISVNL ISSLWPTLP8 GIEAAYEIEA KNGWLFKDD KYWI>ISN1J&P 360 

BPNYPKSXHS PGFPNFVKKI D AAVFN PRF Y RTYFFVDNQY HRYDBRRQHM DPGYPKLITK 420 
NFQGIGPKID AVFYSKNKYY YFPQGSNQFE YDFLLORITK TLKSHSWFGC 

80 Seq ID HO: 325 DMA seijuenca 

nucleic Acid Accession #; NM_024423.1 
Coding sequence: 64.. 2590 

1 11 21 31 41 51 
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I I I I I 1 

GGCW3GTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 
CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCQCGGAG CCGTCTGCCT GCATCTGCTG 120 
CTGACCCTGG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 3.80 
5 CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAQTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 
TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 
GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 
TCGAAGACAA GACACACTAG AGAAACXGTT CTCAGGCGTG CCAAGAGGAG ATGGGCAOCT 480 
10 ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 
GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGAOG TGGAGTTGAT 600 
AAAGAACCir TAAATTTGTT TXATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 
CCTGTGGATC GTGAAGAATA TGATGTTTTT GAT TTG ATTG CTTATGCGTC AACTGCAGAT 720 
GGATATTCAG CAGATCTGCC CCTCCC&CTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 
15 CACCCTGTTT TCACAGSAAGC AATTTATAAT TTTQAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 
CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACGTG GGCTCTTTTC TGTGCATCCC 960 
AGCAC&QGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 
TCATTGATAA TGAAAGTACA AGACATGGAT GGOCAGTTTT TTGGATTGAT AGGCACRTCA 1080 
20 ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 
TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 12 00 
GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 
GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 
GTAAAGOCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTQG AGTAAACAAT 13 B0 
25 GAAGOGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

OTTCATGTGA GGGATCTGGA TCGAGGGGGCT GAATGCACTC CTGCAGCCCA ATATGTGCGG ISOO 
ATTAAAGAAA ACTTAGCAOT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 
AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCG TGGATAGGGA GGTTGAAACT 1600 
30 CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTOTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCX I860 
GATGAACCTG TCCATGGAQC TCCATTTTAT TTCAGTTTGC CCAATACTTC TOOAGAAATC 1920 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACftGCTG CCOGTCTTTC ATATCAGAAA 1980 
35 AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2 040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TC CTTG CAAT ATTACTGGGT 2160 
ATAGCACTGC TCTTTTCTGT ATTGCXAACT TTAGTATGTG GflOTTTlTQG TGCAACTAAA 2220 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 
40 CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTIATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAQGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 
GAAATGATOA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGOGGGCTGG GCATCATCAT 2460 
ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 
GAGTGGCACA GTTTTACTCA ACCCCGTGTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2S80 
45 TAAAAATTAA ACATAAAAQA AATTGCATCG ATGTAATCAG AATGAAGACC GCATGOCATC 2640 

CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 
CTGCTGCAGT GAAAAGCAGG AAGAAGATGQ CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 
ATTTATTACA TTAGCAGAAG CATGCSCAAA GAGATAATGT CACAGTflCTA GAATTAGGIC 2820 
TTTGTCAGAC ATXCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT T CAACATGTA 2080 
50 TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 
AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000 
TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTGTTTTT TTTXTTTACG 3060 
GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 
ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 
55 AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 

ATGTTGCAGC TCATAAAGAA TTGGGACTCA OOCCTACTaC ACTACCAAAT TCATlTGACT 3300 
TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGXT 3360 
GOAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 
AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTOGG GT ATAGT TTG TCCTACAATA 3480 
60 GAAAAAAGAG AGAGCTTCGT AGGDCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3540 
GAGOAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 
TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 
CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 
GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT OGGGGAGCTA 3780 
65 ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTGTA 3840 
TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCG AAATAACCAT GTCCTC3CTAG 3900 
AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCAOGCT GGGGAGATTG 3960 
ATTCtTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTAC3LAAA 40 20 
TTTCATTTTT CTCCTCACTG COCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 40 BO 
70 AGGOCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 
CCITAAGTGA CTCCAGGTTT TCCACCATCC TTCAGOGTGA ATTAATTTTT AATCAGTTTG 4200 
CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 
AAAGATGAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 
TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGOCAC 4380 
75 AAGOAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAOGCTTGG 4440 
CACTGCCTTT TGCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500 
CCAACCTCTT CTCTATGGCT CACCTTATTT OGAGTGAGAA ATCAAGGAGA CAGAGCTGAG 4560 
TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTOTGC AGAACAAACA 4620 
. AGGCA.TTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 

80 CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 
TGACCCXAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCC TTTTTTT 480O 
TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 
CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 
GACTACAGGC GCCCACCAOC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 
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TTCACTGTGT TAGCCAGGAT GGTCTCOATC 
TCCCAAAGTG CTGGGATTAC AGGCATGACC 
GTCGTCTTCT TTTAATGTAA TCATTTTGAA 
TCAATCTTGA AATACTCAAC CAAAAGACAG 
J GCACAAAATA TTGGTCTGAQ AATGGAATTC 

TGTAACCAGA AGCCAGTTTT ATCTAACGGC 
CCCACTCACC GATCAAAACC TGCTACCTCC 
TCAAAGAGCA ACCAGTATCA CTTCCCTGTT 
TGAACATGCT GAAAACCACC TGGTCTGCAT 

10 AAATGAAAAT TTAATTTTAG GGAXTCATTT 
TCCTTATATG TGTAAGGTGA AATTTATGGT 
AGCTTTCATT TTTCCCCCAG TGAATGATTT 
TTTTCTTACT TTTATAAGGA AGCAGCTGTC 
TTTAAACAGA GTTTTAGTAT TGCTATTAAA 

15 CTGCTTAAftA TAAGCAAAAA TTGGATGCAT 

AATAAAACAA TATTAACTTQ GCTGCTTAAA 
TTTACAGATG TGGGGAGATG TAATAAAACA 
TTTAQAGATT AAATAATTCT AAGATGATCA 
GAAATAGAAA TACTCAATXA TGTCTTTGTT 

20 TCATTATCAA ATTGTCGACA TCATTAATAT 

TTGAAGCACA GCTTTACAGA TGAGTATCTA 
GTATTAAAAG TATTAGAAGG TGGTTATAAT 
ACAGGGGTTT TACTTTGAGG ACCAGTGTAG 
CAGGCAATAT TGCAGTCTTG ATTCTGCCAC 

25 GACAAGATGA TCCAACCATA AAGGTGCTCT 

GGAGTGTGCT CCCCTACAAA OGTTAAQACT 
AAAGCCTTAC ATTTTAATAT AGGTTGAACC 
ACCATTA1TT TTGTGTATGT CTTCAAGAAT 
ATACOGGATA CATTTCACGT GTOCTTCAGT 

30 GTTGAGAAGC ATGGACACTA GAGCCAGAAT 
ACTTCTGTGT GAOCTTTGAA AGGCTACTTA 
ATGAACAATG CCAGCCTCAT GGGGTTGTTG 
ACATAGAACA CTGCCTQCAC ATAGTAAAAG 
ATGTAGTTGG ATATACTACC GAACAATATC 

35 CAT AT AT ATA ATCCCQAAAC ATG 



TCCTGACCTC GTGATCCGCC TGCCTGGGCC 5040 

CACcaemx: ggccttgttt tccgtttaaa 5100 

CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220 

TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5250 

TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCAAGACTTT ACTAGTGOCG ATAAACTTTC 54 00 

TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

CTAT ATTTT C ACATATGTAG TATTATTATT 5580 

ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TAAAATGCAG TQGGGTTTGT TTTGCAATGT 5760 

AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820 

AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940 

ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000 

CTTTGCAAAA TTATGCTTAT GGCTGGCATG 60 go 

GTATTAATGG GGAATATTTT GGACAATGTT 6120 

ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TGATACATAT GTATAATAAA TTTTGATCGG €240 

TGCAGAGTAT TCCATGAATA GTACACTGAC €300 

TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6560 

TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GATCATTTCA AAAATCTATT AGCTATATCA 6540 

AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600 

GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATTGATXTGG TTGAATATTG GGTCATAATG 6720 

GCTTGGATAT GAATCCTOGA TCTGTCACTT 6780 

TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 702 0 



Seq ID NO a 326 Protein sequence 
protein. Accession ft= NP_077741.l 

40 1 11 21 31 41 51 

1 1 I I 1 I 

MAAAGPfcRSV RGAVCLHLLL T^VIFSRDGB ACKKVILNVP SKLBADK1IG RVNLEECFR8 60 

ADIiIRaSDPD FRVI2JDGSVY TARAVALSDK KRSFTIWIjED KRRQTQJCEVT VIiLHHQKKVS 120 

KTRHTRSTVL RRAKRRHAPI PCSMQEH9LG PFPLFLQQfVE 9DAAQNYTVP YSI6GHGVDK 180 

45 EPLNLFY3ER DTGHU?CTHP VDREEYDVFD LIAYASTADG YfiADLPLPLP IRVEDEHDHH 240 

FVFTEAlYNF KVLE6SRPGT TVGVVCATDR UEPDTMHTRI* Ky8rW2QTPR SPGLFfiVHPa 300 

TGVITTVSHY UJREWDKYS LIMKVQDMDG QFFGLIGTST C1ITVTDSND NAPTFRQKAY 360 

EAFVKSNAFtf VEILRIPIED KDIiINTAHWH VUFTUjKGNB NGHFKISTDK ETUEGVLSW 420 

KPLNYEENRQ VNfcEIGVNNB APFASDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

50 KENLAVGSXI NGYKAYDPEN R5T3UQLRYKK LHDPKGWITI DBISB9IITS KILDREVETP 540 

KNHUYNITVIi AIDKDDRSCT GTLAVNIEDV NDHPPEILQE YWICKPKMG YTDIIiAVDPD 600 

EPVHGAPFYF SLPNTSPEIS RLW3LTKVHD TAARLSYQKN AGFQEYTIPI TVKDRAGQAA €60 

TKLIiHVHLCE CTHPTQCEAT 8RSTGVILGIC HAILAILLGI ALIiFSVItfjTX. VCGVFGATKE 720 

_ . KRFPEDLAQQ NLIISNTEAP QDBR.VCSANG FMTQTTHNSS QGFCGTMGSG MKNGGQETIE 780 

55 MMKGGNQTLE SCRGAGHHHT LDSCRG3HTB VDNCRYTYSE KH8FTQPRLG EESIRGHTG 



60 



Seq ID NO* 327 DMA sequence 

MUcleic Acid Accession HMJJ01941.2 

Coding sequence: 64., 2754 



1 11 21 31 41 SI 

I I 1 1 I i 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GOCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC OCGGCGCTCC GTGCGCGGAG CCGTCTOCCT GCATCTGCTG 120 

65 CTGACCCTCG TOATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTQGAAGA GTGCTTCAGG 240 

TCTGCAQACC TCATCCGGTC AAGTGATOCT GATTTCAQAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

70 TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

G AATC TGATG CAGCACAGAA CTATACTGTC ttctactcaa TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG G60 

CCTGTGGATC GTGAAGAATA TGATGTTTTT QATTTQATTG CTTATGCGTC AACTGCAGAT 720 

75 GGATATTCAG CAGATCTGOC CCTCCCACTA CCCATCAGGQ TAGAGGATGA AAATGACAAC 7 HO 

CACCCTGTTT TCAiCAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TOTGOITCCC 960 

Q AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGQACA GAGAGGTTGT AGACAAGTAC 1020 

80 TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGOATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTAGGAAT ACCTAXAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CZAQCACAQAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 
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GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAC3CCT TGAACAGAGC CTTGGTTACA 1440 
GTTCATGTGA GGOATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 
ATTAAAGAAA ACTT AG CAGT GGGGTCAAAG ATCAACGGCT AT AAGG CAT A TGACCCCGAA 1S60 
5 AATAOAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCAOC 1620 

AT TG AT G AAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGQATAGGGA GGTTGAAACT 1680 
CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAOA AATACTTCAA 1800 
GAATATGTAG TCATTTGCAA ACCAAAAATG GOGTATAOCG ACMTTTAGC TGTTGATCCT I960 
10 GATGAACCTG TCCATGGAGC TCCATXTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 29B0 
AATGCTGGAT 1TCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGOCAAGCT 2040 
GCAACAAAAT TAT TGAGAGT TAATCTGTGT BAAT GTACT C ATCCAACTCA GTGTCGTGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 
15 ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTASTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2260 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT QGATlTATGA CCCAAACTAC CAACAACTCT 2340 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 
GAAATGATGA AAGGAGGAAA CCAGACCTTa GAATCCTGCC GGQGGGCTGG GCATCATCAT 2460 
20 ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2S20 
GAGTGGCACA GTTTTACXCA ACCCOGTCTC GGTGAAAAAT TGCATOOATG TAATCAGAAT 25 B0 
GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 
AATAATTTGG AACCCAAATT TATTACATTA G CAG AAG CAT GCACAAAGAG ATAATGTCAC 2760 
25 AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2B20 

TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 
CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2340 
AACAGACAAC TGGTAAATCT CAAACTCCAG GACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 
TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAQTCCAACA 3060 
30 ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 
ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 
ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 
ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAQT GCCCTATGAA 'GTAGCAATTT 3300 
TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 33 60 
35 ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGOGCTCT TAAATGCTGC 3480 
ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3S40 
GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3 600 
TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 
40 GAATACTCGC TGCAGCTGGG GTTCCCTOCT TTTTGGTAGC AAGGOTCCAG AGATGAGGTG 3720 

TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 
CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3B40 
TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 
GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 
45 GTCTOaOROC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 

CTGAATCAAG GAAAaCCAGG CCTTGTGGGC OCCCTTCTXT CGGCTTTCTG CTAAAGCAAC 4080 
ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 
AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 
TTGAATGIAT AAAAGAAAAA GATCAAGTTG TCATTTTAOA ACAGAGGGAA CTTTGGGAGA 4260 
50 AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 
GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 43 B0 
CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 
GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4S00 
AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 
55 GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 
CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680 
TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 
CCCCCCCCOC TTTTTTTTTG AGACGGAOTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 
GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 
60 GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACQ CCCGGCTAAT TTTTTGTATT 4920 
TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCOATCTCC TGACCTCGTG 4980 
ATCCGOCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCOGGC S040 
CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 
r _ TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 
65 GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG S220 

TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAAOGGCTAC TGAAACACCC 5280 
ACTGTGTTTT GCTCACT00C TCACTCAO0G ATCAAAAOCT GCTAOCTCCC CAAGACTTTA 5340 
CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 
TAACC3VTCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCQ3AA 5460 
70 TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA SS20 

CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 
CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 
TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 
GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GGTATTAAAA GAAGTTACTT S760 
75 TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5B20 

TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 
TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 
AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 
ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 
80 GAAGCAGAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120 

ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTG CATGAATAGT ACACTGACAC 61 B0 
AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATOAGTTAAA AAGAAAAGCA 6240 
GGCAATATTG CAGTCTTGAT TCT3CCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 
CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360 

1080 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 
20 
25 
30 



AGTGTGCTCC 
AQCCTTACAT 
CATTATTTTT 
ACCGGATACA 
TGAGAAGCAT 
TTCTGTGTGA 
GAACAATGCC 
ATAOAACACT 
GrAGTTGGAT 
TATATATAAT 



OCTACAAACG 
TTTAATATAG 
GTGTATGTCT 
TTTCACGTGT 
GGACACTAGA 
CCTTTGAAAG 
AQCCTCATGG 
GOCTGCACAT 
ATACTACCGA 
CCCGAAACAT 



TTAAGACTQA 
GTTGAACCAA 
TCAAGAATGT 
CCTTCAGTAT 
GCCAGAATGC 
GCTACTTATT 
GGTTGTTGAA 
AGTAAAAGAA 
ACAATATCTA 
G 



TCATTTCAAA 
AATTTCAATT 
TCATTGGATT 
TGATTTGGTT 
TTGGATATGA 
TCCTCTCTTA 
TGATTAAATT 
TTATAAGTGT 
ATCTCTTTTT 



AATCTATTAG 
CCAGTAACTT 
TTTGTTTGTA 
GAATATTGGG 
ATCCTGGATC 
GCTTTCTCAT 
AGTTAATATA 
GAGGTAGTTG 
AGGGAAATAA 



CTATATCAAA 
CTATTGTAAC 
ATAGTAAAAT 
TCATAATGGT 
TGTCACTTAC 
TAAAATCAAT 
CCTAAAGTAC 
GTAAAATTAT 
AGTTTGTGCA 



Sec, ID NO: 328 Protein geguence 
Protein Accession fti NP 001932.1 



l 
I 

MAAAGP&R6V 
ADLXRSSDPD 
KTSHTRETVI* 
EPLNKFYIER 
PVFTEAIYNF 
TGVITTVaHY 
EAFVHEKAFN 
KPLUYEEKRQ 
KENUWGSKI 
KNELYMITVIj 
EPVHGAPFYF 
TKLLRVNLCE 
KRFPEDLAQQ 
MMKGGiJQTLS 
DRMPSQDYVL 



11 
I 

RGAVCLHLLL 
FflVLHDGSVX 
RRAKRRWAPI 
BTGNLFCTRP 
BVI.E5SRPGT 
LDRBWDKYS 
VEIJ.RIPIED 
VKLEIGVKNB 
NGYKAYDPEN 
AIDKDDRBCT 
SliFHTSPEIS 
CTHPTQCRAT 
NDIIENTEAP 



21 
I 

TLVIFSRDQE 
TARRVALrSDK 



VDREEYDVFD 
TVGWCATDR 
IjINKVQDMDG 
KDLINTANWR 
APFAKDIPRV 
RNGNGLRYKJt 
GTIiAWIEDV 
JtLWSLTKVND 
SRSTGVILGK 



TYNYEGRGSP 



LDSCRGGHTE 
AGSVGCCSBK 



31 
I 

ACKKVILNVP 
KRSFTIWIiSD 
PFPI»FLQQVE 
LTAYA9TADG 
DEPDrTMHTRL 
QFFGLIGTST 
VNFTILKGNE 
TAIiKRAIiVTV 
IiHDPKGHITI 
NENPPEILQE 
TAARJjSYQKM 
WAXEiAILLGI 
FHTQTTNNSB 
VDNCRYTYSE 
QBEDGUDFLM 



41 
1 

SKIjEADKIIG 
KRKjQTQKBVT 
SDAAQflYTVF 
YSADLPLPI.P 
KYSlIiQQTPR 
CIITVTDSHD 
VGEFK1 STDK 



DEISGSIITS 
YW1CKPKMG 
AGFQEYTIPI 
ALLFSVIiLTL 
QGFCGTHGSG 
WHflFTQPRLG 
NLFPKFITLA 



6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 



51 
I 

RVNLEECFRS 
VI^EHQKKVS 
YSISGRGVDK 
IRVBDEKDtJH 
SPGLFSVHP9 
NAP r FRQNAY 
ETNEGVIiSW 
CTPAAQYVRI 
KILDREVETP 
YTDIIAVDPD 
TVXDRAGQAA 
VCGVPGATKG 
MKNQGQETIB 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
7B0 
840 



KACTKR 



Seq ID NOt 329 UNA sequence 
35 Nucleic Acid Accession it x NM_ 016583.2 
Coding sequence i 72.. 842 

1 11 21 |1 |1 51 

40 GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60 

TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 120 

CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCGTGGACCA GACCCTGCCC TTGAATGTGA 180 

ATOCAGCdCT GOCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT OCCCTCAGCA 240 

ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

45 TGAAGCCTGG AGGAGGTACT TCTGGTOGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360 

CAGTGATTCC TGGCCTGAAC AACATCATTQ ACATAAAGGT CACTOACCCC CAGCTGCTOG 420 

AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCAOCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTQAATACG CCOCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC S40 

TGGACATCAC TGCAGAAATC TTAOCTGIGA GAGATAAfiCA GGAGAGGATC CACCTGGTCC 600 

50 TTGGTGACTG CACCCATTOC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660 

CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAQGGAT CTTGAATAAA GTCCTGCCTG 720 

AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780 

CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 

AAGCCXTCCA GGAAGGGGCT GGCCTCTGOT GAGCTGCTTC CCAGTGCTGA CAGA TGGCTG 900 

55 GCCCATGTGC TGGAAQATGA CACAGTTGOC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960 

TCCCACCAGG OGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 



Seq ID NO* 330 Protein secfueaoce 
60 Protein Accession #-. MP_057€67.1 

1 II 21 31 41 51 

(11)1! 

MFOjTGGIiIVF YGLIiAQTMAQ FGGLPVELDQ TLFLNVMPAI, PLSPTGLAGS LTNALSNQLL 60 
65 SGGJXGI LEU liPLLDILKPG GGTSGGIAGG LLGKVTSVIP GLNHIIDIKV TDPQliLELGL 120 
VQSPDGHRLY VTIPU5IKLQ VNTPIiVGASI* IjEULAVKLDIT AKILAVRDICQ ERIHLVLGDC 180 
THSPOSUQIS LLDGLGPLPI QGLLDflLTGI LNKVLPELVQ GNVCPLVNEV LRGLD1TLVH 240 
DIVHMLIHGL QFVIKV 

70 

Seq ID NO: 331 DMA sequence 

Nucleic Acid Accession #= MM_004363.1 

Coding sequence t 115.. 2223 

75 1 11 21 31 41 51 

I I I I I I 

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 60 

TCCIGOAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 120 

_ _ TCTCCCTCGG COCCTCCCCA CAOATGQTQC ArCCGCTGGC AGAGGCTCCT GCTCACAGCC 180 

80 TCACTTCTAA CCTTCTGGAA COOGCCCACC ACTGCCAAGC TC3\CTATTGA ATCCACGCGC3 240 

TTCAATGTCa CAGAGGGQAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 300 

TTTGGCTACA GCTGOTACAA AGGTGAAAGA GTGGA3GGCA ACCGTCAAAT TATAGGATAT 360 

GTAATAGGAA CTCAACAAGC TATCCCAGGG CCOGCATACA GTGQTCGAGA GATAATATAC 420 

CXXAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 480 

1081 



WO 03/042661 
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CACGTCATAA 
GAGCTTGCCCA 
GTGGCCTTCA 
CAGAGCCTCC 
TTCAATQTCA 
GCCAGGCGCA 
TCCCCTCTAA 
TCTAACCCAC 
GAGCTCTTTA 
AACTCAGACA 
CCCAAACCCT 
TTAACCTGTG 
CTCCOC3GTCA 
GTCACAAQOA 
CACAGCGACC 
TCATACACCT 
CCACCTGCAC 
TTTATCTCCA 
GCCAGTGGCC 
ccctccatct 
TGTGAACCTG 
GTCAGTCCCA 
AGAAATGACG 
GACCCAGTCR. 
TCGTCTTACC 
CCQCAGTATT 
GCCAAAATCA 
GGCCGCAATA 
CTCTCAGCTG 
TAGCAGCCCT 
TAAAGCATTT 
AGACTCTGAC 
AAATACAAAA 
TGAOGCAGGA 
ACTGCAIZTCC 
TCTGACCTGT 
AACTTTAATG 
TAATTAATTT 
TTCGCAGATT 
AAATATACTF 
AOACTTGGGA 
TCAATAAAAA 



AGTCAGATCT 
AGCCCTCCA.T 
CCTGTGAACC 
CGGTCAGTCC 
CAAGAAATGA 
GTGATTCAST 
ACACATCTTA 
CTGCACAGTA 
TCCCCAACAT 
CTGGCCTCAA 
TCATCACCAG 
AACCTGAGAT 
GTCOCAGGCT 
ATGATGTAGG 
CAGTCATCCT 
ATTACCGTCC 
AGTATTCTTG 
ACATCACTGA 
ACAGCAGGAC 
CCAGCAACAA 

aggctcagaa 

GGCTGCAGCT 
CAAOAGCCTA 
CCCTGGATOT 
TTTCGGGAGC 
CTTGGCGTAT 
CGCCAAATAA 
ATTCCATAGT 
GGGCCACTGT 
GGTGTAGTTT 
GCAACAGCTA 
CAOAGATCGA 
ATGAGCTGGG 
GAATCGCTTG 
AGTCTGGCAA 
ACTCTTGAAT 
AACTAACTGA 
CATGGGACTA 
TCAGGAAACT 
TTGTGAACAA 
AACTATTCAT 
TCTGCTCTTT 



TGTGAATGAA 
CTCCAGCAAC 
TGAGACTCAG 
CAGGCTGCAG 
CACAGCAAGC 
CATCCTQAAT 
CAGATCAGGG 
CTCTTGGTTT 
CACTOTGAAT 
TAGGACCACA 
CAACAACTCC 
TCAGAACACA 
GCAGCTGTCC 
ACCCTATGAG 
GAATGTCCTC 
AGGGGTGAAC 
GCTGATTGAT 
GAAGAACAGC 
TACAGTCAAG 
CTCCAAACCC 
CACAACCTAC 
GTCCAATGGC 
TGTATGTGGA 
CCTCTATGGG 
GAACCTCAAC 
CAATGGGATA 
TAADGGGACC 
CAAGAGCATC 
CGGCATCATG 
CTTCATTTCA 
CAGTCTAAAA 
GACCATCCTA 



AACCCQGGAG 
CAGAGCAAGA 
ACAAOTTTCT 
CAGCTTCATG 
AATGAACTAA 



AAATTQAGAC 
GAATATTTAT 
GTATAACAGA 



GAAGCAACTG 
AACTCCAAAC 
GAOGCAACCT 
CTGTCCAATG 
TACAAATGTG 
GTCCTCTATG 
GAAAATCTGA 
GTCAATGGGA 
AATAGTGGAT 
GTCACGACGA 
AACCCCGTGG 
ACCTACCTGT 
AATGACAACA 
TGTGGAATCC 
TATGGCCCAO 
CTCAGCCTCT 
GGGAACATCC 
GOACTCTATA 
ACAATCACAG 
GTGGAGGACA 
CTGTGGTGGa 
AACAGGACCC 
ATOCAGAACT 
CCGGACAOCC 
CTCTCCTGOC 
OCGCAGCAAC 
TATGCCTGTT 
ACAGTCTCTS 
ATTGGAGTGC 
GGAAGACTGA 
TTGCTTCTTT 
GCCAACATCG 
CGCACCTGTA 
GTGGAGATTG 
CTCCATCTCA 
GATACCACTG 
AAACTGTCCA 
TGAGGATTGC 
TAAGCTATCC 
ATTTACATTT 
ATTGTATGGT 
AAAA 



GGCAGTTCCO 
CCGTGGAGGA 
ACCTGTGGTG 
GCAACAOGAC 
AAACCCAGAA 
GCCCGGATGC 
ACCTCTCCTG 
CTTTCCAGCA 
CCTATACGTG 
TCACAGTCTA 
AGGATGAGGA 
GGTGGGTAAR 
GGACCCTCAC 
AGAACGAATT 
ACGACCCCAC 
CCTQCCATGC 
AGCAACACAC 
CCTGCCAGGC 



AGGATQCTGT 
TAAATGGTCA 
TCACTCTA1T 
CAGTGAGTOC 
CCATCATTTC 
ACTCGGCCTC 
ACACACAAGT 
TTGTCTCTAA 
CATCTGGAAC 
TGGTTGGGGT 
CAGTTGTTTT 
ACCAAGGATA 
TGAAACCCCA 
GTCCCAGTTA 
CAGTGAGCCC 
AAAAGAAAAG 
CACTGTCTQA 
OGAAGATCAA 
TGATTCTTTA 
ACTCTTACAG 
TCTCCCTATG 
AATATAGTTA 



GGTATACCCa 
CAAGQATGCT 
GGTAAACAAT 
CCTCACTCTA 
CCCAGTGAGT 
CCCCACCATT 
CCACGCAGCC 
ATCCACCCAA 
CCAAGCCCAT 
TGCAGAGCCA 
TGCTGTAGCC 
TAATCAGAGC 
TCTACTCAGT 
AAGTGTTGAC 

CATrrccccc 

AGCCTCTAAC 
ACAAGAGCTC 
CAATAACTCA 
GCTGCCCAAG 
GGOCTTCACC 
GAGCCTCCCA 
CAATGTCACA 
AAACCGGAGT 
CCCCCGAGAC 
TAACCCATCC 

TCTcrrrATC 

CTTGGCTACT 
TTCTCCTGGT 
TGCTCTGATA 
GCTTCTTCCT 
TTTACAGAAA 
TCTCTACTAA 
CTCGGGAGGC 
AGATCGCACC 
AAAAGAAGAC 
GAATTTCCAA 
GCAGAGAAAA 
AATGTCTTGT 
CAATTTGATA 
TGGTCGCTCC 
TTGCACAAGT 



S40 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
12 60 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
28B0 
2940 



Seq ID NO: 332 Protein sequence 
Protein Accession #: NP_0043S4,1 



HLFGYSWYJDG 
TLHV1KBDLV 
NNGSJaPVSPR 
TISPXNTSYR 



11 
I 

WCIPVJQRUiL 
BKTOGNRQII 
NEEATGQFRV 
LQLSSGNRXL 



QSLFVSPRLQ 
9P3YTYYRPG 
NSASGHERTT 
LPVSPRLQIjS 
PDSSYLSGAN 
ATGRNKSIVK. 



TTVTTITVYA 
IiSNDMHTTjTli 
VKLSIrSCHAA 
VKTTTVSAEIi 
HGNRTLTLEW 
IiNLSCHSASN 
EITVBASGTS 



21 
I 

TASLLTFHNP 
GTVIGTQQAT 
YFKLPKPSIS 
TIVFNVTHHDT 
AASNPPAQfYS 
JRPPKPFtTSN 
LSVTRNDVGP 
SNFFAQYSWL 
PKPfllSSKHS 
VTHNDARAYV 
PSPQYSWR1K 
PGLfiAGATVG 



31 
1 

PTTAKLTIES 
PGPAYSGEEI 
SNHSKPVKDK 
ASYKCBTQNP 
WFVNGTPQQS 



YSCGIQMBLS 
1DGH1QQETQ 
KPVEDKDAVA 
CGIQNSVSAN 
GIPQQHTQVIj 
IMIGVLVGVA 



41 
1 

TPFNVAEGKE 
lYPNAB^IQ 
PAVAFTCSPE 
V8ARRSDSVI 
TQBliPIPNIT 
VAIiTCBPEIQ 
VDHSDPVILN 
ELFISK1TEK 
PTCEPEAQHT 
RSDPVTLiDVli 
FIAKITPWWH 



SI 
I 

HI1QNDTGPY 
TQDATYLWHV 
IiMVLYGPDAP 

VLYGPDDPTI 
NBGLYTCQAN 
TXLWWVNGQS 
YGPDTPII9P 
GTYACFVSNL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



Seq ID SO: 333 DNA sequence 

Nucleic Acid Accession NM_006952.1 

Coding sequence: 11.. 7 93 



1 
I 



TGGAAATGTG 
ATCTGACCAA 
GGCTGCCTGG 
TGTAGGCATC 
AGTATATGCC 
ACCCAACCTC 
TGATGACCAG 
CAATTGCTGX 
TGAGAATAAT 
AGAACCTCXC 
CTGCTATGAA 
ATTTGCCATT 
AATTGAATAT 



11 
I 

ATGGCGAAAG 
ATTATTGGTT 
CACAGCCTCT 
ATCGGCATAT 
ATGAAGTCGA 
TITGAAGTGG 
TTCCTGAAGC 
TGGAAAAACA 
GGCGTAAATG 
GATGCTGACT 
AACCTGGAGG 
CTGATCTCTG 
CTCTGCTGGA 
TAAGAA 



21 

i 

ACAACTCAAC 
GTTGCGGCAT 
ACCCACTGCT 
TTGTGGGCAT 
GCAGGAAAAT 
CATCTTGTAT 
AGATGCTAGA 
ATGGAGTC31G 
GICCATCAGA 
ATCCCTGaCC 
CTTGrAAACT 
GTCCAATQAA 
CTTTTTGGGT 



31 
I 

TGTTOGTTGC 
TGOCCTGACT 

tgaagccacc 
ctgcctcttc 

TCTTCTGGOG 
CACAGCAGCA 
GAGQTACCAA 
CAAAACCIGG 
CTGGGAAAAA 
TCGTCAATGC 
AGGCGTGCCT 
CCGACACGCC 
TCTOCTGQQT 



41 
) 

TTCCAGGGCC 
GCGGAGTGCA 
GACAACGATG 
TGCCTGTCTG 
TATTTCATTC 
ACACAACGAG 
AACAACAGCC 
GACAGGCTCA 
TACACATCTG 
TGTGTTATQA 
G3TTTTTATC 
TGGGGGGTTG 
ACCATGTTCT 



51 
I 

TGCTGATTTT 
TCrTCTTTGT 
ACATCTATGG 
TTCTAGGCAT 
T GATOTTT AT 
ACTTTTTCAC 
CTCCAAACAA 
TGCTCCAGGA 
CCTTCCGGAC 
ACAATCTTAA 
ACAATCAGGG 
CCTGGTTTGG 
ACTGGAGCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Sec; ID NO : 334 Protein sequence 
Protein Accession WP^OOees.l 
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X 11 21 31 41 SI 

I I 1 I 1 I 

MAKDNSTVRC FQQLLIFGNV IIGCCGIAI/T ASCIFFV8DQ HSLYPIjIiEAT DNDDIYGAAW 

IGIFVGICLF CLSVLGIVGI MKS3RKTLLA YPTLMFIVYA PEVASCITAA TQRDFFTPNL, 

FLKQHLiERYQ HNSPPNNEDQ WKNN3VTKTW DRLMLQDWCC GVNGPSDKQK YTSAFRTENN 

DADYPWPRQC CVMNNIiKEPIj NLKACKLGVP GPYHNQGCYE LISGPMNHHA WGVAWFGFAI 
LCWTPWVLL-G TMFYWSHIEY 

Seq id NOi 335 UNA sequence 

Nucleic Acid Accession #: HM_O0263S.l 

Coding sequence i 120, .473 



£0 
12 0 
180 
240 



1 
I 

CAATACAGCT 
GCTGGACTGC 
TGAQGGCCAG 
AGGCAGCTGT 
TCAATGGACA 
CGCA&3AGCC 
TCCGGTGCGC 
TCA2VGAAGTG 

TGCTGCCCTT 
GAGCTGCCTC 



11 
I 

AAGQAATTAT 
ATAAAGATTG 
CAOCTTClTG 
CACGGGAGTT 
AGATCCCGTT 
AGTCAAAGGT 
CATGTTGAAT 
CTGTGAAGGC 
TGCACCTGTG 



TCTCATCCAC 



21 
I 

CCCTTGTAAA 
GTATGGOCTT 
ATCGTGGTGG 
CCTGTTAAAG 
AAAGGACAAG 
CCAGTCTCCA 
CCCCCTAACC 
TCTTGCGGGA 
CCGTCCCCAG 
CACTGTOCAT 
TTTCCAATAA 



31 



TACCACAGAC 
AGCTCTTAGC 
TGTTCCTCAT 
GTCAAGACAC 
TTTCAGTTAA 
CTAAGCCTGG 
GCTGCTTGAA 



41 

1 

COGCCCTGGA 
CAAACACCTT 
CGCTGGGACG 
TGTCAAAGGC 
AGGTCAAGAT 
CTCCTGCCOC 
AGATACTGAC 
CGTTCCGCAG 
CCCATCTGOT 
ATTCAGGATG 



51 
I 

GCCAGGCCAA 
CCTGACACCA 
CTGGTTCTAG 
OGTGTTCCAT 
AAAGTCAAAG 
ATTAT CTTQA 
TGCCCAGGAA 
TGAAGOGAGC 
OCTAAGTCCC 
CCCACGGCTG 



60 
120 
130 
240 
300 
360 
42Q 
480 
540 
GOO 



Seq ID NO: 336 Protein sequence 
Protein Accession #s NP_Q 02 629-1 

1 11 21 31 41 51 

I I I I I I 

MRASSFLIW VF L XAGTLTVIj EAAVTGVFVK GQDTVKGRVP FNGQDPVKGQ VfiVMSQDKVK 60 
AQEPVKGFVS TKPGSCPT1T. 1RCAMLNPPU RCLKDTDCPG 1KKCCBGSCG MACFVPQ 

Seq ID NO: 33? UNA sequence 

Nucleic Acid Accession #t FJM_001793.2 

Coding sequence: 71- -2560 

1 11 21 31 41 51 

I I I I 1 I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCAOCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTOBCGTCT CTCCTCCTTC TCCAGGTTTG 120 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGOGGTC TTCAGGGAGG CTGAAGTGAC 1B0 

CTT6GAGGCG GGAGGCGCGQ AGCAGGAGGC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGeCCTOGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 

TGGCGAGACA GTCCAQGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATOCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG OTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT •TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA QACAGOCCCC CTGAGGGTGT S40 

CTTCGCTGTA QAQAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGGCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGABGA 660 

CCCCATGAAC ATCTCCATCA TOSTGAOCGA CCAGAATGAC CACAAGCCCA AGTTFACCCA 720 

GGAGACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

GACAQCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACG GGAGCRCAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA COGGGAAAAA GTCCCTGAGT ACAGACTGAC 960 

CATCCAGGCC ACAGACATGG ATQGGGAOGG CTOCRCCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

GCCTGAGAAT GCAGTQGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCOC 1140 
CAAC!TCACCA GCGTGGCGTO C3CACCTACCT TATCATGGGC GQTGACGACG GGGACCATTT , 1200 

TACCATCACC ACOCACCCTG AGAGCAACCA OGGCATTCCTG ACAACCAGGA AGGGTTTGGA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTOCAC GTGGAGGATG TTGAATGAGGC 13 80 

ACCTGTGTTT GTCCCACCCT CCAAAGTGGT TGAGGTCCAG GAGGGCATCC CCACTGGQGA 1440 

GCCTGTGTGT GTCXACACTG CAGAAQACCC TGACAAGGAG AATCAAAAGA TCAGCTACGG 1500 

CATCCTGAGA GACCCftGCAQ GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC C1CGACCGTG AGGATCJAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCOCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

ACTGATTOAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 

OCRAAGCCCT GTGOGCCAGG TGCTGAACAT CACGGACAAQ GACCTGTCTC CCCACAOCTC IBOO 

CCCTTTOCAG GCOCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCRG AGGTCAACGA I860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTOCTG AAGCAGGATA CATATGACGT 1920 

GCACCTrTCT CTGTCTGAOC ATGGCAACAA AGAGCflGCTG AOGGTOATCA GGGCCACTGT 1980 

GTGCGACTGC CATGGCCATG TC33AAACCTG CCCTGGACCC TGGAAGGGAG GTXTCATCCT 2040 

CCCTGTGCTG GOGGCTGTCC TGGCTCTGCT GTTCCTOCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA OCCGTGACRA 2160 

GSTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACOGA GGTCTGGAGG CCAGGCOGGA GGTGGTTCTC CGCAATGACG TGGCACCZVAC 2280 

CATCATGCCG ACACCCATGT ACCGTOCTCG GCCAGOCAAC CCAGATGAAA TCGGCAACTT 2340 

TATAATTGAQ AACCIGAAGG CGGCTAACAC AGACCOCACA GCCOCGOCCT ACGACRCCCT 2400 

CTTGQTaTTC GACTATGAGG GCAGCGGCTC CGACGCCGCB TCCCTQAGCT CCCTCACCTC 2460 

CTCCX3CCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCOCTTCAA 2520 

GAAGCTGGCA GACMGTAOG OTGGCGGGGA GGACGACTAG GCGGCCTGCC 1GCAGGGCTG 2580 
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GGGACCAAAC 
GACTTOGGAG 
ACGTTAGAGT 
A3CACTGAAA 
TCTTACCTGC 
TACAGTGGAC 
TTTTTTTAAT 
GCTGGGCCCA 
TGGATCTCTG 
GTTGCGTTC3C 
TAAAGAAACT 



GTCAGGCCAC 
CTTGTCAGGA 
GGTT6OTTCC 
ACCTCTCCAC 
CGTAAAATGC 
TTTCTCTCTG 
GCTATCTTCA 
CTGGCCGTCC 
CGTTTTTATA 
TATAGATGAA 
TTTCCCAGAA 



AGAGCATCTC 
AGTGGCOGTA 
TTAGCCTTTC 
CTGGGCCAGG 
TCAACCCTGT 
GAATGGAACC 
AAAC3TTAQA 
TGCATTTCTG 
CTGAGTGTGC 
GQGTGAGGAC 
AAAAA 



CAAGGGGTCT 
GCAACTTGGC 
AGGATGGAGG 
GTTGCCTCAG 
GTOCTGGGCC 
TTCTTAGGCC 
GAAAGTTCTT 
GTTTCCAGAC 
CTAGGTTGCC 
AATCGTGTAT 



CRGTTCCCCC 
GGAOACAGGC 
AATGTGGGCA 
AGGCCAAGTT 
TGGGCCTGCT 
TOCTGGTGCA 
CAAAAGTGCA 
CCCAATGCCT 
OCTTATTTTT 
ATGTACTAGA 



TTCAGCTGAG 
TATGAGTCTG 
GTTTGACTTC 
TCCAGAAGCC 
GTGACTGACC 
ACTTAATTTT 
GCCCAGAGCT 
CCCATTDGGA 
TATTTTCCCT 
ACTTTTTTAT 



2640 
2700 
2760 
282 0 
2880 
2940 
3000 
3060 
3120 
3180 



Seq ID NO: 338 Protein sequence 
Protein Accession ft: NP_0017S4 .2 



1 
1 

MGLPRGPIiAS 
QEPAIjFSTDN 
KGPFPQRLNQ 
y E IiPQHAVS E 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYliIMG 
PTSTATIWH 
DPAGHIAHDP 
VNDHGPVPEP 
TWIiSLKKFI* 
GAVLAliLFli 
GLHARPEWli 



11 
I 

IiLLLOVCWLO 
DDFTVRNGBT 
UCSNKDHDTK 
NGASVEDPMN 
GWAYeiHSQ 
TAVAWKtLD 
GDDGDHFTIT 
VEDVNEAFVF 
DSGQVTAVGT 
RQITICNQSP 
KQDTYPVHLS 
I.V1jL1j1,VRKK 
RNDVAPTJIP 



21 

1 

CAASEPCRAV 
VQERRSLKER 
IFYSITGPGA 
ISHVTDQND 
EPKDPHDLHP 
ANDNAPMFDP 
THPESNQGIL 
VPPSKtfVEVQ 
LtDREDEQFVR 
VRQV7JTITDK 



31 
I 

FREAEVTLEA 
NPLKIFPSKR 
DSPPEGVPAV 
HKPKFTQDTF 
TIHR3TGTIS 
QKYEAHVPEN 
TTRKGLDFBA 
EGIPTGEFVC 
NMIYEVMVLA 
DLSPHTSPPQ 
TVIRATVCDC 
EDDTRDNVFY 
FEEIGNFIIE 



41 

1 

GGAEQEPGQA 
ILRRHKRDWV 
BKETGVTLLLN 
RGSVLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNGHTLYVEV 
VYTAEDPDKB 
MDNG9PPTTG 
AQLTDDSDIY 



RKIKEPLLLP 
TPMVRPRPAN 
DQEQDYDYliN 



Seq ID NO: 339 DMA sequence 

Nucleic Acid Accession : Bob sequence 

Coding sequence : 1 . . 672 



YGEEGGGBED 
NLKAANTDPT 
DMYGGGEDD 



51 
I 

LGKVFMGCPG 
VAPISVPENG 
KFLiDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
"TTDIjnAPNSP 
TNEAPFVLKIi 
NQKI9YRILR 
TGTL IiXiTliI D 
KTAEVNBEGD 
WKGGFILPVL 
ODYDlTQliHR 
APPYDTLLVF 



60 
120 

lao 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



ATGAGGCTCC 
OGGGGCTCCG 
AAGGGCGGGG 



GGGAGACAAC 

tgtcatgttt 
acagagccat 
aagcagtgct 

CTCCTGGAAG 
TTAGAGGGGC 
AGCTGTGGTG 
AGCCTGTCTT 



11 
1 

AAAGACCCCG 
CCTAOCGGCC 
AGGGGGCGCC 
TGCTGCTGGT 
QAOATCCAGA 
GTGAGAGAGA 
ACTGCGTTAT 
COGCTGGTTG 
AGCCCATGCC 
CACCTATCAA 
GGCTGTGGCT 
GA 



21 
1 

ACAGGC3CCGG 
AGACCOGGGG 
GCGCGCTGAC 
CGTGGGCCTA 
GGACTCCCAG 
AAACACTTTC 
AGCGGCOGTO 
TGCftGCGATG 
CTTCTTTTAC 
CTCATCAGTG 
GGCCATCCTC 



31 
1 

GOGGGTGGGA 
AQAGGCGCGC 
CCTCCCTGGG 
CDQCGGGTGT 
CGAAOGGACG 
GAQTGCCAGA 
AAAATATTTC 
GAGAGACCCA 
CICAAGTGTT 
TTCAAAGAAT 
C1GCTGCTGO 



41 
I 

GGGGCGOGCC 
GGAQQCTGCG 
CACOGCTGGG 
GGACAGAGGC 
AGGGTGACAA 
ACCCAAGGAG 
CAOGTTTTTT 
AGCCAGAGGA 
GTAAAATTCG 
ATGCTGGGAG 
CCICCATTGC 



51 
I 

CCGGGGCGGG 
AAGGTTCCAG 
GAGGATGGOG 
CAACCTGACT 
TAGAGTGTGG 
GTGCAAATGG 
CATGGTTGCG 
GAAGCGGTTT 
CTACTGCAAT 
CATGGGTGAG 
AGE2CGGCCTC 



Seq ID MOs 340 Protein sequence 
Protein Accession 4t: Eos sequence 

1 11 31 31 41 51 

I \ I 1 I 1 

MRLQRPRQAP AGGHEAPRGG RGSPYHPDPG RGARRIiRRFQ KGG EGAPR AD PPWAPLGTHA 
LliALIiWAL PRVWTDANLX ARQRDPEDSQ RTBEGDNRVW CHVCKRENTF ECQNPRRCKW 
TEPYCVIAAV KIFPRFFMVA KftCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 
LEGPPIHBSV FKBYAGJBMGE SCGGIiWIAH* LLLASIAACJL SL-S 

Seq ID NO: 341 DKA sequence 

Nucleic Acid Accession ft: XM_035292.2 

Coding sequence; S3,. 1576 



II 



1 

I I 

GCTOGCTGGG CCGCGGCTCC 
TGCGGGCCCG AAGCGGCGCG 
GGAGAAGATG CTGGCOGCCA 
CGTOACCCTG CAGCGGAACA 
TAT03GCTG3 GGCATCTTCG 
GCTGGGGCTG GTGGTGTGGG 
CGCGGAGCTC GGCACCACCA 
CTACGGCTCG CTGOCCGCCT 
ATCGCAGTAC ATCGTGGGCC 
CTGCCGGGlG CCCGAGGAGG 
GGCCGTGAAC TGCTACAGCG 
CAAGCTGCTG GCCCTGGCCC 
TGTGTCCAAT CTAGATCCCA 
TGTGCTGGCA T TAT JVC AG CG 
CACAGAGGAA ATGATCAACC 
CATCETGACG CTGGTGTACG 
GCAGATGCTG TOGTCX3GAGG 



21 
I 

CGGGTCTGCC 
CGCIAGCGGC 
AGAGCGCGGA 
TCACGCTGCT 
TGACGCCCAC 
COBCGTGCGG 
TCTCCAAATC 
TCCTCAAGCT 
TGGTCTTCGC 
CSW30CAAGCT 
7GAAGGCGGC 
TGATCATCCT 
ACTTCTCATr 
GCCTCTTTGC 
CCTACAOAAA 
TGCTGACCAA 
CCGTGGCCGT 



31 



AGGCCCGGCC 



CGGCrGGGGG 
CAACGGCGTG 
GGGCGTGCTC 
CGTCTTCTCC 
GGGCGGCGAC 
CTQQATCX3AG 
CAOCTACCTG 
CGTGGCCTGC 
CACCCGGGTC 
GCTGGGCTTC 
TGAAGGCACC 
CTATGGAGGA 
CCTGCCCCTG 



41 
I 

GGTGOGCAGA 
GAGGAGAAGG 
CCGGCAGGCG 
GOCATCATCG 
AAGGAGGCAG 
ATCGTGGGGG 
TAOGCCTACA 
CTGCTCATCA 
CTCAAGCCGC 
CTCTGCSTGC 
CAGGATGCCT 
GTCCAGATOG 
AAACTGGATG 
TGGAATTACT 
GOCATCATCA 



51 
I 



AAGAGGOGCG 
AGQGCGAGGG 
TGGGGACCAT 
GCTGGCCGGG 
CGCTCTGCTA 
TGCTGGAGGT 
TCCGGCCTTC 
TCTTCCCCAC 
TGCTGCTCAC 



GOACTTCGGQ AACTATCACC 



GAAAGGGTGA 
TGGGGAACAT 
TGAATTXCGT 
TCTCCCTGCC 
TGTCCACCGA 
TGGGCGTCAT 



60 
120 

iao 

240 
300 
360 
420 
4B0 
540 
600 
660 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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GTCCTGGATC 
GTTCACATCC 
CTCCATGATC 
GACGCTGCTC 
CAACTGGCTC 
TGAfiCTTGAG 
OCTCTTCCTG 
CATCATCCTC 
GTGGCTCCTC 
CCCCCAGGAG 



ATCCCCGTCT 
TCCAGGCTCT 
CACCCACAGC 
TACGCCXTCT 
TGCGTGGCCC 
CGGCCCATCA 
ATCGCCGTCT 
AGCGGGCTGC 
CAGGGCATCT 
ACATAGCCAG 



TCGTGGGCCT 
TCTTOGTQGG 
TCCTCACOCC 
CCAAGOACAT 
TGGCCATCAT 
AGGTGAACCT 
CCTTCTGGAA 
CCGTCTACTT 
TCTCCACGAC 
GAGGCCGAGT 



GTCCTGCTTC 
GTCCCGGGAA 
CGTGCCGTCC 

crrCTCOGTc 

CGGCATGATC 
GGCCCTGOCT 
GACACCCGTG 
CTTCGGGGTC 



GGCTGCCGGA 



GGCTCCGTCA 
GGCCACCTGC 
CTCGTGTTCA 
ATCAACTTCT 
TGGCTGCGCC 
GTGTTCTTCa 
GAGTGTGGCA 
TGGTGGAAAA 
CAGAAOCTCA 
GGAGCAXGC 



ATGGGTCCCT 
CCTCCATCCT 
CGTGTGTGAT 
TCAGCTTCTT 
ACAGAAAGCC 

TCGGCTTCAC 
ACAAGCCCAA 
TGCAGGTGGX 



1OS0 
1140 

12 DO 
12GD 
1320 

13 B0 
1440 
1500 
1560 



Seq ID NO i 342 Protein sequence 
Protein Accession #s xp_035292.2 



l 
I 

HAGAGPKRRA 
GTIIGSGIFV 
IiEVYGSLPAP 
LLTAVNCYSV 
GNIVLAliYSG 



SIIiSMIHPQL 
RKPBLERPIK 
KPKNIjLQGIF 



11 
! 

LAAPAAESKE 
TPTGVLKEAG 
LKLWIEIill 
KAATRVQDAF 
LFAYGGKNYIj 
VAVDFGNYEIj 
LTPVPSI#VPT 
VNLAI*FVFFI 
STTVLOQKLM 



21 
I 

EARBKMIiRAX 
SPGLALWWA 
RPSSQYIVAL 
AAAKLiALAl* 
NFVTEEMINP 
GVMSWIIPVF 
CVMTIiLYAPS 
LACI»PI»IAVS 
QWPQST 



31 
I 

SADGSAPAQE 
ACGVPSIVGA 
VFATYI^LKPI* 
XIIiIjGFVQXG 
YRNLPLAIII 
VGLSCFGSVN 
KDXFSVMFF 
FWKTPVECGI 



41 



51 
I 



GEGVTLQRNI TLLNGVAI I V 
IiCYAELGTTI SK6GGDYAYM 
FPTCPVPE3BA AKLVACLCVL 
KGDVS NTLiDPN FSFBGTKLDV 
3UPIVTLVYV LTNIAYFTTL 



SFFNWLCVAI» AX IGMIWLHH 
GFTI XliSGLP VYFFGVWWKM 



Seq ID MO: 343 DMA sequence 

Nucleic Acid Accession ft* NPM)0526B.l 

Coding sequence! 16B..9B9 

1 11 21 31 

ill) 

TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACAjCGGGCTT 
TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC 
AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCGTGG 

TCTTTGAGGG actoctgagt ggggtcaaca agtactccac 

TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT 

GTGATGACCA caaggacttc gactgcaata ctogccagcc 

TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCXGGGC 
CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTAC3CG 
ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT 
GTQGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA 
TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT 
ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC 
TTTTCACCCT CTTCATGGTG GCCACAGCTG OCATCTGCAT 
TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC 
TGTGCACAGG TCATCACOCC CACGGTACCA OCTCTTCCTG 
CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC 
GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG 
CCrGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC 
CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAQACGCTC 
TCAACTCCAG CCACCTGCCC CAOCTCGACG GCACTGGGCC 
GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TQAGGGCCAA 

Seq ID NO: 344 Protein sequence 
Protein Accession fti NP_O0S259.1 



41 

I 

CCCCGAAAAC 
CGGCTGCTGG 
GTCCACCATG 
AGCCTTTGGG 
GACGGCCGAG 
CGGCTGCTCC 
CCTGCAGCTT 
GGAGGTTCAG 
GAACCCCGGC 
GGCGAGCGTG 
CCCTCCTGTG 
CAAGCOCTCA 
CCTGCTCAAC 
AGCAAGGAAA 
CAAACAAGAC 
TCTCTTACCA 
GACTGGTCTG 
TGAGAGTGGG 
TGGGAGCCAG 
AGTTCCCCCT 
TGC 



51 
1 

CTTCCCCGCT 
GAGCCAGGAG 
AACTGGAGTA 
CGCATCTGGC 
CGTGTGTGGA 
AACGTCTGCT 
ATCCTGGTGA 
GAGAAGAGGC 
AAflAAGCGGG 
GACATCGCCT 
GTCAAGTGCC 
GAGAAGAACA 
CTCGTGGAGC 
GCTCAAGCCA 
QACCTCCTTT 
GACCGCCCCC 
GCAGGTTGGG 
GGAGCTAAGC 
TTCCTAGTCC 
CTGCTCTGCA 



51 



1 
1 

CGGGCGAAGC 
CQCCCTGCTG 
CCCGGGGAGC 
CGGCGTGGGT 
GCCCTGCAAC 
TGCGTGTGAT 
CAATGCTCAG 
AAAGGCCAAA 
GCCCCTGGTG 
CACCAGTGCC 
ACTCOCCAGC 
TGAGCCTCCC 
ATTACTAAGA 



11 
I 

AGCGCGGGCA 
GCGCTCACCT 
GAGTGCGCTG 
TTCCGGGAGG 
TGGAAGAAGG 
GGGGGCACAG 
TGCCAGGAGA 
GCCAAGAAAG 
TCACATGGGG 
TTCTGTCTGC 
CCCACCCCTA 
CCAAAGCAAT 
AACACATCAA 



21 
1 

GCGAGATGCA 
CCGCGGTCGC 
AGTGGGCCTG 
GCACCTGCGG 
AGTTTGGAGC 
GCACCAAAGT 
CCATCCGCGT 
GGAAGGGAAA 
CCTGGCCACQ 
TCGTTAGCTT 
AGTQCCCAAA 
GTGAGTCCCA 
ATAAACTGAC 



31 
] 

GCACCGAGGC 
CAAAAAGAAA 
GGGGCCCTGC 
GGCCCAOACC 
CGACTGCAAG 
CCGCCAAGGC 
CACCAAGCCC 
GGACTAGACG 
CCCTCCCTCT 
TAATCAATCA 



GAGCCCGCTT 
TTTTTCCCCC 



41 

I 

TTCCTCCTCC 
GATAAGGTGA 
ACCCCCAGCA 
CAGCGCATCC 
TACAAGTTTG 
ACCCTGAAGA 
TGCACCCCCA 
CCAAGCCTGG 
CCCAGGCCCG 
TGCCCTGCCT 
ACAAGGGATT 
TTGTTCTTCC 
CAATAAAAGC 



60 
120 
ISO 
240 
300 
360 
420 
480 



1 II 21 31 41 

III t f 1 

MKHSIFEGUj SGVNKYSTAF GRXWI»SLVFX FHVLVYliVTA ERVWSDDHKD FDCNTRQPGC 
ENVCFDBPFP VSHVRLWALQ WLVTCPSIiL WMHVAYREV QEKRHEKAHG ENSGRLYIUP 
QKKHGGUMWT YVCSLVFKAfi VDIAFLYVPH SFYPKYILPP WKCHADPCP NIVDCFISKP 
SEKtnPTLPM VATAAICIIiI* NLVEIrlYLVS KRCHEdaAAR KAQAMCTGHH PHGTTS3CKQ 
DDIiLflGDIiIF LGSDSKPPXjL PDRPRDHVKK Till 

Seq ID NO: 345 DNA sequence 

NUcleic Acid Accesoion. #z KM_002391-1 

Coding sequence: 26.. 457 



60 
120 

ieo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 
180 
240 



51 
I 

TCACCCTCCr 
AGAAGGGCGG 

gcaaggattg 
ggtgcagggt 
aoaactgggg 

AGGCGOGCTA 
AGACCAAAGC 
ATGCCAAGGA 

aqatgtgacc 
tgtccctctc 

CTGGGAAGCT 
CCACAATTCC 
TCTTCTTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



1085 



WO 03/042661 



PCT/US02/3681© 



10 
15 



35 



TAATAT 

Seq ID NO: 34fi Protein sequence * 
Protein Accession #i NF_002382-1 

1 11 21 31 41 51 

I I 1 I I I 

MQHRBFlfLrLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AViGPCTPSSK DCGVGFREGT 60 
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WQACDGGTGT KVRQGTUCKA RYNAQCQETI 120 
RVT KP CTPKT KAKAKAKKGK. GKD 

Seq ID NOt 34*7 DMA sequence 

Nucleic Acid Accession. # ; NM_006783-1 

Coding sequence: 1..786 



1 11 21 31 41 51 

t I I I I ! 

ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 60 

GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC 120 

20 CAGGAAGTOT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA. ACCGGGATGC 1B0 

AAAAATGTGT GCTATGACCA CTTTTTOCCG GTGTCCCACA TCOGGCTCTG GGCCCTCCAG 240 

CTGATCTTCG TCTCCAGCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 300 

GAAAOCACTC GCAAGTTCAQ GCGAGGAGAG AAGAGGAA1G ATTTCAAAGA CATAGAGGAC 360 

ATTAAAAAGC ACAAGGTTCG GATAGAGGGG TCGCTGTGQT GGACGTACAC CAGCAGCATC 420 

25 TTTTTCOGAA TCATCTTTGA AGCAGOCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 4 BO 

TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 540 

TTTATTTCTA GGOCAAOAGA GAAGACOGTG TTCACCATTT TTATGATTTC TGCGTCTGTC 600 

ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG 660 

AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GGAGAGTAAG 720 

30 CAGAATGAAA TGAATQAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AOGTTTGCGA 780 
AGCTAA 



Saq XD NOi 34Q Protein sequence 
Protein Accession #i NP_006774.1 

1 11 21 31 41 51 

I 1 I 1 I I 

HDWGTLHTFI GGVNKHSTSI GKVKITVIFI FRVMILWAA QEVWGUEQED FVCNTLQ^GC 60 

KWVCYBHFFP VSKlHlAfALQ LIFVSTPALL VAMHVAYYRH ETTRKFREGE KRNDFKDIED 120 

40 IKKHKVRIEG SLMWT5TTSSI FFRIIFEAAF MYVFYFXtYNG YHLPKVTjKCG IDPCPNI*VDC 180 

FISRPTKKTV FTIFMISASV ICML1MVAEL CYTJiTyKVCFR RSKRAQTQKK HPNEAIJKBSK 240 
QHEMNEL1SD SGQNAITGFP 9 

Seq ID NO; 349 DNA sequence 
45 Nucleic Acid Accession # t NMJJ02571.1 
Coding sequence: 99.-587 

I 11 21 31 41 SI 

<A I 1 I 1 1 1 

dU catccctctg gctccagagc tcagagccac OCACAGCCGC AGCCATGCTG TGCCTCCTGC 60 

TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCGC CAGACCAAGC 120 

AGGACCTGGA GCTOCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA 180 

ACATCTCCCT CRIGGCGAGA CTGAAGGCCC CTCTGAGGGT CC&CATCACC TCACTGTTGC 240 

CCACCCCCGA GGACAACCTG GAGATGGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 300 

55 AGAAGAAGGT CCTXGGAGAG AAGACTGGGA ATOCAAAGAA GTTCAAGATC AACTATACGG 360 

TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTQTTT CTCTGCCTAC 420 

AOGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA OCTGGCCAGA GTCCTGGTGG 4B0 

AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC ACJGCACCTAT 540 

GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCCGCCT 600 

60 CCAGGAAGAC CAGACTCCCA CCCTTOCACA CCTCCAGAGC AGTGGGACTT CCTCCTGCCC 660 

TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CATCTGTGTC GCCATCCCCT 720 

TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 780 
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G 

65 Seq ID MOs 350 Protein sequence 
Protein Accession ft; NP_0025S2.l 

1 11 21 31 41 51 

n<\ 1 I I I 1 1 

/U MDIPGTXQBli EUKUU3TWH SMAMATNNIS LMATLKAPLR "VHITSCIiPTP EDNLEIVLHR 60 

WENKSCVEKK VLGEKTGMPK KFKINYTVAN EATMjDTDXD NFIiFLCLQDT TTPIQ3MMCQ 120 
YLARVIiVEDD BIMQGFXRAF RPLPRHLWY1, LDLKQMEEPC RF 

Seq ID NO s 351 DNA sequence 
75 Nucleic Acid Accession # = NM_006500,1 
Coding sequence: 2 7.. 1967 

1 11 21 31 41 51 

« n I I 1 \ I I 

OU ACTTGCGTCT CGDCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CtGCTGCTGT CCTCGOGTGG CGGGTGTGCC CGGAGAGGCT GAGCAGCCXG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTOOGCA GCACAGCOCT TCTGAAOTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 

TCAICTTCCG TGTQCGCCAG GGCCAGGGCC AGAGCGAACC TGGGOAGXAC GAGCAGCGGC 300 
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TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGRGC 360 

GCATCTTCTT GTGCCAGGGC AAGCGCGCTC GGTCCCAGGA GTACCGCATC CAQCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 460 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 

TCATCTGGTA CAAGAATGGC COTCCTCXGA AQGAGGAGAA GAACCGGGTC CACATTCAGT 600 

CGTCCCAGAC TGTGGAGTCQ AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

TGGTTAAAGA ABACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACOGG CTGOCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 340 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

AGGAACACAG TGGGCGCTAT GAATGXCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 10 B0 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTDQAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCXGCGTG QCGXCTGTGC 1260 

CCAGCATACC CGGCCTQAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAQCGTC AGGGCACCCC CGGOCCACCA TCTCCTGOAA CGTCAACGGC ACGGCAAGtG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

TGTTGGAGAC AGGTGTTGAA TGCACQGCCT CCAAOGAGCT GGGCAAAAAC ACCAGCATCC IS 60 

TCTTCCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CAOCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG i860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GOCTCCTGCA GG0CAGCAGC GGTQACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC GGAATCACTT 1980 

CAGCTCCCTT CCCraCCTQG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAQAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

□GCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATOCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAOAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTQC! GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCClGCTCG OCTCTTCAAA GTCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGOCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2S80 

GGCCAGGTGT GQTGGCTCAC GCCTGTAATC CCAOCACTTT GGGAGGCOGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTOA AACCCTGTCT CTACTAAAAA 2700 

TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGrGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCGGTCT CGAGGAAAAA AAAAGAAAAG 2680 

ACGCGTACCT GCGGTGAQGA AGCTGGGCGC TGTTTTCGAQ TTCAGGTGAA TTAGCCTCAA 2940 

TCCOCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAQC 3000 

GGGGAGCAGA CAAAGATGAG GTCTAGACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAOGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

AAAGCTXAAT TGTCOCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTOC AGXTGGCAGC 3420 
AGAGATCAGG GGTTACCTCT GCTTCXGAGC AAATGGCTCA AGCTCTACCA GAGCAQACAG 3480 
CTACCCTACT TTTCAGCAGC AAAACGTCOC GTATGACGCA GCACGAAGGG CCTGGCAGGC 35 40 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCOGTCCA CTT 

Seq ID NO: 352 Protein gocpience 
Protein Accession #s Nt>_00649l.l 

1 11 21 31 41 51 

| I I I 1 1 

GLPRLVCAFL IAACCCCPRV AGVPGEAEQP APELVEVHVG STALIiKDGIjS QSQGHLSHTO 60 

WF6VEKBKH.T LIFRVRQGQG QSEPGEYEQR LSLQDRGATL ALTQVTPQDE EIFLOQGKSP 126 

RSQEYRIQLR VYKAPEEPNI OVKELG1PVN S KEPE EVATC VGRMGYPIPQ VIVTTKNGRFL 180 

KBEKHSVHIQ SSQTVE9SGE* YTLQSIIiKAQ LVKSDKDAQF YCEIAIYRliPS GUHMKBSREV 240 

■ TVFVPYPTEK VWLEVEPVGM IiXBGDRVETR diADGSPPPH FSISKQKPST REAEHETTND 300 

WGVI.VI»EPAR KEHSGRYECQ AWNLDTMISIj IiBEPQEEjLVK YVSDVRVSPA APERQEGSSIj 360 

TLTCEAESSQ Dl»EFQMIiREE TDQVTjERGPV LQLHDLKREA GGGYRCVASV PSIPGLHRTQ 420 

LVKLAIFGPF WMAPKERKVW VXBJJMVLHLS CEASGHPKPT I3WNVNGTAS EQDQDPQRVI* 480 

STLHV1»VTPE UjETGVECTA SHDLQKHTSl liPLEIjVNliTT LTPD3NTTTG LSTSTASPHT 540 

RAN8T9TERK LPEPEBRGW TVAVIVCILV XAVLGAVLYF LYKKG3CLPCR R9GKQEITLP 600 
P2RKTEL.WE VK8DKLPEEM GLLQGSSGDK RAPGDQGEKY IDLRK 

fieq ID MOi 353 PNA sequence 

nucleic Acid Accession H NM_003183.3 

Coding sequence: 165.. 2639 

1 11 21 31 41 SI 

I I I 1 I I 

TCGAGCCTGG OGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGGAAAAGA GGATTGAGGG 60 
GCTAGGCCGG GCGGATCCCG TCCTCCCCCG ATGTGAGCAG TTTTCCGAAA CCCOGTCAGG 120 
CGAAGGCTGC CCAGAGAGGT GGAGTOGGTA GCGGGGCCGG GAACATGAGG CAGTCTCTCC 180 
TATTCCTGAC CAGCGTGGXT CCTTTGGTGC TGGCGCCGCG ACCTCCGGAT GACCOGGGCT 240 
TCGGCGCCCA CCAGAGACTC GAGAAGCTTG ATTCTTTGCT CTCAGACTAC GATATTCTCT 300 
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CTTTATCTAA TATCCAGCAG CATTCGGTAA GAAAAAGAGA TCTACAGACT TCAACACATG 360 

TAGAAACACT AC TAACTTTT TCAGCTTTGA AAAGGCATTT TAAATTATAC CTGACATCAA 420 

GTACTGAR.CG TTTTTCACAA AATTTCAAGG TCGTGGTGGT GGATGGTAAA AACGAAAGCG 460 

AGTACACTGC AAAATGGCAG GACTTCTTCA CTGGACACGT GGTTGGTGAG CCTGACTCTA 540 

GGGTTCTAGC CCACATAAGA GATGATGATG TTATAATCAG AATCAACACA GATGGGGCCG 600 

AATATAACAT AGAGCCACTT TGGAGATTTG TTAATGATAC CAAAGACAAA AGAATGTTAG 660 

TTTATAAATC TGAAGATATC AAGAATGTTT CACGTTTGCA GTCTCCAAAA GTGTGTGGTT 72 D 

ATTTAAAAGT GGATAATGAA GAGTTGCTCC CAAAAGGGTT AGTAGACAGA GAACCACCTG 780 

AAGAGCtTGT TCATCGAGTG AAAAGAAGAG CTGACCCAGA TCCCATGAAG AACACGTGTA 840 

AATTATTQGT GGTAGCAGAT CATCGCTTCT ACAGATACAT GGGCAGAGGG GAAGAGAGTA 900 

CAACTACAAA TTACTTAATA GAGCTAATTG ACAGAGTTGA TGACATCTAT CGGAACACTT 960 

CATGGGATAA TGCAGGTTTT AAAGGCTATG GAATACAGAT AGAGCAGATT CGCATTCTCA 1020 

AGTCTCCACA AGAGGTAAAA CCTGGTGAAA AGCACTACAA CATGGCAAAA AGTTACCCAA 1080 

ATGAAGAAAA GGATGCTTGG GATGTGAAGA TGTTGCTAGA GCAATTTAGC TTTGATATAG 1140 

CTGAGGAAGC ATCTAAAGT V TGCTTGGCAC ACCTTTTCAC ATACCAAGAT TTTGATATGG 1200 

GAACTCTTOG ATTAGCTTAT GTTGGCTCTC CCAGAGCAAA CAGCCATGGA GGTOTTTGTC 12 GO 

CAAAGGCTTA TTATAGCCCA GTTGGGAAGA AAAATATCTA TTTGAATAGT GGTTTGACGA 1320 

GCACAAAGAA TTATGGTAAA ACCATCCTTA CAAAGGAAGC TGACCTGGTT ACAACTCATG 13 BO 

AATTGGGACA TAATTTTGGA GCAOAACATG ATCCGGATGG TCTAGCAOAA TGTGCCC03A 1440 

ATGAGGACCA GGGAGGGAAA TATGTCATGT ATCCCATAGC TGTOAGTGGC GATCACGAGA 1500 

ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAATCTA TAAGACCATT GAAAGTAAGG 1560 

CCCAGGAGTG TTTTCAAGAA CGCAGCAATA AAGTTTGTGG OAAiCTCGAGG GTGGATGAAG 1620 

GAGAAGAGTG TGATCCTGGG ATCATGTATC TGAACAACGA CACCTOCTGC AACAGCGACT 1680 

GCACGTTGAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAG TCCTTGCTGT AAAAACTGTC 1740 

AGTTTGAGAC TGCCCAGAAG AAGTGCCAGG AGGCGATTAA TGCTACTTGC AAAGGCGTGT 1900 

CCTACTGCAC AGGTAATAGC AGTGAGTGCC CGCCTCCAGG AAATGCTGAA AATGACACTG 1860 

TTTGCTTGGA TCTTGGCAAG TGTAAGGATG GGAAATGCAT CCCTTTCTGC GAGAGGGAAC 1920 

AGCAGCTGGA GTCCTGTGCA TGTAATGAAA CTGACAACTC CTGCAAGGTG TGCTGCAGGG 1980 

ACCTTTCTGG CCGCTGTGTG CCCTAIGTCG ATGCTGAACA AAAGAACTTA TTTTTGAGGA 2040 

AAGGAAAGCC CTGTACAGTA GGATTTTGTG ACATGAATGG CAAATGTGAQ AAACGAGTAC 2100 

AGGATGTAAT TGAACGATTT TGGGATTTCA TTGACCAGCT GAGCATCAAT ACTTTTGGAA 2160 

AOTTTTTAGC AGACAACATC GTTGOGTCTG TCCTGGTTTT CTCCTTGATA TTTTGGATTC 2220 

CxrrCAGCAT TCTTGTCCAT TGTGTGGATA AGAAATTGGA TAAACAGTAT GAATCTCTGT 2260 

CTCTOTTTCA CCCCAGTAAC GTCGAAATGC TGAGCAGCAT GGATTCTOCA TCGGTTCGCA 2340 

TTATCAAACC CT1TCCTGCG CCCCAGACTC CAQGCCGCCT GCAGCCTGCC CCTGTGATCC 2400 

CTTCGGOGCC AGCAGCTCCA AAACTGGACC AOCAGAGAAT GGACACCATC CAGGAAGACC 2460 

CCAGCACAGA CTCCCATATG GACGAGGATG GGTTTGAGAA GGACCCCTTC CCAAATAGCA 2520 

GCACAGC1GC CAAGTCATTT GAGGATCTCA CGGACCATCC GGTCGCCAGA AGTGAAAAGG 2S80 

CTGCCTCCTT TAAACTGCAG CGTCAGAATC GTGTTAACAG CAAAGAAACA GAGTGCTAAT 2640 

TTAGTTCTCA GCTCTTCTOA CTTAAGTGTG CAAAATATTT MATAGATTT OACCTACAAA 2700 

TCAATCACAG CTTGTATTTT GTGAAGACTG GGAAGTGACT TAGCAGATGC TGGTCATGTG 2760 

TTTGAACTTC CTGCAGGTAA ACAGTTCTTG TGTGGTTTGG CCCTTCTOCT TTTGAAAAGG 2B20 

TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT TTTAGTTTTT AAAATATCTT 2660 

TTGACCTOTG GTGCAAAAOC AGAAAATACA GCTGGATTGG GTTA2GAATA TTT ACGTTTT 2940 

tgtaaattaa tcttttatat tgataacagg actgactagg GAAATGATCA GTXTTTTTTT 3000 

ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA GAAAAOTGGA 3 060 

ATAGTTTTTT TTT1TTTTTI TTTTTTTTGC CTTCAACTAA AAACAAAGGA GATAAATTTA 3120 

GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT TTTTATGTAG 3180 

CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC TTCATAATTC 3240 

TAAQACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA TG GTAG CCAG 3300 

TTGAATTTAT GOAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG JCTTTTCTGTA 3360 

TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT AGAAAATTCA 3420 

CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGOAjGAG GCTGAGGTTG 3460 
CGCCACTACA CTCCAGCClXS GGTGACAGAG TGAGATCTGC CTC 



Seq ID UO: 354 Protein B€OU^nGfe 
Proteiu AcceaBion #; nsn?_ao3i74.2 



1 
1 

MRQSIJUPLT3 
OjTSTHVSTLIf 
GEPDSRVLAH 
PKVCGYLKVD 



AKSYPKBKKD 
HGGVCPKAYY 
ABCAPNEDQQ 



TCNGVSYCTG 
KVOCSDXtGGR 
XNTFGKFIAD 
SASVRI3KPF 
PFPNSSTAAK 



11 
I 

WPFVIiAPRP 
TPSAIjKRHFK 
IRIJDDV11RI 
JJEELLPKGLV 
1>IELIDKVDD 
AWOVKMLLEQ 
SPVGKKEXYii 
GKtfVMYPIAV 
PGIMYLNNDT 
NSSBCPPPGH 
CVPYVDABQK 
M1VGSVLVFS 
PAPQTPGRLQ 
8FEDIaTDHPV 



21 
I 

PDDPGFGPHQ 
LYIaTSSTERF 
NTDGAEYNIE 
UHEPEEELVH 
IYENTSWDNA 
FSFDIAEEAS 
MSGLTSTKNY 
JBGDHENNKMF 
CCNSDCTlaKE 
AENDTVCLDI) 
33LPLRKGKPC 
LIFWIPFfllL 
PAPVJPSAPA 
ASSEKAASFK 



31 
1 

RLEKLDSLLS 
SQNFKVWVD 
PLWRFVNPTK 
RVKERADPDP 
GFKGYGIQIE 
KVCLAHLFTY 
GKTILTKEAD 
SNCSKOfilYK 
GVQC8DRNSP 
GKCKDGKCIP 
TVGFCDMNGK 
VECVDKKLDK 
APKIiIJHQRMD 



41 
I 

mmiiiSLSNi 

GKNESEYTAK 
DKHMLVYKSE 
MRHTCKIiIiW 
QIRILKSPQB 
QDFDMGT1GL 
IiVTTHBLGHN 
TIE3KAQECF 
CCKNCQFETA 
FCEREQ QlaES 
CBKRVQDVIB 
QYBSLSXkFHP 
TIQEDPSTDS 
ETEC 



51 
1 

QQHSJVRKRDL 
MQDFFTGHW 
DIKJJVSSOjQS 
ADHRFYRYMG 



FGAEHDPDGL 
QERSNKVCGN 
QKKCQEAXBiA 
CACNETDN8C 
RFWDFIEQLS 



60 
120 
18D 
240 
300 
36D 
420 
480 
540 
600 
660 
720 
780 



seq id NO? 355 una sequence 

Muclelc Acid Accession ft= MM_021832.1 

Coding sequence: 164.. 2246 

1 11 21 31 41 31 

i I I I I I 

TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGGGCGGG AGGAAAAGAG GATTGAGGGG 
CIAGGCOGGG CGGATCCCGT CCTCCCCCGA TGTGAGCAGT TTTCCGAAAC CCCGTCAGGC 
GAAQGCTGCC CAGAGAGGTG QAGTCGGTAG CGGGGCCGGG AACATGAGGC AGTCTCTCCT 
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ATTCCTGACC AGCGTGGTTC CTTTCGTGCT GGCGCCGCGA CCTCCGGATG ACCCGGGCTT 240 
CGGCCCCCAC CAGAGACTCG AGAAGCTTGA TTCITTGCTC TCAGACTACG ATATTCTCTC 300 
TTTAT CTAAT ATCCAGCAGC ATTCGGTAAG AAAAAGAGAT CT ACAGACT T CAACACATGT 360 
AGAAACACTA CTAACTTTTT CAGCTTTGAA AAGGCATTTT AAATTATACC TGACATCAAG 420 
5 TACTGAACGT TTTTCACAAA ATTTCAAGGT CGTQOTGGTG GATGGTAAAA ACGAAAGCGA 480 

GTACACTGTA AAATQ □ CAGG ACTTCTTCAC TGGACACGTG GTTGGTGAGC CTQACTCTAG 540 
GGTTCTAGCC CACATAAGAG ATGATGATGT TATAATCAGA ATCAACACAG ATGGGGCCGA 600 
ATATAACATA GAGCCACTTT GGAGATTTGT TAATGATACC AAAGACAAAA GAATGTTAGT 660 
TTATAAATCT GAAGATATCA AGAATGTTTC ACGTTTGCAG TCTCCAAAAG TGTGTGGTTA 720 
10 TTTAAAAGTG- GATAATGAAG AGTTGCTCCC AAAAGGGTTA GTAGACAGAG AACCACCTGA 780 

AGAGCTTGTT CATCGAGTGA AAAGAAGAGC TGACCCAGAT CCCATGAAGA ACACGTGTAA 840 
ATTATTGGTG GTAGCAQATC ATCGCTTCTA CAGATACATG GGCAGAGGGG AAGAOAGTAC 900 
AACTACAAAT TACTTAATAG AGCTAATTGA CAGA3TTGAT GACATCTATC GGAACACTTC 960 
ATGGGATAAT GCAGGTTTTA AAGGCTATQO AATACAGATA GAGCAGATTC GCATTCTCAA 1020 
15 GTCTCCACAA QAGGTAAAAC CTGGTGAAAA GCACTACAAC ATGGCAAAAA GTTACCCAAA 10BO 

TGAAQAAAAG GATGCTTQGG ATGTGAAGAT GTTGCTAGAG CAATTTAGCT TTGATATAGC 1140 
TGAGGAAGCA TCTAAAGTTT GCTTGGCACA CCTTTTCACA TACCAAOATT TTGATATGGG 1200 
AACTCTTGGA TTAGCTTATG TTGGCTCTCC CAGAGCAAAC AGCCATGGAG GTGTTTGTCC 1260 
AAAGGCTTAT TATAGCCCAG TTGGGAAGAA AAATAT CT AT TTGAATAGTG GTTTGACGAG 1320 
20 CACAAAGAAT TATGGTAAAA CCATCCTTAC AAAGGAAGCT GACCTGGTTA CAACTCATGA 1380 

ATTGGGACAT AATTTTGGAG CAGAACATGA TOCGGATGGT CTAGCAGAAT GTGCCCCGAA 1440 
TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTOAGTGGOG ATCACOAGAA 1500 
C&ATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 
CCAGQAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAQGG TGGATGAAGG 1620 
25 AGAAGAGTGT GATCCTGGCA TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGCGACTG 1680 

CACGTTGAAG GAAGGTGTCC AGTQCAGTGA CAGGAACAGT OCTTGCTGTA AAAACTGTCA 1740 
GTTTGAGACT GCCCAfiAAGA AGTGCCAGGA GG COATTAAT GCTACTTGCA AAGGCGTGTC 1800 
CTACTGGACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA AATGCTGAAG ATGACACTGr 1860 
XXGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGCG AGAGGGAACA 1920 
30 GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 19H0 

CCTTTOCGGC CGCTGTGTGC CCTATGTCGA T6CTGAACAA AAGAACTTAT TTTTGAGGAA 2040 
AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 
GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AG CAT CAAT A CTTTTGGAAA 2160 
GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC T CCT TG AT AT TTTQGATTCC 222 0 
35 TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 2280 

CGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 2340 
CTGTGATOCC TTCGGCGOCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 2400 
AGGAAGACCC CAGCACAGAC TCACATATGG ACGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 
CAAATAGCAG CACAGCTGCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2S20 
40 GTGAAAAGGC TGCCTCCTTT AAACTGCAQC GTCAGAATCG TGTTGACAGC AAAGAAACAG 2580 

AGTGCTAATT TAGTTCTCAG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 
ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGATGCIG 2700 
GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC! GTTCTCCTTT 2760 
TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 2820 
45 AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGG TTATGAGTAT 2880 

TTACGTTTTT GTAAATTAAT CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 2940 
GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 3000 
GAAAAOTGGA ATAGTTCTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGQA 3060 
OATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 3120 
50 TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 

TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 3240 
TGGTAGCCAG TTGAATTTAT GGAATCTACC AACIGTTTAG GGCCCTGATT TGCTGGGCAG 3300 
TTTTXCTGTA TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGT CTT 3360 
AGAAAATTCA CTATTGGCTG GOAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
55 3421 GCTGAGGTTG CGCCACTACA CPOCAGCCTG GGTGACAGAG TOAGATCTGC CTC 



Seqc ID NO: 356 Protein sequence 
Protein Accession # = hp_068604.1 

60 1 11 21 *1 41 51 

I I I I 1 I 

MRQSLiliFIiTS WPFV1APRP PBDPGFGPHQ RLEKLD9U»S DYDILSLSNI QQHSVRKRDL 60 

QTSTHVBTIiIi TFBALKRHFK LYliTSSTBRF SQNFKWWD GKNESEYTVK WQDFFTGHW 120 

QEPDSRVLAH IKDDDVIIM NTPGAEYNIE PLWRFVNDTK DKHMLVYKSE DIKJJVSRLQB 180 

65 PKVOGYIiKVD MEELLPKGIjV EREPPEFTjVE KVKRRADEDP KKNTCKLLVV ADHKPTRYMG 240 

RGEBSTTTNY T*X ELI DRVDD IYRNTBWDNA GFKGYGIQIB QIRILK3PQE VKPGEKHYNM 300 

AJCSyPKBBKD AWDVKMI»LPQ PSPDIABEAS KVCLlAHLFTY QDFDMGTLGL AYVGSPRANS 360 

EBGVCPKAYY 9FVGKKEJIYI* N3GLTSTKNY GKTHjTKEAD LVTTHELGHN FGABHDPDGL 420 

AECAPNEDQG GKYVMfPIAV SGDHENKKMF SNCSKQS1YK TIESICAQECP QERSNKVCGN 480 

70 SRVDBGBECD PGIMYLWNDT CCNBDCTLKE GVQCSDRKSP CCKNCQFETA QKKOQBAIUA 540 

TCKGVSYCTO NSSECPPPGW AEDDTVCL&L GKCKDGKCIP PCERBQQLES CACNETDNSC 60O 

XVCCRDI^GR CVPYVDABQK HIjFLRKGKPC TVGFCDMRGK CEKKVQDVIE RFWDPIDQliS 660 
INTFGKFIiAD BTIVGSVIiVFS LIFMIPFSIL VHCV 

75 Beg IP NOs 357 DMA sequence 

MUcleic Acid Accesaion 41: MM_004994.1 
Coding sequences 20.. 2143 

0 ^ 1 11 21 31 41 51 

SO | I ) \ 1 | 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GQGCTGCTGC tttgctgccc CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180 

CACTCGGGTG DCAGAGATGC GTGGAGAGTC GAAATCTCTG GGSCCTGCQC TGCTGCTTCT 240 
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CCAGAAGCAA CTGTCCCTQC CCGAGACCGG 
GCGAACCCCA CGGTGCGGGG TCCCAGACCT 
CAAGTGGCAC CACCACAACA TCACCTATTG 
GGCGGTGATT GACGACGCCT TTGCCCGCGC 
5 CACCTTCACT CGCGTGTACA GCCGGGACGC 

GCACOOAOAC GGGTATCCCT TCGACGGGAA 
TGGCCCCGGC ATTCAGGGAG ACGCCCA.TTT 
GGGCGTCGTG GTTCCAACTC GGTTTGGAAA 
CATCTTCGAG GGCCGCTGCT ACTCTGCCTG 
10 CTGGTGCAGT ACCACGGCCA ACTACGACAC 

GAGACTCTAC ACCCGGGACG GCAATGCTOA 
CCAAGGCCAA TCCTACTCOG CCTGCACCAC 
CGCCACCACC GCCAACTACG ACCGGGACAA 
CTCGACGGTG ATGGGGGGCA ACTCGGCGGG 
15 GGGTAAGGAG TACTCQACCT GTAOCAGCOA 

TACCACCTOB AACTTTGACA GOGACAAGAA 
TTTGTTCCTC GTGGCGGCGC ATGAGTTCQG 
GCCGGAGGCG CTCATGTACC CTATGTACCG 
CQACGTGAAT GGCATCCGGC ACCTCTATGG 
20 AACCACCACC ACACCGCAGC CCACGGCXCC 
TGTCCACCCC TCAGAGCGCC CCACAGCTGG 
AGGTCCCOCC ACTGCTGGCC CTTCTACGGC 
TQCCTGCAAC GTGAACATCT TOGACGCCAT 
CAAGGATGGG AAGTACTGGC QATICTCTGA 
25 CCTTATCGCC GACAAGTGGC CCGCSCTGCC 

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG 
GGTGCTGGGC CCGAGGCGTJC TGGACAAGCT 
GGGGGCCCTC CGGAGTGGCA GGGGGAAGAT 
GTTCGACGTQ AAGGOGCAGA TGGTGGATCC 
30 CCCCGGGGTG CCTTTGGACA CGCACGACGT 
CCAGGACCGC TTCTACTGGC GCOTGAGTTC 
GGGCTAOGTG ACCTATGACA TCCTGCAGTG 
GCAGTGCCAT GTAAATCCCC ACTGGOACCA 
CAAACTGGTA TTCTGTTCTG GAGGAAAGGG 
35 TCACCTTTGT TTTTTGTTGG AGTGTTTCTA 



TGAGCTGGAT AGCGCCAOGC TGAAGGCCAT 300 

GGGCAGATTC CAAACCTTTG AGGGOGACCT 3 60 

GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420 

CTTCGCACTO TGGAGCGCGG TGAGGCCGCT 480 

AGACATCGTC ATCCAGTTTG GTGTOGOGGA 540 

(3GACGGGCTC CTGGCACAGG CCTTTCCTCC 600 

CGAOGATGAC GAQTTGTGGT CCCTGGGCAA 660 

CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CACCACCGAC GGTOGCTCOG AGGGCTTGCC 7B0 

COACGACCGG TTTGGCTTCT GCCCCAGCGA B40 

TGGGAAACCC TGCCAGTTTC CAT T CATCTT 900 

GGACGGTGGC TCCGACGGCT ACCGCIGGTG 960 

GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020 

GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGCCGOGGA GATGGGCGCC TCTGGTGCGC 1140 

GTBGGGCTTC TGCCCGGACC AAGGATACAG 1200 

CCACGCQCTG GGCTTAGATC ATTCCTCAGT 1260 

CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

TOCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

CCCOACGGTC TGCCCCACCG GACCCCCCAC 1440 

CGGCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

CACTACTGTG CCTTTOAGTC CGGTGGACGA 1560 

CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1660 

CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740 

GCGCCAGGTG TOGGTGTACA CAGGCGOGTC 1800 

GGGCCTBGGA GOCGAOGTGG CCCAGGTGAC I860 

GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920 

CCGGAGCGCC AGCGAGGTGG ACGGGATGTT 1980 

CTTCCAGTAC OGAGAGAAAG CCTATTTCTG 20 40 

CCGQAGTGAG TTGAACCAGG TGGACCAAGT 2100 

CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

AOCCTOGGGA AGGAGOCAGT TTGCCGGATA 2220 

AGGAGTGGAG GTGGGCTGGG CCCTCXCTTC 2280 
ATAAACTTGG ATTCTCTAAC CTTT 



Seq ID NO: 35B Protein sequence 
Protein Accession #* HP_0D4985.l 

40 1 11 21 31 41 51 

I 1 I 1 I t 

MSLWQP1jVI*V IiVLGCCPAA PHQRQSTIiVI. FPGDLRTNLT DRQLAEEYLY RYGYT RYAEM 60 

RGESKSLGPA IiLIiLQKQLSL PE TO E EiD SAT LKAMRTPRCG VPDIjGRFQTF EGDLKWEHHK 120 

ITYWIQNYSH DLPRAVIDDA F ARAFALW 9 A "VTPLTPTHVY SRDADIVIQP GVAEHGDGYP 180 

45 JTOGKDGIiIjAH AFPPGPGIQG DAHFDDDKLW SLGKGWVFT HFGNADGAAC HFPFIFEGRS 240 

YSACTTDGRS DGLPWCSTTA WYDTDDRFGF CPSERLYTRD GNAEGKPCQF PPIF QGQ5 YS 300 

ACTTDGRSDG YPJHCATTANY DRDXLFGFCP TRADSTVMGG HSAGELCVFP FTF IiGKE YST 360 

CTSEGEGDGa LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVFEAIiMY 420 

PWYRFTEGPP IiHKBDVNGIH HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480 

50 PTAGPTGPPfi AGPTGPPTAG P9TATTVPt>S FVDDACNVNI FPAXABIGNQ LYIiFKDGKYV? 540 

RFSEGRGSRP QGPFL.IADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPER 600 

LDKLGLGADV AQVTGALR9G RGKMLIiFSGR RLHRFDVKAQ MVDEHSASEV DHMFPGVPLD 660 
TBDVFOYBHK AYFGQDSlPYW RVSSRSEEUQ VDQVGYVTYD ILQCPED 

55 Seq ID NOx 359 DMA sequence 

Nucleic Acid Accession #: NM_000213.1 
Coding sequence: 127., 5385 



1 11 21 31 41 SI 

60 I ! | i t I 

CGCCCGCBCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GOGCGGAGGG AGCGAGTCCG 60 

CCOCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGOCGQ GAGAGGGAGG 120 

AAGAGGATGG CAGGGCCACG CCGCAGCCCA TGGGCCAGGC TQCTCCTGGC AGCCTTGATC 180 

AGOGTCRGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGl? GAAGAGCTGC 240 

65 ACG3AGTGTG TCCGTOTGGA TAAGGACTGC QCCXACTGCA CAGACGAGAT GTTCAGGGAC 300 

■ CGGCGCTGCA ACACCCAGGC GGAGCTGCXG GCCGCGGGCT GCCAGCGGGA GAG CAT CG TO 360 

GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CGTGCGGCGC 420 

AGCCAGATGT COCCCCAAGG CCTGCGGGTC CGTCTGCGGC OCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC ACTOGRGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 

70 TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600 

GTGCTGAGCC AGCTCAOCAG OGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660 

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 72 D 

CCCCOCTTCT CCTTCAAGAA CCfTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGGAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840 

75 ATOCTGCAGA CAGOTGTGTG CRCGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900 

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAQGCTG ATGGCGCCAA GDTGCTGGCT 960 

GGCATCATGA QCCGCAAOGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020 

TACAGOACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTOGC CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTOT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

80 TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTC3CAACAT OGTGGAGCTQ 1200 

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA XCOGGGCOCT AGACAG CCCC 1260 

CGAGGCCnC GGACAGAGGT CACCTCCAAG ATGTTOCAGA AGACGAGGAC TGGGTCCTTT 1320 

CACATCOGQC GGGGGQAAGT GGGEATATAC CAGGTGCAGC TGCGGGOCCT TGAGCACGTG 1380 

GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440 
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TCCTTCTCCQ ACGGCCTCAA GATGGACGCG GQCATCATCT GTQATGTGTG CACCTGCGAG 1500 
CTQCAAAAAG AGGTGCOGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT QTG CGGACAG 1S60 
TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620 
GACftTTCAQC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAOTOC 1S80 
5 CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG AOGCTGCTCC 1800 
ATGGGCCAGtT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC I860 
AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CIGTGAGTGT 1920 
GGCOTCTGCC ACTGCCACCA □CAGTOGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980 
10 TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040 
GGCACCGGCG AGAAGAAGGG GQGCACGTGT GAGGAATGCA ACTTCAAQGT CAAGATGGTG 2100 
GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 
GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGOGCCC CTGGGCCCAA CAGCACTGTC 2220 
CTGGTGCACA AGAAGAAGGA CTGCCCXCCG GGCTCCTTCT GGTGGCTCAT CCCCC7TGCTC 2280 
15 CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CT GtGCC TGC 2340 
TOCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAADCGAG GTCftCATQGT GGGCTTTAAG 2400 
GAAGACCACT ACATGCTGCG GGAGAACCTG ATGSCCTCTG ACCACTTGGA CACGCCCATQ 2460 
CTGCGCAGCG GGAACCTCAA GGGCCBTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 
CAGOGGCCTG GCTTTGCCAC TCATGCCGCC ASCATCAACC CCACAGAGCT GGTSCCCTAC 25B0 
20 GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAAOC TGCTGAAGCC TGACACTCGG 2640 
GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 
TCCQGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGQCAtSC AGCCCAATGC CGGGAAAAflG 2760 
CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 282 0 
CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCrXCC ACGACCTCAA GGTGGCCCCC 2880 
25 GGCTACTACA CCCTCACTGC AGAOCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940 
GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGAOGAGAAG 3000 
CAGCTGCIGG TGGAGGCCAT CGACGTOCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060 
GTAAACATCA CCATCATCAA GQAGCAAGCC AGRGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 
TTCTCGGTCA GCOGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCOGGCG TCTCCTGQAC 3190 
30 GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGa CAACCGGGAC 3240 
TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 
GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCGGCCA GGTCCGGCGT 33 60 
TTCCACOTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 
ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480 
35 TCACAGCCAC CCCCTCACGG CGACCTGOGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3540 

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3 60O 
GTAAAGTACT GGATTCAGGG TGACTCCGAA TCGGAAQOCC ACCTOCTCGA CAGCAAGGTG 3660 
CCCTCAGTGG AGCTCAGCAA CCTG-EACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 
TACGGGGCTC AGGGCGAGGG ACCCTACRGC TCCCTGGTGT CCTGCOGCAC CCACCAGGAA 3780 
40 GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATG1CGTCT CCTCCACGGT GACCCAGCTG 3840 
AGCTOGGCTG AGCCGGCIGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900 
CTGGTCAACG ATGACAACCG AGCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACOCT 3960 
AAGAAGCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGOCCTACCG CTACACGGTG 4020 
AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAQCGGGAGG CCATCATCAA CCTGGCCACC 4080 
45 CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT QQACGCCCAG 4140 

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTAOS CTCTCCATCG 4200 
GGCAGCCAGA GGCCCAGCOT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260 
TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGGC 4320 
TATGOCACCC ACCTGAOCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 43 80 
50 ACAOGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 
GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4S00 
GACACGCCCA COCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 
CAGGAGCCGC GGTGCQAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 
GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AOACCTCGGT GGTGGTGGAA 4680 
55 GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGX3 4740 
GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 
TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG COCGCTGGTG 4860 
TTCACTGCCC TGAGCCCAGA CTOGCTGCAG CTGAGCTGGG AQCGGCCACG GAGGCCCAAT 4920 
GGGGATATCG TCGGCTACCT GGTGACCTQT GAGATGGCCC AAGGAGGAGG GCCAGCCAGC 4980 
60 GCATTCCQGG TGGATGGAQA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 
AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 
GAGGGCATCA TCACCATAGA GXCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCOT 5160 
GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 
AGCQCCACCG AGCCCTTCCT AGTGGATGQG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 
65 GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 
GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA S400 
CCCCCGCCAT GTCCCACTAO GCGTCCTCCC QACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 
CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 
TCCTGGGAGa CATGAAGGGG GCAAGGTCCO TCCXCTGTG3 GCCCAAACCT ATTTOTAACC 5580 
70 AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
TACTG 

fieq ID KO= 360 Protein sequence 
Protein Accession #: NP__000204.1 

1 11 21 31 41 51 

{1)11) 

MAGPRPSFWA RlilAALISV ei^SGTIiftNEC KKAPVK9CTE CVRVDKDCAY CTD31PRDRR 60 

cmtqakMjAA gcqresiww bssfqiteet qidttlhrsq mspqglrvrl rpgeerhpel 120 
evfeplespv dlytijmdffin sm3ddxunlk kmgoniarvl sqktsdytig fgkpvdkv5v 180 
po/rt*0rpekl kepmphsdpp psfkbvislt edvdkfrnkl qgerisgwm) apeggfdail 240 

QTAVCTRDIG WRPDSTHLLV PSTE8AFHYE ADGANVLAGI M9RKDERCHI* DTTGTYTQYR 300 
TQDYPSVPTL VMiLAKHNII PI FAVTNYS Y SYTfEKLHTYF PVSSLGVL0JB PSSNIVEbLE 360 
EAPNRITISHI* DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RHGEVGIYQV QUtALEHVDG 420 
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THVCQ1»PEDQ KGNIHLKP&F SDGLKMDAGI ICDVCTCELQ KEVRSARCSF WGDFVCGQCV 
C8EGWSGQTC NCSTGSLSDT QPCLRBGEDK PCSGRGECQC GHCVCYGBGR YEGQFCBYDN 
FQCPRTSGPI- CSDRGRCSMG QCVCEPGWTG P9CDCPI»SNA TCIDSNGGIC NGRGHCECGR 
CHCHQQSIiYT DTICEINYSA IHPGIiCEDliR SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 
LKRAEEVWR CSFRDEDDDC TYSYTMBGDG APG PNS TVIiV HKKKDCPPG5 FTWtlPLLIiIi 
r.T.PT.T AT.TJ. T. LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLREH1MA SDHLDTPMLR 
SGNIiKQRDW RWKVTMNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE WIAKPDTREC 
AQLRQBVEEN EiHEVYRQISG VHKLQQTKFR OjQPNAGKKQP HTXVDTVLMA PRSAKPALIiK 
LTEKQVEQRA FHDDKVAPGY YTLTADQDAR GHVEFQEGVB LtVDVRVPIoFI RPEDDDEKQL 
LVEAIDVPAG TATIiGRRLVH ITIIKEQARD WSFEQPEFS VSRGDQVARI PVIRRVLDGG 
K9GFVSYRTQD GTAQGNRDYI PVEGELtiFQP GEAWKELQVK LI1ELQEVD8I1 URGRQVHHFH 
VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTS0ML9SQ PPPHGDLGAP QNFMAKAAGS 
RKIKPKWE.PP SGKPMGYRVX YWIQGDSESE AHLIiDSKVPS VEUTHIiYPYC DYEMKVCAYG 
AQGEGP-YSar. VSCRTHQEVP SEPGRIAFUV VSETVTQL9W AEPAETNGEI TAYEVCYGLV 
NDDNRPIGPM KK7I»VDNPKN RKLLIEMLRE SQPYRYTVKA RNGAGWGPE& EAIIWIATQF 
KHPMSIPIIP DIPIVDAQSG EDYD3FL.MYS DDVIsRSPSGS QRPSVSBDTE HLVNGRMDFA 
PPG9TNSLHR MTTTSAAAYG THLSPHVPHR VLSTflfiTLTR DYNSLTPJSEH SHSTTLPRDY 
STLTSVSSRD SRLTA6VPDT PTRLVFSALG PTSIiRVSTWQE PRCERPLQGY SVEYQIaLNGG 
HIJIRIiNIPMP AQTSVVVBDLt LiPJJSfiYVFRV RAQSQBGWGR EREOVITIES QjVHPQSPIiCP 
IjPGSAFTLST PSAPQPIiVFT ALSPDS&QI.S KERPRRPNGD IVGYLVTCEM AQGGGPATAF 
RVDGDSPESR LTVPGLSENV PYKFXVQART TEGFGPERBG UTIESQDGG PFPQIXJ9RAG 
liFQHPLQSBY SSITTTHTSA TBPFLVDGPT LGAQHLEAGG Si»TRHVTQEF VSRTLTTfiGT 
IiSTHKDQQFF QT 

Seq IS NO i 361 DMA sequence 
Nucleic Acid Accession #: nm_013332.i 
Coding sequence • l . . 63 

1 11 21 31 41 51 

I I I I I I 

□CACGAGGGC GCTTTIGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 60 

AGTAACCGAC TTTCCTCCGG ACTCCTGCAC QACCTGCTCC TACftSCCGGC GATCCACTCC 120 

□GQCTGTTCC CCCGQAGGGX CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 1BO 

GCAGAGGAOT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGOTGT 240 

BGTACTGACG CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 300 

GAGCCCATCG CCTGGGACCT CX2IGGACCAC CAGAAGCCAA CTAOCCAACA CAGAGCCCAC 360 

CAAGQGCCTT CCAGACCATC CATCCAOAAG CATGTGATAA GACCTCCTTC CATACTGGOC 42 0 

ATATTTTQGA ACRCTGACCT A8ACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 480 

TGAGCACOGT TGTAACCAGA GAACTATTAC TAGGCCTT0A AGAACCTGTC TAACTGGATG 540 

CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGCGGTG GC7CATGCCT GTAATCCTAG 600 

CACirTGGGA GGCTQAGGTG GGTGGATCAC CTGAGGTCAO GAGTTCGAGA CCAGCCTCGC 660 

CAACATGOCG AAACCCCATC TCTACTAAAA RTACAAAAGT TAGCTGGGTG TGGTGGCAQA 720 

GGCCTGTAAT CCCAGTTCCO* TGGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCOBGGGACG 780 

GAGGTTGCAG TGAAOGGAGA TDGCACTGCT GTACCCAGCC TGGGCCACRG TGCAAGACTC 84ft 

CATCTCAAAA AAAAAAAQAA AAGAAAAAGC CTGTTTAATG CACRGGTGTG AGTGOATTGC 900 

TTATGGCTAT QAGATAGGTT GATCTOGCCG TTACCCCGGG GTCTGGTGTA TGCTGTGCTT 960 

TCCTCAGCAG TATGGCTCTQ ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGASATGG 1020 

TGATATTTTC AACCfCTACTT CCTAAACATC TGTCTQGGGT TCCTTTAQTC TTGAATGTCT 1DB0 

TATGCTCAAT TATTTQ0TGT TGAGCCTCTC TTCCACAAQA GCTCCTCCAT GTTTQGATAG 1140 

CAGTTOAAGA GGTTGTGTGG GTGGGCTGTT GGOAGTGAGG ATCGAQTOTT CAGTGCCCAT X200 

TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTS TGTATTGGTC TGAAGGGGGT 1260 

GGTQOGATGG CAAAGCCTGC TCAftGTTATG QACATTGTGG OCACCATGTG GCTTAAATGA 1320 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 

Seq ID NO i 362 Protein aegnence 
Protein Accession fti MP_0 37464.1 

1 11 21 31 41 51 

111)11 

MKHVUTLYIiL GWIjTIi1*9IF VRVMESIiEQI, ItESPSPGTSW TTRSQLANTE PTKOLPDHPS 60 
RSH 

Seq ID 230: 363 P3JA sequence 

Nucleic Acid Accession #= jflM_023 915.1 

Coding sequence: 250.. 1326 

1 11 21 31 41 51 

) I 1 I I 1 

BGCACOAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTOCCTGCC GACCTTAGTT 60 

TCAAAGCTTA TTCXTAATTA GAGACAAQAA ACCTGTTTCA ACTTGAAGAC ACOTTATGAG 12 D 

GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGA ATAA C CTATGCTGAA 180 

CtlCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AOTGCATCAC 240 

AACTGAAGAA TGGGGTTCAA CTTOACGCTT GCAAAATTAC CAAATAACGA GGTGCAOGGC 300 

CAAGAGAGTC ACAATTCAGG CAACAGGAGC GAGGGGCCAG GAAAGAACAC CACCCTTCAC 360 

AATGAATTTQ ACACAATTGT CTTGOCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 

TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA aOAATAAAAC CAQCTTCATA 4 BO 

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TQCATTTCGA 540 

ATAQTtCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 

TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TGCTTOGGCT GATAAGCATT 660 

GATOGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGXCTTT GCCAAACATC 7 B0 

ATCCTGACAA ATGGXCAGCC AACAaAOQAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTQA ACAGCTGCTT GTTTGTGGCC 900 

GTGCTGGTGA TTCTGATCGG ATGTTACATA GOCATATOCA GGTACATCCA CAAATCCAGC 960 



1092 



WO 03/042661 



PCT/US02/36810 
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AGGCAATTCA 
GTGGCTGTGT 
AGTCACTTAG 
AT TACACTTT 
TGTAGGTCAT 
ATCAGATCAC 
GTGTAGGCCT 
TTCATTATCC 



TAAGTCAQTC 
TTTTTACCTG 
ACAGGCTTTT 



TTTCAAGAAG 
TGCAAAGTGT 
TTTATTGTTT 
TTAAAAAAAA 



AAGCCGAAAG 
CTTTCTACCA 
AGATGAATCT 
GTGTAATGTT 
GCTGTTCAAA 
GAGAAGATCG 
GTTGGAATCG 
AA 



CGAAAACATA 
TATCACTTGT 
GCACAAAAAA 
TGCCTGGATC 
AAATCAAATA 
GAAQTTCGCA 
ATATGTACAA 



ACCAGAGCKT 
GCAGAATTCC 
TCCTATATTA. 
CAATAATTTA 
TCAGAACCAG 
TATATTATGA 
AGTGTAAATA 



Seg ID NO i 364 Protein sequence 
Protein Accession #i UP 076404 



i 
I 

HGPNLTLAKIi 
GLAVWIFFHI 
FYANMYTCSIV 
NGQPTEDNIH 
IGQSSRKRKH 
FLSAC23VCLD 



11 
I 

PNHEIiHGQES 
HNKTSFIFYL 
PLGLISIDRY 
ECSXLKSPLG 
NQSIRVWAV 
FIIYFFMCRS 



21 31 

1 ) 
HMfiGHRSDGP GKNTTLHNEF 
KNIWADLIM TLTFPPRXVH 
LKWKPFGDS RMYSITPTKV 
VKWHTAVTYV NSCLFVAVLV 
FFTCFLPYHI* CHIPFTFSHL 
FSRRX*FKK£>N irtrsesihs 



41 

1 

DAGFGPWYFK 
LSVCVWVIMA 
ILIGCYIAIS 
DRIiIiDESAQK 
LQSVHRfiTBVR 



Seq ID NO: 36S DNA sequence 

Nucleic Acid Accession # : NM_005365. 

Coding sequences 1- - 948 



1 
I 

ATGTCTCTCG 
GAGGACTTGG 
TCCrCTSACA 
CCTCAGGGAG 
QAGGGCTCCA 
GAGTTCATQT 
CACAAATATC 
AATTACAAQC 
TTTGGCACTG 
CTTQGCCrCT 
CTGATCATTG 
TGGGAAGCGT 
CCCAGGAAGC 
CCCGGCAGTG 
AG CTATGAGA 
CCATCCCTTr 



11 

1 

AGCAGAGGAG 
GCCTGATGGG 
GCAAGGAGGA 
GCGCTTCCTC 
GCAGTCAAGA 
TCCAAGAAGC 
GAGTCAAGGA 
GCTACTTTCC 
ATGTGAAGGA 
CGTGCGATAG 

TGAGTGTGAT 
TGCTCACCCA 
ATCCTGOGCA 
AGGTCATAAA 
ATGAAGAGGT 



21 
I 

rCCGCACTGC 
TGCACAGGAA 
GGAGGTGTCI 
CTCCAXTTCC 
AGAGGAAGAG 
ACTGAAATTG 
GCCGGTCACA 
TGTOATCTTC 
GGTGGACCCC 
CATGCIGGGT 
GATCCTAAOC 
GGGGGTGTAT 
AGATTGGGTG 
CTACGAGTTC 
TTATTTGGTC 
TTTGGGRGAG 



31 
1 

AAGCCTGATG 
CCCACAGGCG 
GCTGCTGGGT 
GTCTACTACA 
CCAAGCTCCT 
AAGGTGGCTG 
AAGGCAGAAA 
GGCAAAGCCT 
GCGGGCCACT 
GATGGTCATA 
AAAQACAACT 
GTTGGGAAGG 
CAGGAAAACT 
CTGTGGGGTT 
ATGCTCAATG 
GAGCAAGA3G 



AAGACCTTGA 



CATCAAGTCC 
CXTTATGGAG 
CGGTCGACCC 
AGTTGGTTCA 



CCGAGTTCAT 
CCTACATDCT 
GCATGCCCAA 
GCGCOOCTGA 
AGCACATGTT 
ACCTGGAGTA 
CCAAGGCCCA 
CAAGAGAGCC 
GAGTCTGA 



CAGGGTTGTT 
TTTTACTTTT 
CTGCAAAGAA 
CTTTTTCATG 
GAGTGAAAGC 
TTACACTGAT 
AAXGTTTCTT 



51 
1 

IIFVASILLN 
FIIiCRYTSVL 
VlSLPNIILT 
RYIHKSSRQF 
ILYYCKE1TL 
IYYDYTDV 



51 
1 

AGCCCAAGGA 
GACTACCTCC 
TCCCCAGAGT 
CCAATTGGAT 
AGCTCAGCTG 
TTTCCTGCTC 
CGTCATCAAA 
GCAGGTGATC 
TGTCACTGCT 
GGCCGCCCTC 
AGAGGTTATC 
CTACGGGGAG 
CCGGCAGGTG 
CGCTGAAACC 
CATCTGCTAC 



1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID HO? 366 Protein sequence 
Protein Accession tf: NP_O05356.1 

1 11 21 

1 I I 

HSLBQRSPHC KPUEDDEAQG SDl/GLMGAQE 
PQGGAgSsrS VYYTLWSQFD EGSS3QEEEE 
HKYRVKEPVT KA^CTjBSVIK KYKRYFPVIF 
I^GLSCDSMLG DGHSMPKAAL L 1 1 VTjGVILtT 
PRKLLTQDW QEKYLEYRQV PGGDPAHYEF 
PSLYEEVLGB EQEGV 



Seq ID MO> 367 DMA sequence 
Nucleic Acid Accession 4 : NM_014400 
Coding sequences 86,. 112 6 



l 
\ 

GGTTACTCAT 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 



11 
i 

CCTGGGCTCA 
GAGCAGGACG 
GCAGGCTGGC 
TGCGTGCAGA 
CCQGGCGTGG 
TGGCTGGCAQ 




CTGCAACGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTOTGTCCAG 



CCCTCGAATC 
CACATCTGTC 
GCCAGCGCCA 



TCCTGCAAAA 
ATTGGCAGCC 
AAATTTCCCT 
CCCACCACTG 
CTTCTGCTGC 



AAGCTCAACC 
AACGQCGTGG 
CCGCCGGTCG 

aacgtcacct 
gatgaattct 
tgccaggggt 

CCACCCCTTG 
ACGACTTCTA 
ACCAGTCAGA 
TTGACXGGAG 
GGGGOG CZCCC 
CITCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC 



AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT 
TCACCTCGCG 
AGTGCTACAG 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCOGCTGTAA 
TCGGGCTGCC 
CCTCGGCCCC 
CTCCGAGACA 
GGGCCGCTGG 
AGCAGCCCCA 
CCGTGGCTGC 
TCTCTGGCCC 
GCCCAGCCCC 
GGCTXTGGGA 



31 
1 

CCCBAGCTCG 
GCCCGCCAGG 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CAICCAGCTG 
GGCGCTCGAC 

CAAOGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGAGCTC 
OCCTOCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGTCCTA 
TGGGTAGCCC 
TGTTTTTOCA 
AATAAAATAC 



41 
1 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGOGGTGG 
CTCCCCGGCA 
CAGCAATGCG 
CCGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCCAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATGCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CrGTGAGCTT 
TCTTC3>CATC 
ACATTCCOCA 
CGTTGTATAT 1 



60 
120 
180 
240 
300 



31 41 51 

1 ! 1 

PTGEBSBT Tfi SSDSKEHEVS AAGSSSPPQS 
PSSSVDPAQL EFHFQEALKL KVABLVHFJJJ 
GKASEFMQVI FGTDVKEVDP AGHSYILVTA 
KDNCAPEEVI HSALSVMGVY VGKEHKPYGB 
LWGSKAHAET SYKKVXNYIAT ML51AREPICY 



51 
I 

AGGCAGGGGG 



AGGCCCTGGA 
TGAAGACAGT 
AGAOCATCCA 
AGAATGACOG 
CTCAGGATOG 
ATGAGAGTGC 
AOGGGTGGCA 
ACAAGGGCTG 
CTGTCCGGGG 
TCACGCTCAG 
CCTACTTCTC 
TGGCCTCAAC 
CCAAACCX3VT 
CCCGGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 



60 
120 
1B0 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



1093 



WO 03/042661 
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GGGTGTTCTA 

AGGATGCTAA 
GGTGGGACAA 
ATCGGTTCCC 
CTTATGTCTG 
TTGTATAGTG 



GCTTTTTGAG 
ATGTTAGGAC 
GCTTCCTACT 
TGGCTCCCCA 
CAT ATGTCX T 
TGTGTGATCA 
AAAAAAAA 



GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 
AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 
CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 
CTCTAAGCRC TGCCTCCCCT ACTCCCCGCA TCrrTOGGGA 
CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 
GTTTCTGGCA CATAAATGCC TCAATAAAGA T T TAATTACT 



1380 
1440 
1500 
156D 
1620 



Seq ID HO: 368 Protein gequence 
Protein AcceEBion #c VP 05 52 IS 



1 11 21 31 41 SI 

1 1 I I I 1 

MDPARKAGAQ AMIWTAGWLL IiIaLLiRGGAQA LECYSCVQKA DBQ C SPNKMK TVKCAPGVDV 

CTEAVGAVJET IHGOFSIAVX GCGSGI>PGKN DRGLDLHGLL AFIQLQQCAQ DRCBIAKLNLT 

SRALDPAGNE SAYPPNGVEC YSCVGLSHEA CQGTSPFWS CYNASDHVYK GCPDGNVTLT 

AANVTVS1£>V RGCVQDEFCT HDGVTGPGFT LfiGSCCQGSR QXSDLRNKTY FSFRIPPLVR 
LPPPEPTTVA STTSVTTSTS AFVRFTSTTK PMPAPTSQTP KQGVEHEAfiR DEEPRIjTGGA 

AQHQDRSNSG QYPAKGGPQQ PHtJKGCVAPT AGLAAI>ItIiAV AAGVLL 

Seq ID NO; 369 DMA sequence 

Nucleic Acid Accession #t NM_00S32 9.1 

Coding sequence s 1.-1662 



1 

i 

ATGCCGGTGC 
GTGCTQOGTG 
CACTAOCTGT 
CTTTTTGOCT 
TCCCCGCGGC 
TTGCGCAAGT 
GTGGTGGATG 
GGCQGCACCG 
GGTGAGACGG 
AGCACCTTCT 
TTCAAGGCCC 
GATCCRGCCT 
GTGQGGQGAG 
GTGCGOTACT 



GACTGGXACC 
AOCAAOGGAa 
ACAGAGACCC 
TACTTCCGGG 
TACOAGTCAG 
TTCTACCGGG 
ATTATCAAGG 
CTCTACTCCC 
ATCAACAAAT 
CTCATTCCTG 
TGCCAGGACC 



AAGAAGCCGG 



11 
1 

AGCTGACGAC 
GCRTCCTGGC 
CCTTCGGCCT 
TCCTGGAGCA 
GGGGCTCGGT 
GCCTGCGCTC 
GCAAGCGCCA 
AGCAGGCCGG 
AGGCCAGCCT 
CGTGCATCAT 
TCGGCGATTC 
GCACCATCGA 
ATGTCCAGAT 
GGATGGDCTT 
GTGGGCCCTT 
ATCAQAASTT 
TCCTGAGOCT 
CCACTAAGTA 
AGTGGCTCTA 
TGGTCACGGG 
Q CCGCATCTG 
CCACCTADGC 
TCCTCTATAT 
CTGGCTGGGG 
TGTCCATCTG 
TGTTCAGTGA 
GGGTGGCCCT 
AGCAGTACAG 



21 
I 

AGOCCTGCGT 
AGCCTATGTG 
GTACGGCGCC 
CCGGCGCATG 
GGCACTGTGC 
GGCCCAGCQC 
GGAGGAGGOC 
CTTCTTTGTG 
GCAGGAGGGC 
GCAGAAGTGQ 
G3TGGACTAC 
GATGCTTCOA 
CCTCAACAAG 
CAACGTGGAG 
GGGCATGTAC 
CCTAGGCAQC 
TGGCTACCGA 
CCTCCGGTGG 
CAACTCTCIG 



GTCCAOCCTT 
CACCTCTGGC 
GGTGGCAGTT 
GACAGAGCTA 
CCTCATGCTA 
CTTGGCTTTT 



31 

I 

GTGGTGGGCA 
ACGGGCTACC 
ATCCTGGGCC 
CGACGTGCCG 
ATTGCCGCAT 
ATCTCCTTCC 
TACATGCTGG 
TGGCGCAGCA 
ATGGACCGTG 
GGAGGCAAGC 
ATCCAGGTGT 
GTGCTGGAGG 
TAGQACTCAT 
OGGGCCTGCC 
CGCAACAGCC 
AAGTGCAGCT 
ACTAAGTATA 
CTCAACCAGC 
TGGTTCCATA 
TTCXTCCTCA 

ctcttcctgc 

CGGGGCAATG 
CTGCCOGCCA 
CGAAAAACCA 
CTCCTGGAGG 
GCCTTCCTTG 
TATCTGGCCA 
GCTGAGGTGT 



41 
I 

CCAGOCTGTT 
AGTTCATCCA 
TGCACCTGCT 
GCCAGGCCCT 
ACCAGGAGGA 
CTGACCTCAA 
ACATCTTCCA 
ACTTCCATGA 
TGCGGGATGT 
GCGAGGTCAT 
GCGACTCTGA 
AGGATCCCCA 
GGATTTCCTT 
AGTCCTACTT 
TCCTCCAGCA 
TCGGGGATGA 
CCGOGCGCTC 
AAACCCGCTG 
AGCACCACCr 
TTGCCACGGT 
TGACGGTGCA 
CAGAGATGAT 
AGATCTTTGG 
TTGTGGTGAA 
GGCTGGCCTA 



TCATCGCCCG 




Seq ID NO; 370 Protein sequence 
Protein Accession #: NP 005320.1 



60 
120 
180 
240 
300 



51 
1 

TGCCCTGGCA 
CACGGAAAAG 
CATTCAGAGC 
GAAOCIGCCC 
CCCTGACTAC 
GOTGGTCATG 
CGAGGTGCTG 
GGCAGGCGAG 
GGTGCGGGCC 
GTACACGGCC 
CACTGTGCTG 
AGTAGGGGGA 
CCTGAGCAGC 
TGGCTGTGTG 
GTTCCTGGAG 
CCGGCAOCTC 
CAAGTGCCTC 
GAGCAAGTCr 
CTGGATGACC 
TATACAGCTT 



1 
I 

MPVQIjTTALR 
LFAF1J3HRHM 
WDGUHQEDA 
STPSCIMQKW 
VGGDVQITJiK 
DWYHQKFLGS 
YFREWLYH3L 
IIKATYACFXj 
E*IPVSIWVAV 
KKPEQYSIAF 



11 
1 

WGTSLFALA 
RRAGQALKLP 
YMIiDlFHEVL 
GGKKEVHYTA 
YDSWIEFLBS 
XCSFGDDRHIi 
WFHKHKLWMT 
RGKAEMIJPM9 
UjEGLAYTAY 
AEV 



21 
I 

VLGGILAAYV 
9PRRGSVALC 
GGTEQAGFFV 
FKALGDSVDY 
VRYHMAFNVB 
1HRVLSLGYR 
YBSWTGFFP 
LYSliLYKSBL 
CQDtiF93TPLi 



31 
I 

TGYQFIHTJ2K 
IAAYQEDPBY 
HRSN FHRAG E 
IQVCDSDTVL 
RAOQSYFGCV 
TKYTAHSKCL 
FFklATVTQIi 
IjPAKZFAIAT 
AFLVSGAIIiY 



41 

I 

HYliSFGLYGA 
LRKCLRSAQR 
GETEASLQEG 
DPACTIBMLR 
QCISGPLGHY 
TETFTKYLRW 
FYRGRIWNJL 



60 
120 
180 
240 
300 
360 
420. 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
X440 
1500 
1560 
1620 



GCYHVALLKI* 



51 
I 

lUGraiililQS 
IBFPDWCWM 
MDRVHDWRA 
VLEEDPQVGG 
RNSLLQQFLE 
LNQQTRWaKS 
UIiI/TVQLVG 
RKTIWKFIG 
YIiAIlARRCG 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 



Seq ID NOi 371 UNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-7095 

1 11 21 31 41 51 

) 1 1 1 I I 

CACACATACa CACGCACGAT CTCACXTCGA TCTATACACT GGAGGATTAA AACAAACAAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 
CDGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 
CAGCTCCTCT GTQTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 
CTTGTTGAAG AGATTGGCTG OTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 



60 
120 
180 
240 
300 
360 
42 0 



1094 



WO 03/042661 



PCT/US02/36810 



AACACATTCA TT CAT AACA.C TQGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAQA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCRCTGGGGA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATRSTTTAQ AAGGACAAAA ATTTCCACTT 600 
GAOATGCAAA TCTACTGCTT TGATGCGGAC OGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
5 GGAAAAGGGA AGTTAAGAGC TT TAT C CATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC B40 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAQAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGQCT GTTTTTTGTG AAGTTCTTAC ARTGCAACAA 960 
10 TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAQ 102 0 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAQTTTGT 10 SO 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
15 GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 13S0 
AATCCTGAAC TTGATCTTTT COCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGGGAA AAGACATTGA AGAAGGGGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1S00 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCAjCAA CACACTACAA TCGCATAGGG 1B60 
20 ACGAAATACA ATGAAGCCAA QACTAACCGA TCCCCAAGAA GAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGT ACT T CAQCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTQTCGG GGACTGCAQA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG i860 
25 AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 192 0 
GCAACTTCTG CTATCCCATT CATCTCIGAG AACATATCCC AAGGGTATAT ATTTTOCTCC I960 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 21 6 0 
30 AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGOCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATQCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AAOGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
OTATACAATG GTGAGACACC rCTTCAACCT TCCTACRGTA GTGAAGTCTT TCCTCTAGTC 2460 
35 ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2SB0 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG QTGATTTGCT ATTAGAGCCC 2760 
40 AGCCTTGCTC AGTATTCTGA TGTGCTGTCC AjCTACTCATG CTGCTTCAGA GACGCTGGAA 2620 
TTTGGTAGTa AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 
AGCAGTGATG CCATQATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 
GATAATGAGG GCTOCCAACA C7VTCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGOG GCCCTAGCCA TATACCAATA 3060 
45 CCTAAGTCTT OGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGGCCTCTCT 3120 

GQTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 31B0 
GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 
50 CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CICTGTTTOC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAG ACATACTGTC 3S40 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTGAA CTCAGCTCTT ATTTTATGAG 3660 
55 ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCGAGTG ATCCAATATT GGTTGAAACC 3780 
CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 
AGTGAAAACA TGCTGCACTC TACATCTGTA CCZAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 
ATGCACTCTG CTTCACTTCA. AGGTTTGACC ATTTCCTATQ CAAGTGAGAA ATATGAACCA 3960 
60 GTTTTGTTAA AAAGXGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGQCCAATTT GGAGATTAAC CftGGCOCATC CCCCAAAAGG AAGGCATGTA 4080 
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA T AAQC TTATA 4140 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 
ATTCCAACAG TTGCTTCTQA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 
65 GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 
TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 43 BO 
TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 
AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 
CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 
70 CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 
TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 
TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 
CCTCTCftGCC CTGAATCTAA AGCATGGGCA GTTCIGACAA GTGATGAAOA A AGTGGA TCA 4800 
GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCaCAGATTT CAGTTTTGCA 4860 
75 GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCXZAGGTT 4980 
TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 
GAATCCGAGA AGAAGGCAGT TATAOCCCTT QTGATCGTGT CAGCCCTGAC TXTTATCTGT 5100 
. _ CTAGTGGTTC TTGTGGGTAT TCTC3VTC T AC TGQAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

80 TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAQATG ATGTCGGAGC AATTOCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 
CACAAGAATC GATACATAAA TATQ3TTGCC TATGATCATA GCAGGGTTAA GCEAGCACAG 5460 
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CTTGCTGAAA AGGATGGCAA ACTGACTGAT TAT AT CAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAQCTTATAT TGCTGCOCAA GGCCCACTGA AATCCACAGC TGAAGATTTC S5Q0 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGQAAGGA GAAAATGTGA TCAGTACTGG CCTGCOGATG GGAGTGAGGA GTACGGGAAC 5700 

5 TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA G A.C CCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC OCTGCCAGTG 5880 

CTGACCTTTG TOAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 594D 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

10 CAGATTCAAC ACGAAGOAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTOCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 63 00 

15 AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTQGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATBG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACOC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 64 BO 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

20 ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCOCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 67B0 

GCTGCCAA.TA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GAOGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGOATGTT 6900 

25 TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAOAATCCA 7020 

TCCACCICTC TGGACAGTAA TGGTGCAGCA TXGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTQTTAT CTGTTGATTT CCCATCACCT 7200 

30 GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAAT GTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTOAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG "XGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATOTTTG TTTTTAGTQT CAAATTTTTA 7440 

GCTGTATrTG TAG CAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 75 00 

35 AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7 660 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7Q00 

40 GAAATACCTT CATTTTBAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTOCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 

A - Seq ID K0* 372 Protein secrasncei 
45 Protein Accession ft: built from XP_031379 

1 11 21 31 41 51 

111111 
MRIItKRFX«Ae IQI1L1CVCRLD wangyyrqqr klvbeigmsy tgalnqjmhg kkyptcnspk 

50 QSPINIDEDL TQWVSUOCI. KPQGWDKTSI* ENTFIENTGK TVEINIiTNDY RVSGGVBEMV 
EKASKITFHM GKCKMS8DGS EHSLEGQKFP LEMQIYCEDA DRFSSFEEAV KGKGKLRAL9 
XIiFEVGTEEH LDFKAIIDGV ESVSRFGKQA AUDPFILLNL LPNSTDKYYI YNGSttTSPPC 
TDTVDW3VPK DTVSISESQI. AVFCBVLTKQ QSGYVHLMDY LQNNFRSQQY KFSRQVFSSY 
TGKEEIHEAV CSSEPENVQA DPENYTSLliV TWERPRWYD TMIEKFAVI.Y QQLBGEDQTK 
55 HEFJjJTJGYQD LGAILNWUjP NMSYVLQIVA rCTNGLYGKV SDQE»rvHMPT DNPELDLFPE 
LIGTEBIIKB EKBGKD IEEG AIVHPGRDSA TNQIRKKEPQ I6TTTHYNRX GTKYNEAKTN 
RSPTRGSEFS GKGDVPNTSL NSTBQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 
GSKTVLRSPH MNLSGTAESL NTV9ITEYEB BSLLTSFKLD TGAEDSSGSS PATSAIPFI8 
EKTSCGYIFS SENPBTITTD VLiIPESAHKA 9EDSTSSGSE ESUCDPSMBG KVHFPSSTDl 
60 TAOPBVG3GR ESFliQTNYTPJ IRVOESEKTT KSFSAGFVMS QGPBVTDI>EM PHYSTFAYPP 
TEVTPBAFTP SSRQ0DI*VfiT VNWYSQTTQ PVYUGETPLQ PSYSSEVFPL VTPI*Ll£lNQI 
LNTTPAASQS DSALKATPVF PSVDtfSFESI I>SSYDGAPLL PFSSASFSfiE LFRHLHTVBQ 
ILPOVTSATB SDKVELHASI, PVAGGOLLLE PSUVQY3DVL STTHAASBTL EFGSESGVtiY 
UTIMPBQVEP PSSDAKMKAR SSGPEPSYAb SDKEGSQHIF TVSYSSAIPV HDSVGVTYQG 
65 SLFSGPSHIP IPKSSLITPT ASI*LQFTHAL SGDGEHSGAS SDSEFLLiPDT EX3LTALNISS 

PVSVAEFTYT TSVFGOTNKA LfiKSEIIYGW ETKLQIPSFN EMVYPSESTV MPNMYDNVNK 
LNA8LQETSV SISffTKGMFP GSLAHTTTKV FDHEISQVPE UNFSVQPTHT VSQASGDTSE. 
KPVIiSANSBP ABSDPASSEM ItSPBTQLLFY BTSASFBTEV I&QPSFQASD VDTT>LKTVIiP 
AVPSDPILVB TPKVDKISST MIiHLrVSNSA SSENMIiHSTS VPVFDVSPT3 HMHeTiSLQGlj 
70 TISYASEJCjnS PVLI.K8ESSH QWPSIjYSND ELFQTAHLEI NQAHPPKGHH VFATPVL91D 
EPIiNTIiINKIi IHSDEILTfiT KBBVTGKVFA GIPTVA3DTF VSTDHEVPIG NGEVAITAVS 
PHRDGSVTST KUjFPSKATS ETjSHSAKSDA GLVGGGBDGD TDEEGDEDDD DRG8PGLSIE 
KCMSCSSYRS SQFKVMND6D THENSLMDQH NPISYSLSEN SEEDNRVTSV SSDSQTGHDR 
„ _ 3PGKSPSAKG tfSQKHNDGKE EBTOIQTGSAL LFLSPESKAW AVT.TSDEESG SGQGTSDSLAT 

75 ENETSTDFSF ADTNEKDADG IIiAAGDSEIT PGFPQSPTSS VTBENSEVFB VSBAEASHSS 

RESRIGXiAEG LESEKKAVIP LVIVfiALTFI CLWItVGILI YHRKCFQTAH FYLEDSTSPR 
VISTPPTPIF PISDDVGAIP IKEPPKHVAD JDHAS8GFTEE FETLKEFYQE VQSCTVDLG1 
TADS8NBPDN KHKMRYTNTV AYOH8RVKLA QLAEKDGKLT DYIKAKXVDG YNRPKAY1AA 
a _ QGPLXSTAED FHEMIWBHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFXVTQRSVQ 

80 VIAYYTVHHF TLRNTKIKKG SQRGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 

AKRHAVGPW VH.CBAGVQRT GTYIVIjBEMIi QQIQHEGTVN IFGFLKHLHS QRNYLVQTEE 
QYVFIUDTLV EAII»SKETSV LDSHIHAYVN ALLIPGPAGK TKIrERQFQLL BQSNIQQfiDY 
3AALKQCNRE KNRTS9IIPV E RSRVGX BSL SGEGTDYINA 3YIMGYYQSN EFIITQHPIJ* 
HTIKDFWRMI WDHNAQI>VW5 IPDGQNMAED EFVYWPKKDE PINCE9FKVT LMAEEHKCLS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7BO 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
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HEEK&3IQDF ILEATQDDYV DEVJIHFQCPK HPNPDSPISK TFELISVIKE EAANRDGFMI 
VHDEHQGVTA GTFCAliTTIiM HQI*EKENSVD VYQVAKMINL MRFGVFADIE OYQFLYKVIL 
SLVSTRQEEM P8TSI*DSNGA ALPDGMIAES UBSLV 

6eq ID NOi 3 7j dha aequence 

Nucleic Acid Accession tt= built from NM_002B51 
148-4518 



2220 
2260 



5 

Coding sequence: 

. _ 1 11 21 31 41 51 

10 i i ; i i 1 

CACACATACG CACGCACGAT CTCACTTQ3A TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTOG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
COGCGAGGGG CCGCAGACCG TCTGGAAATG COAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATQGAT ACTACAOACA ACAGAGAAAA 240 
15 CTTGTTGAAG agattggctg GTCCTATACA GGAGCACTOA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CX3CAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTC&GCGGAG GAGTTTCAQA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
20 AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
QAOATGCAAA TCTACTGCTT TGATGCGGAC COATTTTCAA GTTTTGAGGA AGCAQTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TT3TTTGAGG TTGGGACAQA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCQAA AGTGTTAGTC GTTTTGGGAA GCAGGClGCT 780 
TTAGATCCAT TCATACTGTT BAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC B40" 
25 AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT OGAAAGGAAG AGATTCATGA AGCAOTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATAGCAPCCT TCTTGTTACA 1140 
30 TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATQGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACOCAAT ATGAGTTATQ TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAAT0 a CTTAT ATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 13 BO 
AATCCTGAAC TTGATCTTTT OCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
35 GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC C7GGTAGAGA CAGTGCTACA 1S00 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 156 0 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
40 GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTDQa GGACTGCAGA ATCCTIAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG I860 
AOTTTATTGA OCAGTTTCAA GCTTGATACT GGAGCTGAA3 ATTCTTCRGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCRTT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 19 B0 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAQ AATCTGCTAG AAATGClTCC 204 0 
45 GAAGATTCAA CTTCAICAGG TTCAOAAGAA TCACTAAAGG ATCGTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCICTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA OGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTtlCTG CAOGCCCAGT GATGTCACAG GGTCGCTCAG TTACAGATCT GQAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATOC 2340 
50 TCGAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG C3VGAGGOCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGQG 2460 
TTGGAATCOG AGAAOAAGGC AGTTATACCC CTTGTGATCQ TGTCAGCCCT QACTTTTATC 2520 
TGTCTAGTGG TTCTTGTGGQ TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 
TTTTACTTAG AGGACAGTAC ATCCCCEAGA GTTATATCCA CACCTOCAAC ACCTATCTTT 2640 
55 CC2AAtTTTCAG ATGATGTCQG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTEGCAGAT 2700 
TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAQACAC TGAAAGAjQTT TTAGCAGGAA 2760 
GTGCAGAGCT GTACTGTTOA CTTAGGTATT ACAGCAGACA GCTCCAACCA COCACACAAC 2 820 
AAGCACAAGA ATCGATACAT AAATATCGTV GCGTATGATC ATAGCAGGOr TAAGCTAGCA 28 B0 
CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATQCCAATTA TGTTGATGGC 2940 
60 TACAACAQAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 
TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGAXAAC AAACCTCGTG 30 60 
GAGAAAGGAA GGAGAAAATG TGATCAGTAC TOQCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 
AACTTTCXGG TCACTCAGAA GAGTGTOCAA GTGCTTGGCT ATTATACTGT GAGGAATTTT 3180 
ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACOCAG TGGACGTGlG 3240 
65 GTCAGACAGT ATCACTACAC GCAGTQGCCT GACATQGGAG TACCAGAGTA CXCCCTGCCA 33O0 
OTGCTGACCT TTGTGAGAAA GGCAGOCTAT QCCAAGOGCC ATGCAGTGGG GCCTGTTGTC 3360 
GTOCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATAlA TTGTGCTAGA CAGTATCTTG 3420 
C31GCAGATTC AACACGAAOG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCOTTCA 3480 
CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 
70 GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCA3XSC CTATGTTAAT 3600 
GCACrCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCICCTG 3660 
AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAQCAATG CAACRGGOAA 3720 
AAGAATOGAA CTTCTTCTAT CATCCCTQTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 
AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA OCAGAGCAAT 3840 
75 GAATTCATCA TTACCCAGCA OOCTCTC3CTT CATAOCATCA AGGATTTCTG GAGGATGATA 3900 
TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GOCAAAACAT GGCAGAAGAT 3960 
GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAA1T GTGAGAGCTT TAAGGTCACT 4020 
CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 
0 ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTXTCA GTGTCCTAAA 4140 

80 TGGOCAAATC CAGATAGCCC CATTAGTAAA ACTTTTOAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA ATAGGGATGG QCCTATGATT QTlCATGATG AGCATGGAGG AGTOAD3QCA 4260 
GQAA CTTTCr GTGCTCTGAC AAGCCTTATG CACCAACTAG AAAAAGAAAA TTCGGTGGAT 4320 
GTTTACCAGG XAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTQAG 43 80 
CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 
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